1. 25 Jan, 2014 4 commits
    • Ronald S. Bultje's avatar
      vp9/x86: use explicit register for relative stack references. · c9e6325e
      Ronald S. Bultje authored
      Before this patch, we explicitly modify rsp, which isn't necessarily
      universally acceptable, since the space under the stack pointer might
      be modified in things like signal handlers. Therefore, use an explicit
      register to hold the stack pointer relative to the bottom of the stack
      (i.e. rsp). This will also clear out valgrind errors about the use of
      uninitialized data that started occurring after the idct16x16/ssse3
      optimizations were first merged.
      c9e6325e
    • Ronald S. Bultje's avatar
      vp9/x86: iwht4x4 (lossless) mmx. · 97474d52
      Ronald S. Bultje authored
      97474d52
    • Ronald S. Bultje's avatar
      vp9/x86: 4x4 iadst SIMD (ssse3) variants. · d43efa68
      Ronald S. Bultje authored
      Cycle measurements for intra itxfm_4x4_add on ped1080p.webm:
      idct_idct:    66 -> 67 cycles (noise measurement)
      idct_iadst:  199 -> 79 cycles
      iadst_idct:  165 -> 70 cycles
      iadst_iadst: 183 -> 82 cycles
      d43efa68
    • Ronald S. Bultje's avatar
      vp9/x86: 8x8 iadst SIMD (ssse3/avx) variants. · baf47020
      Ronald S. Bultje authored
      Cycle measurements for intra itxfm_8x8_add on ped1080p.webm:
      idct_idct:   133 -> 135 cycles (noise measurement)
      idct_iadst:  900 -> 241 cycles
      iadst_idct:  864 -> 215 cycles
      iadst_iadst: 973 -> 310 cycles
      baf47020
  2. 24 Jan, 2014 30 commits
  3. 23 Jan, 2014 6 commits