1. 21 Feb, 2017 6 commits
  2. 20 Feb, 2017 11 commits
  3. 19 Feb, 2017 21 commits
  4. 18 Feb, 2017 2 commits
    • James Darnley's avatar
      avcodec/h264: sse2, avx h luma mbaff deblock/loop filter · 53368878
      James Darnley authored
      x86-64 only
      
      Yorkfield:
      - sse2: ~2.17x (434 vs. 200 cycles)
      
      Nehalem:
      - sse2: ~2.94x (409 vs. 139 cycles)
      
      Skylake:
      - sse2: ~3.10x (370 vs. 119 cycles)
      - avx:  ~3.29x (370 vs. 112 cycles)
      53368878
    • James Darnley's avatar
      x86util: import MOVHL macro · 7627df15
      James Darnley authored
      Originally committed to x264 in 1637239a by Henrik Gramner who has
      agreed to re-license it as LGPL.  Original commit message follows.
      
          x86: Avoid some bypass delays and false dependencies
      
          A bypass delay of 1-3 clock cycles may occur on some CPUs when transitioning
          between int and float domains, so try to avoid that if possible.
      7627df15