1. 20 Dec, 2015 3 commits
  2. 30 Sep, 2015 1 commit
  3. 04 Aug, 2015 1 commit
    • Henrik Gramner's avatar
      x86inc: Drop SECTION_TEXT macro · f0b7882c
      Henrik Gramner authored
      The .text section is already 16-byte aligned by default on all supported
      platforms so `SECTION_TEXT` isn't any different from `SECTION .text`.
      f0b7882c
  4. 26 Jul, 2015 1 commit
  5. 01 Mar, 2015 1 commit
  6. 12 Feb, 2015 1 commit
  7. 09 Feb, 2015 1 commit
  8. 06 Feb, 2015 2 commits
  9. 05 Feb, 2015 2 commits
    • James Almer's avatar
      x86/hevcdsp: add ff_hevc_sao_edge_filter_{10,12}_{sse2,avx2} · 15574c50
      James Almer authored
      Original x86 intrinsics code by Pierre-Edouard Lepere.
      Yasm port, refactoring and optimizations by James Almer.
      
      Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U
      
      Width 32
      342694 decicycles in sao_edge_filter_10, 16384 runs, 0 skips
      29476 decicycles in ff_hevc_sao_edge_filter_32_10_ssse3, 16384 runs, 0 skips
      13996 decicycles in ff_hevc_sao_edge_filter_32_10_avx2, 16381 runs, 3 skips
      
      Width 64
      581163 decicycles in sao_edge_filter_10, 8192 runs, 0 skips
      59774 decicycles in ff_hevc_sao_edge_filter_64_10_ssse3, 8192 runs, 0 skips
      28383 decicycles in ff_hevc_sao_edge_filter_64_10_avx2, 8191 runs, 1 skips
      Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
      15574c50
    • James Almer's avatar
      x86/hevcdsp: add ff_hevc_sao_edge_filter_8_{ssse3,avx2} · 042c1159
      James Almer authored
      Original x86 intrinsics code and initial yasm port by Pierre-Edouard Lepere.
      Refactoring and optimizations by James Almer.
      
      Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U
      
      Width 32
      158583 decicycles in edge, sao_edge_filter_8 runs, 0 skips
      5205 decicycles in ff_hevc_sao_edge_filter_32_8_ssse3, 32767 runs, 1 skips
      2942 decicycles in ff_hevc_sao_edge_filter_32_8_avx2, 32767 runs, 1 skips
      
      Width 64
      705639 decicycles in sao_edge_filter_8, 262144 runs, 0 skips
      19224 decicycles in ff_hevc_sao_edge_filter_64_8_ssse3, 262111 runs, 33 skips
      10433 decicycles in ff_hevc_sao_edge_filter_64_8_avx2, 262115 runs, 29 skips
      Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
      042c1159
  10. 02 Feb, 2015 2 commits
  11. 01 Feb, 2015 2 commits
    • Christophe Gisquet's avatar
      x86: hevc/sao: aligned source buffers · bff7feb3
      Christophe Gisquet authored
      Usefull for at least band filter, for which:
      - Band filter call only:
                 32      64
      Before:  16556    54015
      After:   16497    52355
      - Whole case:
                 32      64
      Before:  37031   103008
      After:   32045    93952
      bff7feb3
    • James Almer's avatar
      x86/hevc: add ff_hevc_sao_band_filter_{8,10,12}_{sse2,avx,avx2} · fa3eccb4
      James Almer authored
      Original x86 intrinsics code and initial 8bit yasm port by Pierre-Edouard Lepere.
      10/12bit yasm ports, refactoring and optimizations by James Almer
      
      Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U
      
      width 32
      40338 decicycles in sao_band_filter_0_8, 2048 runs, 0 skips
      8056 decicycles in ff_hevc_sao_band_filter_8_32_sse2, 2048 runs, 0 skips
      7458 decicycles in ff_hevc_sao_band_filter_8_32_avx, 2048 runs, 0 skips
      4504 decicycles in ff_hevc_sao_band_filter_8_32_avx2, 2048 runs, 0 skips
      
      width 64
      136046 decicycles in sao_band_filter_0_8, 16384 runs, 0 skips
      28576 decicycles in ff_hevc_sao_band_filter_8_32_sse2, 16384 runs, 0 skips
      26707 decicycles in ff_hevc_sao_band_filter_8_32_avx, 16384 runs, 0 skips
      14387 decicycles in ff_hevc_sao_band_filter_8_32_avx2, 16384 runs, 0 skips
      Reviewed-by: 's avatarChristophe Gisquet <christophe.gisquet@gmail.com>
      Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
      fa3eccb4