1. 14 Sep, 2018 1 commit
  2. 01 Aug, 2018 2 commits
  3. 31 Jul, 2018 1 commit
    • Jun Zhao's avatar
      avutil/pixelutils: sad_32x32 sse2/avx2 optimizations. · d36b8394
      Jun Zhao authored
      add ff_pixelutils_sad_32x32_sse2, ff_pixelutils_sad_{a,u}_32x32_sse2,
      ff_pixelutils_sad_32x32_avx22, ff_pixelutils_sad_{a,u}_32x32_avx2
      
      use perf record/report profiling, get instructions:u for avx2 sad_32x32:
      
        72.05%  pixelutils  pixelutils     [.] block_sad_32x32_c
        18.50%  pixelutils  pixelutils     [.] block_sad_16x16_c
         4.78%  pixelutils  pixelutils     [.] block_sad_8x8_c
         2.69%  pixelutils  pixelutils     [.] block_sad_4x4_c
         0.89%  pixelutils  pixelutils     [.] block_sad_2x2_c
         0.16%  pixelutils  pixelutils     [.] ff_pixelutils_sad_32x32_avx2
         0.16%  pixelutils  pixelutils     [.] ff_pixelutils_sad_u_32x32_avx2
         0.12%  pixelutils  pixelutils     [.] ff_pixelutils_sad_a_32x32_avx2
      
      sse2 sad_32x32 instructions:u like:
      
        71.86%  pixelutils  pixelutils     [.] block_sad_32x32_c
        18.42%  pixelutils  pixelutils     [.] block_sad_16x16_c
         4.81%  pixelutils  pixelutils     [.] block_sad_8x8_c
         2.68%  pixelutils  pixelutils     [.] block_sad_4x4_c
         0.88%  pixelutils  pixelutils     [.] block_sad_2x2_c
         0.29%  pixelutils  pixelutils     [.] ff_pixelutils_sad_32x32_sse2
         0.26%  pixelutils  pixelutils     [.] ff_pixelutils_sad_u_32x32_sse2
         0.23%  pixelutils  pixelutils     [.] ff_pixelutils_sad_a_32x32_sse2
      Signed-off-by: 's avatarJun Zhao <mypopydev@gmail.com>
      d36b8394
  4. 19 Jul, 2018 1 commit
  5. 11 Jul, 2018 1 commit
  6. 06 Feb, 2018 1 commit
  7. 20 Jan, 2018 5 commits
  8. 25 Dec, 2017 1 commit
  9. 24 Dec, 2017 4 commits
    • Henrik Gramner's avatar
      x86inc: AVX-512 support · f7197f68
      Henrik Gramner authored
      AVX-512 consists of a plethora of different extensions, but in order to keep
      things a bit more manageable we group together the following extensions
      under a single baseline cpu flag which should cover SKL-X and future CPUs:
       * AVX-512 Foundation (F)
       * AVX-512 Conflict Detection Instructions (CD)
       * AVX-512 Byte and Word Instructions (BW)
       * AVX-512 Doubleword and Quadword Instructions (DQ)
       * AVX-512 Vector Length Extensions (VL)
      
      On x86-64 AVX-512 provides 16 additional vector registers, prefer using
      those over existing ones since it allows us to avoid using `vzeroupper`
      unless more than 16 vector registers are required. They also happen to
      be volatile on Windows which means that we don't need to save and restore
      existing xmm register contents unless more than 22 vector registers are
      required.
      
      Big thanks to Intel for their support.
      f7197f68
    • James Darnley's avatar
      avutil: add alignment needed for AVX-512 · e2218ed8
      James Darnley authored
      e2218ed8
    • James Darnley's avatar
      avutil: detect when AVX-512 is available · 4783a01c
      James Darnley authored
      4783a01c
    • James Darnley's avatar
      avutil: add AVX-512 flags · 8b81eabe
      James Darnley authored
      8b81eabe
  10. 02 Dec, 2017 1 commit
  11. 25 Oct, 2017 1 commit
  12. 09 Oct, 2017 1 commit
  13. 28 Sep, 2017 1 commit
  14. 18 Aug, 2017 1 commit
  15. 27 Jun, 2017 1 commit
  16. 21 Jun, 2017 1 commit
  17. 19 Jun, 2017 1 commit
  18. 12 Jun, 2017 1 commit
  19. 09 Jun, 2017 4 commits
  20. 13 May, 2017 1 commit
  21. 12 Apr, 2017 1 commit
  22. 10 Apr, 2017 1 commit
  23. 21 Mar, 2017 1 commit
  24. 14 Mar, 2017 1 commit
    • Diego Biurrun's avatar
      x86util: Port all macros to cpuflags · 994c4bc1
      Diego Biurrun authored
      Also do some small cosmetic changes: Drop pointless _MMX suffix from ABSD2
      macro name, drop pointless check for MMX support, we always assume MMX is
      available in our SIMD code, fix spelling.
      994c4bc1
  25. 01 Mar, 2017 1 commit
  26. 18 Feb, 2017 3 commits
  27. 03 Feb, 2017 1 commit