1. 09 Feb, 2019 5 commits
  2. 08 Feb, 2019 4 commits
  3. 07 Feb, 2019 3 commits
  4. 06 Feb, 2019 4 commits
  5. 05 Feb, 2019 2 commits
    • Lauri Kasanen's avatar
      libswscale/ppc: VSX-optimize 9-16 bit yuv2planeX · 8522d219
      Lauri Kasanen authored
      ./ffmpeg_g -f rawvideo -pix_fmt rgb24 -s hd1080 -i /dev/zero -pix_fmt yuv420p16be \
      -s 1920x1728 -f null -vframes 100 -v error -nostats -
      
      9-14 bit funcs get about 6x speedup, 16-bit gets about 15x.
      Fate passes, each format tested with an image to video conversion.
      
      Only POWER8 includes 32-bit vector multiplies, so POWER7 is locked out
      of the 16-bit function. This includes the vec_mulo/mule functions too,
      not just vmuluwm.
      
      With TIMER_REPORT skips disabled:
      yuv420p9le
        12412 UNITS in planarX,  131072 runs,      0 skips
        73136 UNITS in planarX,  131072 runs,      0 skips
      yuv420p9be
        12481 UNITS in planarX,  131072 runs,      0 skips
        73410 UNITS in planarX,  131072 runs,      0 skips
      yuv420p10le
        12322 UNITS in planarX,  131072 runs,      0 skips
        72546 UNITS in planarX,  131072 runs,      0 skips
      yuv420p10be
        12291 UNITS in planarX,  131072 runs,      0 skips
        72935 UNITS in planarX,  131072 runs,      0 skips
      yuv420p12le
        12316 UNITS in planarX,  131072 runs,      0 skips
        72708 UNITS in planarX,  131072 runs,      0 skips
      yuv420p12be
        12319 UNITS in planarX,  131072 runs,      0 skips
        72577 UNITS in planarX,  131072 runs,      0 skips
      yuv420p14le
        12259 UNITS in planarX,  131072 runs,      0 skips
        72516 UNITS in planarX,  131072 runs,      0 skips
      yuv420p14be
        12440 UNITS in planarX,  131072 runs,      0 skips
        72962 UNITS in planarX,  131072 runs,      0 skips
      yuv420p16le
        10548 UNITS in planarX,  131072 runs,      0 skips
        73429 UNITS in planarX,  131072 runs,      0 skips
      yuv420p16be
        10634 UNITS in planarX,  131072 runs,      0 skips
       150959 UNITS in planarX,  131072 runs,      0 skips
      Signed-off-by: 's avatarLauri Kasanen <cand@gmx.com>
      8522d219
    • Lauri Kasanen's avatar
      avutil/ppc/cpu: Fix power8 linux detection · fc6022e1
      Lauri Kasanen authored
      The existing code was in no released kernel that I can see. The corrected code
      was added in 3.9.
      fc6022e1
  6. 04 Feb, 2019 5 commits
  7. 03 Feb, 2019 1 commit
  8. 02 Feb, 2019 7 commits
  9. 01 Feb, 2019 4 commits
  10. 31 Jan, 2019 5 commits
    • Carl Eugen Hoyos's avatar
      lavc/aarch64/vp8dsp: Fix the include guard. · ed20fbcd
      Carl Eugen Hoyos authored
      Fixes fate-source.
      ed20fbcd
    • Justin Bull's avatar
      INSTALL.md: Fix Markdown formatting · 22fc9cdd
      Justin Bull authored
      Signed-off-by: 's avatarLou Logan <lou@lrcd.com>
      22fc9cdd
    • Magnus Röös's avatar
      libavcodec: vp8 neon optimizations for aarch64 · 833fed52
      Magnus Röös authored
      Partial port of the ARM Neon for aarch64.
      
      Benchmarks from fate:
      
      benchmarking with Linux Perf Monitoring API
      nop: 58.6
      checkasm: using random seed 1760970128
      NEON:
       - vp8dsp.idct       [OK]
       - vp8dsp.mc         [OK]
       - vp8dsp.loopfilter [OK]
      checkasm: all 21 tests passed
      vp8_idct_add_c: 201.6
      vp8_idct_add_neon: 83.1
      vp8_idct_dc_add_c: 107.6
      vp8_idct_dc_add_neon: 33.8
      vp8_idct_dc_add4y_c: 426.4
      vp8_idct_dc_add4y_neon: 59.4
      vp8_loop_filter8uv_h_c: 688.1
      vp8_loop_filter8uv_h_neon: 216.3
      vp8_loop_filter8uv_inner_h_c: 649.3
      vp8_loop_filter8uv_inner_h_neon: 195.3
      vp8_loop_filter8uv_inner_v_c: 544.8
      vp8_loop_filter8uv_inner_v_neon: 131.3
      vp8_loop_filter8uv_v_c: 706.1
      vp8_loop_filter8uv_v_neon: 141.1
      vp8_loop_filter16y_h_c: 668.8
      vp8_loop_filter16y_h_neon: 242.8
      vp8_loop_filter16y_inner_h_c: 647.3
      vp8_loop_filter16y_inner_h_neon: 224.6
      vp8_loop_filter16y_inner_v_c: 647.8
      vp8_loop_filter16y_inner_v_neon: 128.8
      vp8_loop_filter16y_v_c: 721.8
      vp8_loop_filter16y_v_neon: 154.3
      vp8_loop_filter_simple_h_c: 387.8
      vp8_loop_filter_simple_h_neon: 187.6
      vp8_loop_filter_simple_v_c: 384.1
      vp8_loop_filter_simple_v_neon: 78.6
      vp8_put_epel8_h4v4_c: 3971.1
      vp8_put_epel8_h4v4_neon: 855.1
      vp8_put_epel8_h4v6_c: 5060.1
      vp8_put_epel8_h4v6_neon: 989.6
      vp8_put_epel8_h6v4_c: 4320.8
      vp8_put_epel8_h6v4_neon: 1007.3
      vp8_put_epel8_h6v6_c: 5449.3
      vp8_put_epel8_h6v6_neon: 1158.1
      vp8_put_epel16_h6_c: 6683.8
      vp8_put_epel16_h6_neon: 831.8
      vp8_put_epel16_h6v6_c: 11110.8
      vp8_put_epel16_h6v6_neon: 2214.8
      vp8_put_epel16_v6_c: 7024.8
      vp8_put_epel16_v6_neon: 799.6
      vp8_put_pixels8_c: 112.8
      vp8_put_pixels8_neon: 78.1
      vp8_put_pixels16_c: 131.3
      vp8_put_pixels16_neon: 129.8
      Signed-off-by: 's avatarMagnus Röös <mla2.roos@gmail.com>
      833fed52
    • Carl Eugen Hoyos's avatar
      tests/checkasm/checkasm: Do not define an unused function. · 608572ce
      Carl Eugen Hoyos authored
      Fixes the following warning:
      tests/checkasm/checkasm.c:615:12: warning: 'bench_init_ffmpeg' defined but not used
      608572ce
    • Michael Niedermayer's avatar