• Martin Storsjö's avatar
    arm/aarch64: vp9lpf: Calculate !hev directly · f0ecbb13
    Martin Storsjö authored
    Previously we first calculated hev, and then negated it.
    
    Since we were able to schedule the negation in the middle
    of another calculation, we don't see any gain in all cases.
    
    Before:                     Cortex A7      A8      A9     A53  A53/AArch64
    vp9_loop_filter_v_4_8_neon:     147.0   129.0   115.8    89.0         88.7
    vp9_loop_filter_v_8_8_neon:     242.0   198.5   174.7   140.0        136.7
    vp9_loop_filter_v_16_8_neon:    500.0   419.5   382.7   293.0        275.7
    vp9_loop_filter_v_16_16_neon:   971.2   825.5   731.5   579.0        453.0
    After:
    vp9_loop_filter_v_4_8_neon:     143.0   127.7   114.8    88.0         87.7
    vp9_loop_filter_v_8_8_neon:     241.0   197.2   173.7   140.0        136.7
    vp9_loop_filter_v_16_8_neon:    497.0   419.5   379.7   293.0        275.7
    vp9_loop_filter_v_16_16_neon:   965.2   818.7   731.4   579.0        452.0
    
    This is cherrypicked from libav commit
    e1f9de86.
    Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
    f0ecbb13
Name
Last commit
Last update
..
Makefile Loading commit data...
asm-offsets.h Loading commit data...
cabac.h Loading commit data...
fft_init_aarch64.c Loading commit data...
fft_neon.S Loading commit data...
fmtconvert_init.c Loading commit data...
fmtconvert_neon.S Loading commit data...
h264chroma_init_aarch64.c Loading commit data...
h264cmc_neon.S Loading commit data...
h264dsp_init_aarch64.c Loading commit data...
h264dsp_neon.S Loading commit data...
h264idct_neon.S Loading commit data...
h264pred_init.c Loading commit data...
h264pred_neon.S Loading commit data...
h264qpel_init_aarch64.c Loading commit data...
h264qpel_neon.S Loading commit data...
hpeldsp_init_aarch64.c Loading commit data...
hpeldsp_neon.S Loading commit data...
mdct_neon.S Loading commit data...
mpegaudiodsp_init.c Loading commit data...
mpegaudiodsp_neon.S Loading commit data...
neon.S Loading commit data...
neontest.c Loading commit data...
rv40dsp_init_aarch64.c Loading commit data...
synth_filter_init.c Loading commit data...
synth_filter_neon.S Loading commit data...
vc1dsp_init_aarch64.c Loading commit data...
videodsp.S Loading commit data...
videodsp_init.c Loading commit data...
vorbisdsp_init.c Loading commit data...
vorbisdsp_neon.S Loading commit data...
vp9dsp_init.h Loading commit data...
vp9dsp_init_10bpp_aarch64.c Loading commit data...
vp9dsp_init_12bpp_aarch64.c Loading commit data...
vp9dsp_init_16bpp_aarch64_template.c Loading commit data...
vp9dsp_init_aarch64.c Loading commit data...
vp9itxfm_16bpp_neon.S Loading commit data...
vp9itxfm_neon.S Loading commit data...
vp9lpf_16bpp_neon.S Loading commit data...
vp9lpf_neon.S Loading commit data...
vp9mc_16bpp_neon.S Loading commit data...
vp9mc_neon.S Loading commit data...