• Janne Grunau's avatar
    arm64: port synth_filter_float_neon from arm · 705f5e5e
    Janne Grunau authored
    ~25% faster dts decoding overall. The checkasm CPU cycles numbers are
    not that useful since synth_filter_float() calls FFTContext.imdct_half().
    
                             cortex-a57   cortex-a53
    synth_filter_float_c:    1866.2       3490.9
    synth_filter_float_neon:  915.0       1531.5
    
    With fftc.imdct_half forced to imdct_half_neon:
                             cortex-a57   cortex-a53
    synth_filter_float_c:    1718.4       3025.3
    synth_filter_float_neon:  926.2       1530.1
    705f5e5e
Name
Last commit
Last update
..
Makefile Loading commit data...
asm-offsets.h Loading commit data...
cabac.h Loading commit data...
dcadsp_init.c Loading commit data...
dcadsp_neon.S Loading commit data...
fft_init_aarch64.c Loading commit data...
fft_neon.S Loading commit data...
h264chroma_init_aarch64.c Loading commit data...
h264cmc_neon.S Loading commit data...
h264dsp_init_aarch64.c Loading commit data...
h264dsp_neon.S Loading commit data...
h264idct_neon.S Loading commit data...
h264pred_init.c Loading commit data...
h264pred_neon.S Loading commit data...
h264qpel_init_aarch64.c Loading commit data...
h264qpel_neon.S Loading commit data...
hpeldsp_init_aarch64.c Loading commit data...
hpeldsp_neon.S Loading commit data...
imdct15_init.c Loading commit data...
imdct15_neon.S Loading commit data...
mdct_neon.S Loading commit data...
mpegaudiodsp_init.c Loading commit data...
mpegaudiodsp_neon.S Loading commit data...
neon.S Loading commit data...
neontest.c Loading commit data...
rv40dsp_init_aarch64.c Loading commit data...
synth_filter_neon.S Loading commit data...
vc1dsp_init_aarch64.c Loading commit data...
videodsp.S Loading commit data...
videodsp_init.c Loading commit data...
vorbisdsp_init.c Loading commit data...
vorbisdsp_neon.S Loading commit data...