• Lynne's avatar
    aarch64/opusdsp: implement NEON accelerated postfilter and deemphasis · 4d2f6215
    Lynne authored
    153372 UNITS in postfilter_c,   65536 runs,      0 skips
    73164 UNITS in postfilter_neon,   65536 runs,      0 skips -> 2.1x speedup
    
    80591 UNITS in deemphasis_c,  131072 runs,      0 skips
    43969 UNITS in deemphasis_neon,  131072 runs,      0 skips -> 1.83x speedup
    
    Total decoder speedup: ~15% on a Raspberry Pi 3 (from 28.1x to 33.5x realtime)
    
    Deemphasis SIMD based on the following unrolling:
    const float c1 = CELT_EMPH_COEFF, c2 = c1*c1, c3 = c2*c1, c4 = c3*c1;
    float state = coeff;
    
    for (int i = 0; i < len; i += 4) {
        y[0] = x[0] + c1*state;
        y[1] = x[1] + c2*state + c1*x[0];
        y[2] = x[2] + c3*state + c1*x[1] + c2*x[0];
        y[3] = x[3] + c4*state + c1*x[2] + c2*x[1] + c3*x[0];
    
        state = y[3];
        y += 4;
        x += 4;
    }
    
    Unlike the x86 version, duplication is used instead of pslldq so
    the structure and tables are different.
    4d2f6215
Name
Last commit
Last update
compat Loading commit data...
doc Loading commit data...
ffbuild Loading commit data...
fftools Loading commit data...
libavcodec Loading commit data...
libavdevice Loading commit data...
libavfilter Loading commit data...
libavformat Loading commit data...
libavresample Loading commit data...
libavutil Loading commit data...
libpostproc Loading commit data...
libswresample Loading commit data...
libswscale Loading commit data...
presets Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.travis.yml Loading commit data...
CONTRIBUTING.md Loading commit data...
COPYING.GPLv2 Loading commit data...
COPYING.GPLv3 Loading commit data...
COPYING.LGPLv2.1 Loading commit data...
COPYING.LGPLv3 Loading commit data...
CREDITS Loading commit data...
Changelog Loading commit data...
INSTALL.md Loading commit data...
LICENSE.md Loading commit data...
MAINTAINERS Loading commit data...
Makefile Loading commit data...
README.md Loading commit data...
RELEASE Loading commit data...
configure Loading commit data...