libavcodec/arm/dca.h · 56c2f37727015212a404cae0f444d8bc8704d691 · Linshizhi / ffmpeg.wasm-core

dcadec: simplify decoding of VQ high frequencies · 4cb69642

Christophe Gisquet authored Feb 14, 2014

The vector dequantization has a test in a loop preventing effective SIMD
implementation. By moving it out of the loop, this loop can be DSPized.

Therefore, modify the current DSP implementation. In particular, the
DSP implementation no longer has to handle null loop sizes.

The decode_hf implementations have following timings:

For x86 Arrandale:
        C  SSE SSE2 SSE4
win32: 260 162  119  104
win64: 242 N/A   89   72

The arm NEON optimizations follow in a later patch as external asm. The
now unused check for the y modifier in arm inline asm is removed from
configure.

4cb69642

dca.h 3.23 KB

Replace dca.h