- 11 Sep, 2019 1 commit
-
-
James Almer authored
It's not using ymm registers, so limiting it to CPUs with fast AVX is not necessary. Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 31 Mar, 2019 1 commit
-
-
Lynne authored
58893 decicycles in deemphasis_c, 130548 runs, 524 skips 9475 decicycles in deemphasis_fma3, 130686 runs, 386 skips -> 6.21x speedup 24866 decicycles in postfilter_c, 65386 runs, 150 skips 5268 decicycles in postfilter_fma3, 65505 runs, 31 skips -> 4.72x speedup Total decoder speedup: ~14% Deemphasis SIMD based on the following unrolling: const float c1 = CELT_EMPH_COEFF, c2 = c1*c1, c3 = c2*c1, c4 = c3*c1; float state = coeff; for (int i = 0; i < len; i += 4) { y[0] = x[0] + c1*state; y[1] = x[1] + c2*state + c1*x[0]; y[2] = x[2] + c3*state + c1*x[1] + c2*x[0]; y[3] = x[3] + c4*state + c1*x[2] + c2*x[1] + c3*x[0]; state = y[3]; y += 4; x += 4; }
-
- 10 Jan, 2019 1 commit
-
-
Paul B Mahol authored
-
- 17 Jan, 2016 1 commit
-
-
James Almer authored
Reviewed-by:
Paul B Mahol <onemda@gmail.com> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 06 Apr, 2014 1 commit
-
-
Janne Grunau authored
8% faster h264 decoding on Apple A7.
-
- 05 Feb, 2013 1 commit
-
-
Diego Biurrun authored
-
- 21 Dec, 2012 1 commit
-
-
Michael Niedermayer authored
These are all part of splited out dsp utils from FFmpeg Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 20 Dec, 2012 1 commit
-
-
Ronald S. Bultje authored
Move some functions from dsputil. The idea is that videodsp contains functions that are useful for a large and varied set of video decoders. Currently, it contains emulated_edge_mc() and prefetch(). Signed-off-by:
Luca Barbato <lu_zero@gentoo.org>
-
- 25 Apr, 2012 1 commit
-
-
Mans Rullgard authored
This is a preparation for complete ARMv6 optimisations. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 22 Apr, 2012 1 commit
-
-
Mans Rullgard authored
This allows masking CPU features with the -cpuflags avconv option which is useful for testing different optimisations without rebuilding. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 03 Mar, 2012 1 commit
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 02 Mar, 2012 1 commit
-
-
Ronald S. Bultje authored
On 64bit platforms with 32bit int, this means we won't have to sign- extend the integer anymore.
-
- 06 Feb, 2012 1 commit
-
-
Diego Biurrun authored
-
- 02 Oct, 2011 1 commit
-
-
Ronald S. Bultje authored
From 52.503s (~40fps) to 27.973sec (~80fps) decoding of 480p sintel trailer, i.e. a ~2x speedup overall, on a Nexus S. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 19 Mar, 2011 1 commit
-
-
Mans Rullgard authored
Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 09 Feb, 2011 1 commit
-
-
Mans Rullgard authored
This adds NEON optimised versions of all functions in VP8DSPContext. Based on initial work by Rob Clark. Signed-off-by:
Mans Rullgard <mans@mansr.com> (cherry picked from commit a1c1d3c0)
-
- 07 Feb, 2011 1 commit
-
-
Mans Rullgard authored
This adds NEON optimised versions of all functions in VP8DSPContext. Based on initial work by Rob Clark. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-