- 13 Oct, 2015 3 commits
-
-
Christophe Gisquet authored
This avoids going through constants.c while still sharing the constants with proresdsp.asm.
Reviewed-by: "Ronald S. Bultje" <rsbultje@gmail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
-
Christophe Gisquet authored
Modeled on the prores version. Clips to [0;1023] and is bitexact. Bitexactness requires adding offsets in different places compared to prores or C, and makes the function approximately 2% slower.
For 16 frames of a DNxHD 4:2:2 10-bit test sequence:
C: 60861 decicycles in idct, 1048205 runs, 371 skips
sse2: 27567 decicycles in idct, 1048216 runs, 360 skips
avx: 26272 decicycles in idct, 1048171 runs, 405 skips
The add version is not implemented, so the corresponding dsp function is set to NULL to make the omission obvious to any code that tries to execute it.
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
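As a rough illustration of the clipping mentioned in this commit message, the C sketch below clamps IDCT output to the 10-bit range [0;1023] while storing it. The names clip_10bit and put_clipped_row are hypothetical and do not come from the FFmpeg source. The function the commit describes is SSE2/AVX code, so this only models the arithmetic of a "put" style store.

```c
#include <stdint.h>

/* Hypothetical helper: clamp a value to the 10-bit range [0, 1023]. */
static uint16_t clip_10bit(int32_t v)
{
    if (v < 0)
        return 0;
    if (v > 1023)
        return 1023;
    return (uint16_t)v;
}

/* Hypothetical "put" style store: write n clipped samples to dst. */
static void put_clipped_row(uint16_t *dst, const int32_t *src, int n)
{
    for (int i = 0; i < n; i++)
        dst[i] = clip_10bit(src[i]);
}
```

An "add" variant would add the IDCT output to the existing destination samples before clipping. The commit states that this variant is not implemented.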
-
Christophe Gisquet authored
When the input of a pass has 15 or 16 bits of precision (in particular the column pass), adding a bias to W4 may overflow the input to pmaddwd. The bias must therefore be added after the first butterfly instead. This exploits the fact that m15 is zeroed but otherwise unused. If the pass is safe, an address can be used directly and the number of xmm registers can be reduced. Otherwise, the 32-bit bias is loaded into m15.
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
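The overflow described in this commit message can be modeled in scalar C. This is a minimal sketch under stated assumptions, not the code from the commit: pmaddwd_lane models one 32-bit lane of pmaddwd, wrap16 models the wraparound of a packed 16-bit add, and the coefficient 16384 and the bias of 2 used in the usage note are made-up values chosen only to show the effect.

```c
#include <stdint.h>

/* Scalar model of one 32-bit lane of pmaddwd:
 * two signed 16-bit products, summed into 32 bits. */
static int32_t pmaddwd_lane(int16_t a0, int16_t a1, int16_t b0, int16_t b1)
{
    return (int32_t)a0 * b0 + (int32_t)a1 * b1;
}

/* Wrap a value to signed 16 bits, as a packed 16-bit add would. */
static int16_t wrap16(int32_t v)
{
    v &= 0xFFFF;
    return (int16_t)(v >= 0x8000 ? v - 0x10000 : v);
}

/* Bias folded into the 16-bit operand before the multiply.
 * This wraps when x is already close to the 16-bit limit. */
static int32_t bias_before(int16_t x, int16_t w, int16_t bias16)
{
    return pmaddwd_lane(wrap16((int32_t)x + bias16), 0, w, 0);
}

/* Bias added to the 32-bit result after the multiply.
 * The 16-bit operand is left untouched, so it cannot wrap. */
static int32_t bias_after(int16_t x, int16_t w, int32_t bias32)
{
    return pmaddwd_lane(x, 0, w, 0) + bias32;
}
```

For a small input such as x = 100, bias_before(100, 16384, 2) and bias_after(100, 16384, 2 * 16384) agree. For x = 32767, the first wraps to a negative operand and returns a negative result, while the second stays correct. This is the reason for moving the bias to a 32-bit add after the multiply.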
-
- 12 Oct, 2015 1 commit
-
-
Christophe Gisquet authored
This should be reused for a generic simple_idct10 function. Requires a bit of trickery to declare common constants in C.
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
-
- 16 Mar, 2015 2 commits
-
-
James Almer authored
It's exactly the same as the sse2 version.
Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
Signed-off-by: James Almer <jamrial@gmail.com>
-
James Almer authored
Reviewed-by: Michael Niedermayer <michaelni@gmx.at>
Signed-off-by: James Almer <jamrial@gmail.com>
-
- 13 Mar, 2014 1 commit
-
-
Diego Biurrun authored
This helps grepping for functions, among other things.
-
- 23 Jan, 2013 1 commit
-
-
Diego Biurrun authored
It does not help as an abstraction and adds dsputil dependencies.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
-
- 30 Oct, 2012 2 commits
-
-
Diego Biurrun authored
This is more consistent with the way we handle C #includes and it simplifies the build system.
-
Diego Biurrun authored
This is necessary to allow refactoring some x86util macros with cpuflags.
-
- 31 Aug, 2012 1 commit
-
-
Carl Eugen Hoyos authored
-
- 02 Aug, 2012 1 commit
-
-
Diego Biurrun authored
-
- 27 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
-
- 12 Feb, 2012 1 commit
-
-
Reimar Döffinger authored
%ifdef HAVE_AVX must now be %if HAVE_AVX.
Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>
-
- 27 Jan, 2012 1 commit
-
-
Ronald S. Bultje authored
This allows combining multiple conditionals in a single statement.
-
- 12 Jan, 2012 1 commit
-
-
Carl Eugen Hoyos authored
-
- 12 Oct, 2011 5 commits
-
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Also change Libav to FFmpeg.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
-
Elvis Presley authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 11 Oct, 2011 1 commit
-
-
Ronald S. Bultje authored
~3.0-3.5x as fast as original C version, 1.6x as fast overall.
-