- 23 Dec, 2014 1 commit
-
-
James Almer authored
Fixes "redundant redeclaration" warnings. Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 20 Dec, 2014 1 commit
-
-
James Almer authored
Signed-off-by:
James Almer <jamrial@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 19 Dec, 2014 3 commits
-
-
James Almer authored
Signed-off-by:
James Almer <jamrial@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Ronald S. Bultje authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Ronald S. Bultje authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 16 Dec, 2014 1 commit
-
-
Ronald S. Bultje authored
Fixes build on win32. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 15 Dec, 2014 1 commit
-
-
Ronald S. Bultje authored
Also a slight change to the ssse3 code, which prevents a theoretical overflow in the sharp filter. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 14 Dec, 2014 1 commit
-
-
Ronald S. Bultje authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 11 Dec, 2014 1 commit
-
-
Michael Niedermayer authored
These fix failures of --enable-xmm-clobber-test It would be better to change the code to use fewer registers, but until someone does the used register count must not be too small Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 10 Dec, 2014 1 commit
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 08 Dec, 2014 1 commit
-
-
Michael Niedermayer authored
This fixes artifacts in the last pixel of rows with some widths and pixel formats Found-by:
Dominique Leroux <Dominique.Leroux@autodesk.com> Tested-by:
Dominique Leroux <Dominique.Leroux@autodesk.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 05 Dec, 2014 1 commit
-
-
Kieran Kunhya authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
Vittorio Giovara <vittorio.giovara@gmail.com>
-
- 03 Dec, 2014 2 commits
-
-
Reimar Döffinger authored
Fixes fate failure on macosx clang x86-64 Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
For test images manually generated to contain only up prediction, timing results: 8380x3032 255x185 before: 138635 1992 after: 139232 1996 Actually jumping to the proper version depending on the alignment: 8380x3032: 138767 A 0.5% speed improvement for gigantic images is not worth the code duplication. Fixes ticket #4148 Signed-off-by:
Christophe Gisquet <christophe.gisquet@gmail.com> Tested-by:
Benoit Fouet <benoit.fouet@free.fr> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 26 Nov, 2014 1 commit
-
-
Kieran Kunhya authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 24 Nov, 2014 1 commit
-
-
Vittorio Giovara authored
CC: libav-stable@libav.org Bug-Id: CID 700556 / CID 700557 / CID 700558
-
- 23 Nov, 2014 2 commits
-
-
Carl Eugen Hoyos authored
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 22 Nov, 2014 3 commits
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Reimar Döffinger authored
11674 -> 10877 decicycles on my Phenom II. Overall speedup was unfortunately within measurement error. Signed-off-by:
Reimar Döffinger <Reimar.Doeffinger@gmx.de>
-
- 13 Nov, 2014 1 commit
-
-
James Almer authored
Two to four times faster depending on instruction set, block size and channel count.
-
- 12 Nov, 2014 1 commit
-
-
James Almer authored
Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 23 Oct, 2014 1 commit
-
-
James Almer authored
Handle it inside the __asm__() block. Fixes fate-vc1_ilaced_twomv when using the gcc-usan toolchain. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 08 Oct, 2014 1 commit
-
-
Henrik Gramner authored
The upper halves are not guaranteed to be zero in x86-64. Signed-off-by:
Anton Khirnov <anton@khirnov.net>
-
- 04 Oct, 2014 1 commit
-
-
Mickaël Raulet authored
cherry picked from commit df8ebe304df453f26c28ff8f11d607f49b90a4c2 Fixes out of array access Fixes: asan_stack-oob_1046454_9_asan_stack-oob_15a9e7c_170_WP_MAIN10_B_Toshiba_3.bit Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 03 Oct, 2014 1 commit
-
-
James Almer authored
2x to 2.5x faster than the C version. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 01 Oct, 2014 1 commit
-
-
James Almer authored
~15% faster. Also add an mmxext version that takes advantage of the new code, and build it alongside with the mmx version only on x86_32. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 28 Sep, 2014 1 commit
-
-
Michael Niedermayer authored
Fixes acodec-dca2 fate failure Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 26 Sep, 2014 1 commit
-
-
James Almer authored
About two times faster than the c wrapper. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 25 Sep, 2014 5 commits
-
-
Carl Eugen Hoyos authored
-
James Almer authored
Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
James Almer authored
It may be used by ff_add_pixels_clamped_sse2(). Should fix fate-cavs failures on some systems. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
James Almer authored
Also add sse2 versions for both. put_pixels_clamped port and sse2 version originally written by Timothy Gu. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
James Almer authored
Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 24 Sep, 2014 2 commits
-
-
James Almer authored
Same behavior as in simple_idct. This way the best optimized versions available will be used instead. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
James Almer authored
~15% faster than sse2. Reviewed-by:
Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 23 Sep, 2014 2 commits
-
-
James Almer authored
Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
James Almer authored
Roughly 25% faster MC than ssse3 for blocksizes 32 and 64. Reviewed-by:
Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 19 Sep, 2014 1 commit
-
-
James Almer authored
Also add mmxext versions of vsad8 and vsad_intra8, and sse2 versions of vsad16 and vsad_intra16. Since vsad8 and vsad16 are not bitexact, they are accordingly marked as approximate. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-