- 11 Sep, 2012 1 commit
-
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 10 Sep, 2012 1 commit
-
-
Michael Niedermayer authored
signed overflow is undefined in C ...
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
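The point in the message above can be illustrated with a minimal sketch. This is not the code the commit changed (that code is not shown in this listing); the helper names `add_would_overflow` and `add_wrap_u32` are hypothetical. It shows two common ways to stay clear of undefined behaviour: checking before a signed addition, or doing the arithmetic in an unsigned type, where wraparound is well defined.

```c
#include <limits.h>
#include <stdint.h>

/* Hypothetical helper: returns 1 if a + b would overflow int, else 0.
 * The check itself never performs an overflowing operation. */
static int add_would_overflow(int a, int b)
{
    if (b > 0)
        return a > INT_MAX - b;
    return a < INT_MIN - b;
}

/* Hypothetical helper: addition in unsigned arithmetic, which wraps
 * modulo 2^32 instead of invoking undefined behaviour. */
static uint32_t add_wrap_u32(uint32_t a, uint32_t b)
{
    return a + b;
}
```

Which of the two fits depends on whether the caller wants to reject the overflow or rely on modular wraparound.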
-
- 08 Sep, 2012 2 commits
-
-
Martin Storsjö authored
This reverts parts of e0c6cce4. There is external mmx asm that requires this alignment. This fixes crashes when using swscale in builds with external mmx, without inline assembly.
Signed-off-by: Martin Storsjö <martin@martin.st>
-
Diego Biurrun authored
This separates code relying on inline from that relying on external assembly and fixes instances where the coalesced check was incorrect.
-
- 07 Sep, 2012 1 commit
-
-
Diego Biurrun authored
-
- 04 Sep, 2012 3 commits
-
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 02 Sep, 2012 1 commit
-
-
Hans-Kristian Arntzen authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 01 Sep, 2012 1 commit
-
-
Carl Eugen Hoyos authored
-
- 31 Aug, 2012 1 commit
-
-
Carl Eugen Hoyos authored
-
- 30 Aug, 2012 4 commits
-
-
Diego Biurrun authored
-
Diego Biurrun authored
-
Michael Niedermayer authored
Fixes Ticket1702
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 28 Aug, 2012 2 commits
-
-
Ronald S. Bultje authored
This introduces support for width%4==2 in addition to width%4==0. For odd widths, some more checks are needed, since the current code always handles two luma items in a row, thus there is a possibility of an overread by one.
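The overread concern described above can be sketched as follows. The function the commit modified is not shown in this listing, so this is only an illustration under stated assumptions: `copy_luma_pairs` is a hypothetical helper that handles two luma samples per iteration and treats a trailing odd sample separately so that it never touches `src[width]`.

```c
#include <stdint.h>

/* Hypothetical helper: copies `width` luma samples, two per iteration.
 * The loop condition x + 1 < width guarantees both samples of a pair
 * exist; a leftover odd sample is handled on its own, so neither the
 * source nor the destination is accessed past index width - 1. */
static void copy_luma_pairs(uint8_t *dst, const uint8_t *src, int width)
{
    int x;

    for (x = 0; x + 1 < width; x += 2) {
        dst[x]     = src[x];
        dst[x + 1] = src[x + 1];
    }
    if (x < width)
        dst[x] = src[x];
}
```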
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 20 Aug, 2012 1 commit
-
-
XhmikosR authored
Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 19 Aug, 2012 1 commit
-
-
Giorgio Vazzana authored
Signed-off-by: Diego Biurrun <diego@biurrun.de>
-
- 16 Aug, 2012 1 commit
-
-
Giorgio Vazzana authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 15 Aug, 2012 1 commit
-
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 13 Aug, 2012 2 commits
-
-
Mans Rullgard authored
To access data at multiple fixed offsets from a base address, this code uses a single "m" operand and code of the form "32%0", relying on the memory operand instantiation having no displacement, giving a final result of the form "32(%rax)". If the compiler uses a register and displacement, e.g. "64(%rax)", the end result becomes "3264(%rax)", which obviously does not work.

Replacing the "m" operands with "r" operands allows safe addition of a displacement. In theory, multiple memory operands could use a shared base register with different index registers, "(%rax,%rbx)", potentially making more efficient use of registers. In the cases at hand, no such sharing is possible since the addresses involved are entirely unrelated.

After this change, the code somewhat rudely accesses memory without using a corresponding memory operand, which in some cases can lead to unwanted "optimisations" of surrounding code. However, the original code also accesses memory not covered by a memory operand, so this is not adding any defect not already present. It is also highly unlikely that any such optimisations could be performed here since the memory locations in question are not accessed elsewhere in the same functions.

This fixes crashes with suncc.
Signed-off-by: Mans Rullgard <mans@mansr.com>
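A minimal sketch of the "r" operand pattern described in the message above, assuming a GCC-compatible compiler targeting x86-64 with AT&T syntax. The helper `load_at_offset4` is hypothetical and is not taken from the changed code. The `"memory"` clobber is an addition made here, not something the commit message says was done: it tells the compiler that the asm reads memory it has no operand for, which is one way to avoid the unwanted "optimisations" the message mentions. On other targets the sketch falls back to plain C.

```c
#include <stdint.h>

/* Hypothetical helper: loads the 32-bit value at byte offset 4 from base. */
static inline uint32_t load_at_offset4(const uint32_t *base)
{
#if defined(__GNUC__) && defined(__x86_64__)
    uint32_t out;
    /* "r" yields a bare register, so "4(%1)" always expands to a valid
     * address such as "4(%rax)". With an "m" operand written as "4%1",
     * the compiler could substitute "64(%rax)" and the pasted text would
     * become the unintended "464(%rax)". */
    __asm__ volatile ("movl 4(%1), %0"
                      : "=r"(out)
                      : "r"(base)
                      : "memory");
    return out;
#else
    /* Portable fallback for targets without this inline asm dialect. */
    return base[1];
#endif
}
```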
-
Mans Rullgard authored
This code has been disabled since 2003. Nobody will ever look at it again.
Signed-off-by: Mans Rullgard <mans@mansr.com>
-
- 12 Aug, 2012 1 commit
-
-
Reimar Döffinger authored
About 30% faster on 32 bit Atom, 120% faster on 64 bit Phenom2. This is interesting because supporting P16 is easier in e.g. OpenGL (can misuse support for any 2-component 8 bit format), whereas supporting p9/p10 without conversion needs a texture format with at least 14 bits actual precision. The shiftonly == 0 case is not optimized since the code is more complex and the speed gain less obvious.
Signed-off-by: Reimar Döffinger <Reimar.Doeffinger@gmx.de>
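The difference between the two cases mentioned above can be sketched in plain C. This assumes that "shiftonly" means widening by a plain left shift, and that the other case fills the low bits by replicating the top bits of the sample; both helper names are hypothetical and the optimized code from the commit is not reproduced here.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: 10-bit to 16-bit by shifting only.
 * The maximum input 1023 maps to 65472, not to full scale. */
static void p10_to_p16_shift(uint16_t *dst, const uint16_t *src, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = (uint16_t)(src[i] << 6);
}

/* Hypothetical helper: 10-bit to 16-bit with bit replication.
 * The top 6 bits of the sample are copied into the low 6 bits,
 * so the maximum input 1023 maps to 65535. */
static void p10_to_p16_replicate(uint16_t *dst, const uint16_t *src, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = (uint16_t)((src[i] << 6) | (src[i] >> 4));
}
```

The shift-only loop is a single operation per sample, which is consistent with the message's remark that the other case is more complex to optimize.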
-
- 08 Aug, 2012 1 commit
-
-
Mans Rullgard authored
This puts x86-specific things in the x86/ subdirectory where they belong.
Signed-off-by: Mans Rullgard <mans@mansr.com>
-
- 03 Aug, 2012 1 commit
-
-
Diego Biurrun authored
Refactoring mmx2/mmxext YASM code with cpuflags will force renames. So switching to a consistent naming scheme beforehand is sensible. The name "mmxext" is more official and widespread and also the name of the CPU flag, as reported e.g. by the Linux kernel.
-
- 30 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
-
- 23 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
The function called in this block is itself also under HAVE_INLINE_ASM.
-
- 22 Jul, 2012 3 commits
-
-
Diego Biurrun authored
-
Diego Biurrun authored
The files do not contain only MMX code.
-
Ronald S. Bultje authored
-
- 21 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
Fixes compilation for compilers that do not support gcc inline assembly.
Signed-off-by: Diego Biurrun <diego@biurrun.de>
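The kind of guard this fix relies on can be sketched as below. `HAVE_INLINE_ASM` is the macro named elsewhere in this log; defaulting it to 0 when undefined, and the helper `add_one`, are assumptions made only so the sketch is self-contained. In a real build the macro would come from the configure step.

```c
/* Assumption for this sketch only: default to "no inline asm" when the
 * build system has not defined the macro. */
#ifndef HAVE_INLINE_ASM
#define HAVE_INLINE_ASM 0
#endif

/* Hypothetical helper: uses gcc-style inline asm only when it is enabled
 * and the target is x86; otherwise compiles the portable C path, so
 * compilers without gcc inline assembly never see the asm statement. */
static int add_one(int x)
{
#if HAVE_INLINE_ASM && defined(__GNUC__) && \
    (defined(__i386__) || defined(__x86_64__))
    __asm__ ("addl $1, %0" : "+r"(x) : : "cc");
    return x;
#else
    return x + 1;
#endif
}
```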
-
- 19 Jul, 2012 3 commits
-
-
Paul B Mahol authored
Signed-off-by: Paul B Mahol <onemda@gmail.com>
-
Paul B Mahol authored
Signed-off-by: Paul B Mahol <onemda@gmail.com>
-
Paul B Mahol authored
Signed-off-by: Paul B Mahol <onemda@gmail.com>
-
- 18 Jul, 2012 3 commits
-
-
Paul B Mahol authored
Using av_bswap16 should be enough.
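For readers unfamiliar with it, `av_bswap16` swaps the two bytes of a 16-bit value. A standalone equivalent is sketched below so it compiles without the libavutil headers; `bswap16_sketch` is a hypothetical name, not the library function itself.

```c
#include <stdint.h>

/* Hypothetical stand-in for a 16-bit byte swap: exchanges the high and
 * low bytes. The cast truncates the promoted int back to 16 bits. */
static uint16_t bswap16_sketch(uint16_t x)
{
    return (uint16_t)((x >> 8) | (x << 8));
}
```

Applying the swap twice returns the original value, which is why a single helper serves both reading and writing samples in the non-native byte order.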
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
- 17 Jul, 2012 2 commits
-
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-
Michael Niedermayer authored
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
-