- 03 Sep, 2012 1 commit
-
-
Diego Biurrun authored
-
- 01 Sep, 2012 1 commit
-
-
Carl Eugen Hoyos authored
-
- 30 Aug, 2012 2 commits
-
-
Diego Biurrun authored
-
Diego Biurrun authored
-
- 26 Aug, 2012 1 commit
-
-
Loren Merritt authored
13% faster on penryn, 16% on sandybridge, 15% on bulldozer Not simd; a compiler should have generated this, but gcc didn't.
-
- 24 Aug, 2012 1 commit
-
-
Justin Ruggles authored
-
- 22 Aug, 2012 1 commit
-
-
Mans Rullgard authored
GCC 4.3 and later do the right thing with the plain C code. Earlier versions in 32-bit mode generate one extra instruction, needlessly zeroing what would be the high half of the shifted value. At least two gcc configurations miscompile the inline asm in some situations. In 64-bit mode, all gcc versions generate imul r64, r64 followed by shr. On Intel i7 and later, this imul is faster 32-bit mul. On older Intel and all AMD, it is slightly slower. On Atom it is much slower. Considering where the FASTDIV macro is used, any overall negative performance impact of this change should be negligible. If anyone cares, they should file a bug against gcc and get the instruction selection fixed. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 16 Aug, 2012 1 commit
-
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 15 Aug, 2012 1 commit
-
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 08 Aug, 2012 2 commits
-
-
Mans Rullgard authored
These x86-specific macros do not belong in generic code. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
This puts x86-specific things in the x86/ subdirectory where they belong. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 07 Aug, 2012 5 commits
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Mans Rullgard authored
It appears that something goes wrong in old nasm versions when the %+ operator is used in the last argument of a macro invocation and this argument is tested with %ifdef within the macro. This patch rearranges the macro arguments such that the %+ operator is never used in the last argument.
-
Mans Rullgard authored
nasm does not support 'CPU foonop' directives. This adds a configure test for the directive and uses it only if supported. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
For some reason, nasm requires this. No harm done to yasm. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
nasm prints a warning if the colon is missing. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 03 Aug, 2012 2 commits
-
-
Diego Biurrun authored
Refactoring mmx2/mmxext YASM code with cpuflags will force renames. So switching to a consistent naming scheme beforehand is sensible. The name "mmxext" is more official and widespread and also the name of the CPU flag, as reported e.g. by the Linux kernel.
-
Diego Biurrun authored
Currently there is a wild mix of 3dn2/3dnow2/3dnowext. Switching to "3dnowext", which is a more common name of the CPU flag, as reported e.g. by the Linux kernel, unifies this.
-
- 28 Jul, 2012 2 commits
-
-
Loren Merritt authored
This allows us to unconditionally set the cglobal num_args parameter to a bigger value, thus making writing yasm code even easier than before. Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com>
-
Ronald S. Bultje authored
-
- 27 Jul, 2012 1 commit
-
-
Justin Ruggles authored
-
- 26 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
-
- 22 Jul, 2012 1 commit
-
-
Jason Garrett-Glaser authored
Simplifies pshufb masks that operate on words.
-
- 10 Jul, 2012 4 commits
-
-
Mans Rullgard authored
This adds macros for accessing the EFLAGS register and uses these instead of coding the entire check in inline asm. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Ronald S. Bultje authored
Signed-off-by:
Diego Biurrun <diego@biurrun.de> Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Ronald S. Bultje authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Ronald S. Bultje authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 07 Jul, 2012 3 commits
-
-
Clément Bœsch authored
This should fix the NASM box. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at>
-
Ronald S. Bultje authored
-
Michael Niedermayer authored
based on similar amdnop usage in ffmpeg Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 05 Jul, 2012 2 commits
-
-
Loren Merritt authored
Signed-off-by:
Diego Biurrun <diego@biurrun.de>
-
Loren Merritt authored
Signed-off-by:
Diego Biurrun <diego@biurrun.de>
-
- 01 Jul, 2012 1 commit
-
-
Mans Rullgard authored
This adds macros for accessing the EFLAGS register and uses these instead of coding the entire check in inline asm. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 25 Jun, 2012 3 commits
-
-
yang authored
Fixes Ticket1466 Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Mans Rullgard authored
This adds whitespace around operators, aligns line continuation backslashes, and breaks long lines. Also fixes an ifdef halfway through a statement. The one line of duplication this saved is not worth the ugliness. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Ronald S. Bultje authored
Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 23 Jun, 2012 1 commit
-
-
Diego Biurrun authored
-
- 19 Jun, 2012 1 commit
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 18 Jun, 2012 1 commit
-
-
Justin Ruggles authored
-
- 09 Jun, 2012 1 commit
-
-
Michael Niedermayer authored
The attribution was removed by libav while moving the code to libavutil The original code is from commit eb4825b5 Author: Loren Merritt <lorenm@u.washington.edu> Date: Thu Aug 10 19:06:25 2006 +0000 sse and 3dnow implementations of float->int conversion and mdct windowing. 15% faster vorbis. and commit 06972056 Author: Loren Merritt <lorenm@u.washington.edu> Date: Fri Aug 11 18:19:37 2006 +0000 vorbis simd tweaks Reviewed-by:
Paul B Mahol <onemda@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-