- 03 Aug, 2012 1 commit
-
-
Diego Biurrun authored
Refactoring mmx2/mmxext YASM code with cpuflags will force renames. So switching to a consistent naming scheme beforehand is sensible. The name "mmxext" is more official and widespread and also the name of the CPU flag, as reported e.g. by the Linux kernel.
-
- 05 Jul, 2012 1 commit
-
-
Martin Storsjö authored
This was missed in the previous commit, 70a1c800. Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 04 Jul, 2012 1 commit
-
-
Martin Storsjö authored
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
-
- 04 Mar, 2012 1 commit
-
-
Ronald S. Bultje authored
-
- 02 Mar, 2012 2 commits
-
-
Ronald S. Bultje authored
x86-64 is guaranteed to have at least SSE2, therefore the MMX/MMX2 functions will never be used in practice.
-
Ronald S. Bultje authored
On 64-bit platforms with 32-bit int, this means we won't have to sign-extend the integer anymore.
-
- 20 Oct, 2011 1 commit
-
-
Diego Biurrun authored
This avoids some unused variable warnings with YASM disabled.
-
- 19 Mar, 2011 1 commit
-
-
Mans Rullgard authored
Signed-off-by: Mans Rullgard <mans@mansr.com>
-
- 08 Sep, 2010 1 commit
-
-
Stefano Sabatini authored
function and rename it to av_get_cpu_flags(). Originally committed as revision 25076 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 04 Sep, 2010 1 commit
-
-
Stefano Sabatini authored
symbols, and move them from libavcodec/avcodec.h to libavutil/cpu.h. Originally committed as revision 25040 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 24 Aug, 2010 1 commit
-
-
Måns Rullgård authored
Originally committed as revision 24909 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 02 Aug, 2010 1 commit
-
-
Jason Garrett-Glaser authored
Lets us do the zeroing in asm instead of C. Also makes it consistent with the way the regular iDCT code does it. Originally committed as revision 24668 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 31 Jul, 2010 1 commit
-
-
Ronald S. Bultje authored
unchanged bytes) in the horizontal simple loopfilter. This makes the filter quite a bit faster in itself (~30 cycles less on Core1), probably mostly because we don't need a complex 4x4 transpose, but only a simple byte interleave. Also allows using pextrw on SSE4, which speeds up even more (e.g. 25% faster on Core i7). Originally committed as revision 24638 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 23 Jul, 2010 3 commits
-
-
Jason Garrett-Glaser authored
Add MMX idct_dc_add4uv function for this case. ~40% faster chroma idct. Originally committed as revision 24455 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Take shortcuts based on statistically common situations. Add 4-at-a-time idct_dc function (mmx and sse2) since rows of 4 DC-only DCT blocks are common. TODO: tie this more directly into the MB mode, since the DC-level transform is only used for non-splitmv blocks? Originally committed as revision 24452 to svn://svn.ffmpeg.org/ffmpeg/trunk
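The 4-at-a-time DC-only case described above can be sketched in plain C. This is an illustration only, not the committed MMX/SSE2 code: the function names `idct_dc_add` and `idct_dc_add4`, the row-of-four block layout, and the zeroing of the DC coefficient inside the function are assumptions made for the example. The rounding `(dc + 4) >> 3` is the usual VP8 DC-only reconstruction.

```c
#include <stddef.h>
#include <stdint.h>

/* Clamp an intermediate value to the 0..255 pixel range. */
static uint8_t clip_u8(int v)
{
    return v < 0 ? 0 : v > 255 ? 255 : (uint8_t)v;
}

/* Hypothetical helper: add the DC-only inverse transform of one 4x4 block
 * to the destination pixels. Assumes >> on a negative int is an arithmetic
 * shift, as on common compilers. */
static void idct_dc_add(uint8_t *dst, int16_t block[16], ptrdiff_t stride)
{
    int dc = (block[0] + 4) >> 3;

    block[0] = 0; /* assumption: the coefficient is cleared after use */
    for (int y = 0; y < 4; y++) {
        for (int x = 0; x < 4; x++)
            dst[x] = clip_u8(dst[x] + dc);
        dst += stride;
    }
}

/* Hypothetical 4-at-a-time variant: four horizontally adjacent DC-only
 * blocks, covering a 16x4 pixel strip in one call. */
static void idct_dc_add4(uint8_t *dst, int16_t block[4][16], ptrdiff_t stride)
{
    for (int i = 0; i < 4; i++)
        idct_dc_add(dst + 4 * i, block[i], stride);
}
```

A SIMD version would presumably gain by broadcasting the four DC values and handling each 16-pixel row with a few wide saturating adds and subtracts, rather than making four separate 4x4 calls.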
-
Jason Garrett-Glaser authored
~0.3% faster overall. Originally committed as revision 24448 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 22 Jul, 2010 2 commits
-
-
Ronald S. Bultje authored
CPUs supporting it. Originally committed as revision 24437 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Ronald S. Bultje authored
Originally committed as revision 24409 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 21 Jul, 2010 2 commits
-
-
Jason Garrett-Glaser authored
SSSE3 versions, improve SSE2 versions a bit. SSE2/SSSE3 mbedge h functions are currently broken, so explicitly disable them. Originally committed as revision 24403 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Avoid pextrw, since it's slow on many older CPUs. Now it doesn't require mmxext either. Originally committed as revision 24397 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 20 Jul, 2010 2 commits
-
-
Ronald S. Bultje authored
and chroma (width=8). Originally committed as revision 24378 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Ronald S. Bultje authored
Originally committed as revision 24377 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 19 Jul, 2010 4 commits
-
-
Ronald S. Bultje authored
wrong with it tomorrow or so, then re-submit. Originally committed as revision 24341 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Ronald S. Bultje authored
regular MMX code. An example of this is the Core1 CPU. Instead, set a new flag, FF_MM_SSE2/3SLOW, which can be checked for particular SSE2/3 functions that have been checked specifically on such CPUs and are actually faster than their MMX counterparts. In addition, use this flag to enable particular VP8 and LPC SSE2 functions that are faster than their MMX counterparts. Based on a patch by Loren Merritt <lorenm AT u washington edu>. Originally committed as revision 24340 to svn://svn.ffmpeg.org/ffmpeg/trunk
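The flag scheme described in this commit can be illustrated with a small dispatch sketch. Everything here is hypothetical: the flag names and bit values (`CPU_MMX`, `CPU_SSE2`, `CPU_SSE2SLOW`), the `pick_impl` helper, and the assumption that CPU detection reports the "slow" flag in place of the plain SSE2 flag on affected CPUs are made up for the example and are not FFmpeg's actual definitions.

```c
/* Hypothetical CPU flag bits, for illustration only. */
enum {
    CPU_MMX      = 1 << 0,
    CPU_SSE2     = 1 << 1, /* SSE2 available and generally fast          */
    CPU_SSE2SLOW = 1 << 2  /* SSE2 available but usually slower than MMX */
};

typedef enum { IMPL_C, IMPL_MMX, IMPL_SSE2 } impl_t;

/* Choose an implementation for one routine.
 * sse2_ok_when_slow is nonzero only for routines that were measured to
 * beat their MMX counterparts even on slow-SSE2 CPUs. */
static impl_t pick_impl(int cpu_flags, int sse2_ok_when_slow)
{
    impl_t impl = IMPL_C;

    if (cpu_flags & CPU_MMX)
        impl = IMPL_MMX;
    if (cpu_flags & CPU_SSE2)
        impl = IMPL_SSE2;
    else if ((cpu_flags & CPU_SSE2SLOW) && sse2_ok_when_slow)
        impl = IMPL_SSE2;

    return impl;
}
```

With this layout, a routine with no slow-CPU benchmark keeps using MMX on such CPUs by default, and only routines that were explicitly checked opt in to their SSE2 version.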
-
Ronald S. Bultje authored
Originally committed as revision 24339 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Ronald S. Bultje authored
so that it does both U and V planes at the same time. This will have speed advantages when using SSE2 (or higher) optimizations, since we can do both the U and V rows together in a single xmm register. This also renames filter16 to filter16y and filter8 to filter8uv so that it's more obvious what each function is used for. Originally committed as revision 24337 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 15 Jul, 2010 1 commit
-
-
Ronald S. Bultje authored
Originally committed as revision 24250 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 03 Jul, 2010 2 commits
-
-
Ronald S. Bultje authored
Originally committed as revision 24029 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Originally committed as revision 24013 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 02 Jul, 2010 1 commit
-
-
Jason Garrett-Glaser authored
Also make some small changes to saturation order of 4-tap SSSE3 MC to fix a non-bitexactness bug. Patch mostly by Eli Friedman <eli.friedman AT gmail DOT com>. Originally committed as revision 23965 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 01 Jul, 2010 1 commit
-
-
Jason Garrett-Glaser authored
Originally committed as revision 23946 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 29 Jun, 2010 2 commits
-
-
Ronald S. Bultje authored
Originally committed as revision 23886 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Originally committed as revision 23878 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 28 Jun, 2010 3 commits
-
-
Baptiste Coudurier authored
Originally committed as revision 23865 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Originally committed as revision 23858 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
Originally committed as revision 23857 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
- 27 Jun, 2010 2 commits
-
-
David Conrad authored
Originally committed as revision 23816 to svn://svn.ffmpeg.org/ffmpeg/trunk
-
Jason Garrett-Glaser authored
MMXEXT, SSE2 and SSSE3 MC functions; MMX and SSE4 IDCT dc_add functions. Patch by Jason Garrett-Glaser <darkshikari gmail com> and myself. Originally committed as revision 23815 to svn://svn.ffmpeg.org/ffmpeg/trunk
-