• Ronald S. Bultje's avatar
    Rename h264_idct_sse2.asm to h264_idct.asm; move inline IDCT asm from · 1d16a1cf
    Ronald S. Bultje authored
    h264dsp_mmx.c to h264_idct.asm (as yasm code). Because the loops are now
    coded in asm instead of C, this is (depending on the function) up to 50%
    faster for cases where gcc didn't do a great job at looping.
    
    Since h264_idct_add8() is now faster than the manual loop setup in h264.c,
    in-asm idct calling can now be enabled for chroma as well (see r16207). For
    MMX, this is 5% faster. For SSE2 (which isn't done for chroma if h264.c does
    the looping), this makes it up to 50% faster. Speed gain overall is ~0.5-1.0%.
    
    Originally committed as revision 25119 to svn://svn.ffmpeg.org/ffmpeg/trunk
    1d16a1cf
Name
Last commit
Last update
doc Loading commit data...
ffpresets Loading commit data...
libavcodec Loading commit data...
libavcore Loading commit data...
libavdevice Loading commit data...
libavfilter Loading commit data...
libavformat Loading commit data...
libavutil Loading commit data...
libpostproc Loading commit data...
libswscale Loading commit data...
tests Loading commit data...
tools Loading commit data...
COPYING.GPLv2 Loading commit data...
COPYING.GPLv3 Loading commit data...
COPYING.LGPLv2.1 Loading commit data...
COPYING.LGPLv3 Loading commit data...
CREDITS Loading commit data...
Changelog Loading commit data...
Doxyfile Loading commit data...
INSTALL Loading commit data...
LICENSE Loading commit data...
MAINTAINERS Loading commit data...
Makefile Loading commit data...
README Loading commit data...
cmdutils.c Loading commit data...
cmdutils.h Loading commit data...
cmdutils_common_opts.h Loading commit data...
common.mak Loading commit data...
configure Loading commit data...
ffmpeg.c Loading commit data...
ffplay.c Loading commit data...
ffprobe.c Loading commit data...
ffserver.c Loading commit data...
ffserver.h Loading commit data...
subdir.mak Loading commit data...
version.sh Loading commit data...