• Lauri Kasanen's avatar
    swscale/output: Altivec-optimize yuv2plane1_8 · 46c5693e
    Lauri Kasanen authored
    ./ffmpeg_g -f rawvideo -pix_fmt rgb24 -s hd1080 -i /dev/zero -pix_fmt yuv420p \
    -f null -vframes 100 -v error -nostats -
    
    1158 UNITS in planar1,   65528 runs,      8 skips
    
    -cpuflags 0
    
    19082 UNITS in planar1,   65533 runs,      3 skips
    
    16.48 speedup ratio. On x86, SSE2 is ~7. Curiously, the Power C version
    takes as many cycles as the x86 SSE2 version, yikes it's fast.
    
    Note that this function uses VSX instructions, but is not marked so.
    This is because several existing functions also make that mistake.
    I'll submit a patch moving them once this is reviewed.
    Signed-off-by: 's avatarLauri Kasanen <cand@gmx.com>
    Signed-off-by: 's avatarMichael Niedermayer <michael@niedermayer.cc>
    46c5693e
Name
Last commit
Last update
..
aarch64 Loading commit data...
arm Loading commit data...
ppc Loading commit data...
tests Loading commit data...
x86 Loading commit data...
Makefile Loading commit data...
alphablend.c Loading commit data...
bayer_template.c Loading commit data...
gamma.c Loading commit data...
hscale.c Loading commit data...
hscale_fast_bilinear.c Loading commit data...
input.c Loading commit data...
libswscale.v Loading commit data...
log2_tab.c Loading commit data...
options.c Loading commit data...
output.c Loading commit data...
rgb2rgb.c Loading commit data...
rgb2rgb.h Loading commit data...
rgb2rgb_template.c Loading commit data...
slice.c Loading commit data...
swscale.c Loading commit data...
swscale.h Loading commit data...
swscale_internal.h Loading commit data...
swscale_unscaled.c Loading commit data...
swscaleres.rc Loading commit data...
utils.c Loading commit data...
version.h Loading commit data...
vscale.c Loading commit data...
yuv2rgb.c Loading commit data...