• Martin Storsjö's avatar
    aarch64: Make transpose_4x4H do a regular transpose · cdb1665f
    Martin Storsjö authored
    Previously, ff_h264_idct_add_neon (originally in the arm version) used
    a non-regular transpose in order to be able to use more instructions
    that deal with registers as 128 bit register pairs. The aarch64
    translation doesn't do it to the same extent, but brought along the
    same structure since it was a straight translation.
    
    This reshuffles ff_h264_idct_add_neon, bringing it closer to
    the C implementation, making the transpose_4x4H macro do a regular
    transpose, usable for other algorithms as well.
    
    Previously, the third and fourth output from transpose_4x4H were
    swapped, and prior to cc29d96d, the same inputs as well. In
    addition to just swapping the outputs, also renumber the intermediate
    registers for better readability (making the register order match
    transpose_4x8B).
    
    This runs with the same number of cycles as before.
    Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
    cdb1665f
Name
Last commit
Last update
compat Loading commit data...
doc Loading commit data...
libavcodec Loading commit data...
libavdevice Loading commit data...
libavfilter Loading commit data...
libavformat Loading commit data...
libavresample Loading commit data...
libavutil Loading commit data...
libswscale Loading commit data...
presets Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.travis.yml Loading commit data...
COPYING.GPLv2 Loading commit data...
COPYING.GPLv3 Loading commit data...
COPYING.LGPLv2.1 Loading commit data...
COPYING.LGPLv3 Loading commit data...
CREDITS Loading commit data...
Changelog Loading commit data...
INSTALL Loading commit data...
LICENSE Loading commit data...
Makefile Loading commit data...
README Loading commit data...
README.md Loading commit data...
RELEASE Loading commit data...
arch.mak Loading commit data...
avconv.c Loading commit data...
avconv.h Loading commit data...
avconv_dxva2.c Loading commit data...
avconv_filter.c Loading commit data...
avconv_opt.c Loading commit data...
avconv_qsv.c Loading commit data...
avconv_vda.c Loading commit data...
avconv_vdpau.c Loading commit data...
avplay.c Loading commit data...
avprobe.c Loading commit data...
cmdutils.c Loading commit data...
cmdutils.h Loading commit data...
cmdutils_common_opts.h Loading commit data...
common.mak Loading commit data...
configure Loading commit data...
library.mak Loading commit data...
version.sh Loading commit data...