avutil/pixelutils: faster pixelutils_sad_[au]_16x16

~560 → ~500 decicycles

This is following the comments from Michael in
https://ffmpeg.org/pipermail/ffmpeg-devel/2014-August/160599.html
Using 2 registers for accumulator didn't help. On the other hand, some
re-ordering between the movs and psadbw allowed going ~538 to ~500.
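
For context, a minimal scalar sketch of what a 16x16 SAD (sum of absolute differences) computes. This is not the code touched by this commit: the function name sad_16x16_ref and its signature are hypothetical, chosen only for illustration. The SIMD version relies on psadbw to sum absolute byte differences across a register instead of looping per pixel.

```c
#include <stddef.h>
#include <stdint.h>
#include <stdlib.h>

/* Hypothetical scalar reference, NOT the FFmpeg implementation:
 * sums |src1[x] - src2[x]| over a 16x16 block of 8-bit samples.
 * stride1/stride2 are the distances in bytes between consecutive rows. */
static int sad_16x16_ref(const uint8_t *src1, ptrdiff_t stride1,
                         const uint8_t *src2, ptrdiff_t stride2)
{
    int sum = 0;

    for (int y = 0; y < 16; y++) {
        for (int x = 0; x < 16; x++)
            sum += abs(src1[x] - src2[x]); /* operands promote to int */
        src1 += stride1;
        src2 += stride2;
    }
    return sum;
}
```

The largest possible result is 16 * 16 * 255 = 65280, so an int accumulator cannot overflow here.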