• Clément Bœsch's avatar
    lavfi/nlmeans: make compute_safe_ssd_integral_image_c faster · 43d16aef
    Clément Bœsch authored
    before:  ssd_integral_image_c: 49204.6
    after:   ssd_integral_image_c: 44272.8
    
    Unrolling by 4 made the biggest difference on odroid-c2 (aarch64);
    unrolling by 2 or 8 both raised 46k cycles vs 44k for 4.
    
    Additionally, this is a much better reference when writing SIMD (SIMD
    vectorization will just target 16 instead of 4).
    43d16aef
Name
Last commit
Last update
compat Loading commit data...
doc Loading commit data...
ffbuild Loading commit data...
fftools Loading commit data...
libavcodec Loading commit data...
libavdevice Loading commit data...
libavfilter Loading commit data...
libavformat Loading commit data...
libavresample Loading commit data...
libavutil Loading commit data...
libpostproc Loading commit data...
libswresample Loading commit data...
libswscale Loading commit data...
presets Loading commit data...
tests Loading commit data...
tools Loading commit data...
.gitattributes Loading commit data...
.gitignore Loading commit data...
.travis.yml Loading commit data...
CONTRIBUTING.md Loading commit data...
COPYING.GPLv2 Loading commit data...
COPYING.GPLv3 Loading commit data...
COPYING.LGPLv2.1 Loading commit data...
COPYING.LGPLv3 Loading commit data...
CREDITS Loading commit data...
Changelog Loading commit data...
INSTALL.md Loading commit data...
LICENSE.md Loading commit data...
MAINTAINERS Loading commit data...
Makefile Loading commit data...
README.md Loading commit data...
RELEASE Loading commit data...
configure Loading commit data...