    x86/hevcdsp: add ff_hevc_sao_edge_filter_8_{ssse3,avx2} · 042c1159
    James Almer authored
    Original x86 intrinsics code and initial yasm port by Pierre-Edouard Lepere.
    Refactoring and optimizations by James Almer.
    
    Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U
    
    Width 32
    158583 decicycles in edge, sao_edge_filter_8 runs, 0 skips
    5205 decicycles in ff_hevc_sao_edge_filter_32_8_ssse3, 32767 runs, 1 skips
    2942 decicycles in ff_hevc_sao_edge_filter_32_8_avx2, 32767 runs, 1 skips
    
    Width 64
    705639 decicycles in sao_edge_filter_8, 262144 runs, 0 skips
    19224 decicycles in ff_hevc_sao_edge_filter_64_8_ssse3, 262111 runs, 33 skips
    10433 decicycles in ff_hevc_sao_edge_filter_64_8_avx2, 262115 runs, 29 skips
    Signed-off-by: James Almer <jamrial@gmail.com>
hevc_filter.c 37.2 KB