• James Almer's avatar
    x86/hevcdsp: add ff_hevc_sao_edge_filter_{10,12}_{sse2,avx2} · 15574c50
    James Almer authored
    Original x86 intrinsics code by Pierre-Edouard Lepere.
    Yasm port, refactoring and optimizations by James Almer.
    
    Benchmarks of BQTerrace_1920x1080_60_qp22.bin with an Intel Core i5-4200U
    
    Width 32
    342694 decicycles in sao_edge_filter_10, 16384 runs, 0 skips
    29476 decicycles in ff_hevc_sao_edge_filter_32_10_ssse3, 16384 runs, 0 skips
    13996 decicycles in ff_hevc_sao_edge_filter_32_10_avx2, 16381 runs, 3 skips
    
    Width 64
    581163 decicycles in sao_edge_filter_10, 8192 runs, 0 skips
    59774 decicycles in ff_hevc_sao_edge_filter_64_10_ssse3, 8192 runs, 0 skips
    28383 decicycles in ff_hevc_sao_edge_filter_64_10_avx2, 8191 runs, 1 skips
    Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
    15574c50
hevcdsp_init.c 38.8 KB