• Ilia Valiakhmetov's avatar
    libavcodec/vp9: ipred_dl_32x32_16 avx2 implementation · 73d9a9a6
    Ilia Valiakhmetov authored
    vp9_diag_downleft_32x32_8bpp_c: 580.2
    vp9_diag_downleft_32x32_8bpp_sse2: 75.6
    vp9_diag_downleft_32x32_8bpp_ssse3: 73.7
    vp9_diag_downleft_32x32_8bpp_avx: 72.7
    vp9_diag_downleft_32x32_10bpp_c: 1101.2
    vp9_diag_downleft_32x32_10bpp_sse2: 145.4
    vp9_diag_downleft_32x32_10bpp_ssse3: 137.5
    vp9_diag_downleft_32x32_10bpp_avx: 134.8
    vp9_diag_downleft_32x32_10bpp_avx2: 94.0
    vp9_diag_downleft_32x32_12bpp_c: 1108.5
    vp9_diag_downleft_32x32_12bpp_sse2: 145.5
    vp9_diag_downleft_32x32_12bpp_ssse3: 137.3
    vp9_diag_downleft_32x32_12bpp_avx: 135.2
    vp9_diag_downleft_32x32_12bpp_avx2: 94.0
    
    ~30% faster than avx implementation
    Signed-off-by: 's avatarRonald S. Bultje <rsbultje@gmail.com>
    73d9a9a6
vp9intrapred_16bpp.asm 78.7 KB