• Ronald S. Bultje's avatar
    vp9/x86: 16x16 sub-IDCT for top-left 8x8 subblock (eob <= 38). · 0d9375fc
    Ronald S. Bultje authored
    Sub8x8 speed (w/o dc-only case) goes from ~750 cycles (inter) or ~735
    cycles (intra) to ~415 cycles (inter) or ~430 cycles (intra). Average
    overall 16x16 idct speed goes from ~635 cycles (inter) or ~720 cycles
    (intra) to ~415 cycles (inter) or ~545 (intra) - all measurements done
    using ped1080p.webm.
    0d9375fc
vp9itxfm.asm 24.6 KB