• Ben Avison's avatar
    truehd: add hand-scheduled ARM asm version of mlp_filter_channel. · 87b128d5
    Ben Avison authored
    Profiling results for overall audio decode and the mlp_filter_channel(_arm)
    function in particular are as follows:
    
                  Before          After
                  Mean   StdDev   Mean   StdDev  Confidence  Change
    6:2 total     380.4  22.0     370.8  17.0    87.4%       +2.6%  (insignificant)
    6:2 function  60.7   7.2      36.6   8.1     100.0%      +65.8%
    8:2 total     357.0  17.5     343.2  19.0    97.8%       +4.0%  (insignificant)
    8:2 function  60.3   8.8      37.3   3.8     100.0%      +61.8%
    6:6 total     717.2  23.2     658.4  15.7    100.0%      +8.9%
    6:6 function  140.4  12.9     81.5   9.2     100.0%      +72.4%
    8:8 total     981.9  16.2     896.2  24.5    100.0%      +9.6%
    8:8 function  193.4  15.0     103.3  11.5    100.0%      +87.2%
    
    Experiments with adding preload instructions to this function yielded no
    useful benefit, so these have not been included.
    
    The assembly version has also been tested with a fuzz tester to ensure that
    any combinations of inputs not exercised by my available test streams still
    generate mathematically identical results to the C version.
    Signed-off-by: 's avatarMichael Niedermayer <michaelni@gmx.at>
    87b128d5
mlpdsp_arm.S 12.1 KB