1. 26 Mar, 2014 5 commits
    • Ben Avison's avatar
      truehd: add hand-scheduled ARM asm version of ff_mlp_rematrix_channel. · 89135716
      Ben Avison authored
      Profiling results for overall audio decode and the rematrix_channels function
      in particular are as follows:
      
                    Before          After
                    Mean   StdDev   Mean   StdDev  Confidence  Change
      6:2 total     370.8  17.0     348.8  20.1    99.9%       +6.3%
      6:2 function  46.4   8.4      45.8   6.6     18.0%       +1.2%  (insignificant)
      8:2 total     343.2  19.0     339.1  15.4    54.7%       +1.2%  (insignificant)
      8:2 function  38.9   3.9      40.2   6.9     52.4%       -3.2%  (insignificant)
      6:6 total     658.4  15.7     604.6  20.8    100.0%      +8.9%
      6:6 function  109.0  8.7      59.5   5.4     100.0%      +83.3%
      8:8 total     896.2  24.5     766.4  17.6    100.0%      +16.9%
      8:8 function  223.4  12.8     93.8   5.0     100.0%      +138.3%
      
      The assembly version has also been tested with a fuzz tester to ensure that
      any combinations of inputs not exercised by my available test streams still
      generate mathematically identical results to the C version.
      Signed-off-by: 's avatarMichael Niedermayer <michaelni@gmx.at>
      89135716
    • Ben Avison's avatar
      truehd: add hand-scheduled ARM asm version of mlp_filter_channel. · 87b128d5
      Ben Avison authored
      Profiling results for overall audio decode and the mlp_filter_channel(_arm)
      function in particular are as follows:
      
                    Before          After
                    Mean   StdDev   Mean   StdDev  Confidence  Change
      6:2 total     380.4  22.0     370.8  17.0    87.4%       +2.6%  (insignificant)
      6:2 function  60.7   7.2      36.6   8.1     100.0%      +65.8%
      8:2 total     357.0  17.5     343.2  19.0    97.8%       +4.0%  (insignificant)
      8:2 function  60.3   8.8      37.3   3.8     100.0%      +61.8%
      6:6 total     717.2  23.2     658.4  15.7    100.0%      +8.9%
      6:6 function  140.4  12.9     81.5   9.2     100.0%      +72.4%
      8:8 total     981.9  16.2     896.2  24.5    100.0%      +9.6%
      8:8 function  193.4  15.0     103.3  11.5    100.0%      +87.2%
      
      Experiments with adding preload instructions to this function yielded no
      useful benefit, so these have not been included.
      
      The assembly version has also been tested with a fuzz tester to ensure that
      any combinations of inputs not exercised by my available test streams still
      generate mathematically identical results to the C version.
      Signed-off-by: 's avatarMichael Niedermayer <michaelni@gmx.at>
      87b128d5
    • Ben Avison's avatar
      truehd: break out part of output_data into platform-specific callback. · b9eb0341
      Ben Avison authored
      Verified with profiling that this doesn't have a measurable effect upon
      overall performance.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      b9eb0341
    • Ben Avison's avatar
      truehd: break out part of rematrix_channels into platform-specific callback. · 4e5aa080
      Ben Avison authored
      Verified with profiling that this doesn't have a measurable effect upon
      overall performance.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      4e5aa080
    • Ben Avison's avatar
      truehd: add hand-scheduled ARM asm version of mlp_filter_channel. · 15a29c39
      Ben Avison authored
      Profiling results for overall audio decode and the mlp_filter_channel(_arm)
      function in particular are as follows:
      
                    Before          After
                    Mean   StdDev   Mean   StdDev  Confidence  Change
      6:2 total     380.4  22.0     370.8  17.0    87.4%       +2.6%  (insignificant)
      6:2 function  60.7   7.2      36.6   8.1     100.0%      +65.8%
      8:2 total     357.0  17.5     343.2  19.0    97.8%       +4.0%  (insignificant)
      8:2 function  60.3   8.8      37.3   3.8     100.0%      +61.8%
      6:6 total     717.2  23.2     658.4  15.7    100.0%      +8.9%
      6:6 function  140.4  12.9     81.5   9.2     100.0%      +72.4%
      8:8 total     981.9  16.2     896.2  24.5    100.0%      +9.6%
      8:8 function  193.4  15.0     103.3  11.5    100.0%      +87.2%
      
      Experiments with adding preload instructions to this function yielded no
      useful benefit, so these have not been included.
      
      The assembly version has also been tested with a fuzz tester to ensure that
      any combinations of inputs not exercised by my available test streams still
      generate mathematically identical results to the C version.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      15a29c39
  2. 11 Oct, 2012 1 commit
  3. 31 Jan, 2012 2 commits
  4. 12 Sep, 2011 1 commit
  5. 03 Apr, 2011 1 commit
  6. 19 Mar, 2011 1 commit
  7. 20 May, 2010 1 commit
  8. 20 Apr, 2010 1 commit
  9. 01 Feb, 2009 1 commit
  10. 19 Jan, 2009 1 commit
  11. 13 Oct, 2008 1 commit
  12. 31 Aug, 2008 1 commit
  13. 17 Oct, 2007 1 commit
  14. 09 Jul, 2007 1 commit
  15. 17 Jun, 2007 2 commits
  16. 16 Jun, 2007 1 commit
  17. 27 Jan, 2007 1 commit
  18. 07 Oct, 2006 1 commit
  19. 10 Sep, 2006 1 commit
  20. 19 Jul, 2006 1 commit