1. 07 Apr, 2016 1 commit
  2. 24 Mar, 2016 2 commits
  3. 14 Dec, 2015 1 commit
    • Janne Grunau's avatar
      libavutil: move FFALIGN macro from common.h to macros.h · 50078c1c
      Janne Grunau authored
      Include macros.h explicitly in common.h so that external code using
      FFALIGN does not break. It was already implicitly included through
      version.h. Include macros.h in lls.h and internal.h for FFALIGN.
      lls.h was including common.h only for FFALIGN and internal.h was
      missing the include for FFALIGN. `make checkheaders` did not catch it
      because it's an internal header.
      50078c1c
  4. 26 Nov, 2015 1 commit
    • Ganesh Ajjanagadde's avatar
      avutil/lls: speed up performance of solve_lls · 68e79b27
      Ganesh Ajjanagadde authored
      This is a trivial rewrite of the loops that results in better
      prefetching and associated cache efficiency. Essentially, the problem is
      that modern prefetching logic is based on finite state Markov memory, a reasonable
      assumption that is used elsewhere in CPU's in for instance branch
      predictors.
      
      Surrounding loops all iterate forward through the array, making the
      predictor think of prefetching in the forward direction, but the
      intermediate loop is unnecessarily in the backward direction.
      
      Speedup is nontrivial. Benchmarks obtained by 10^6 iterations within
      solve_lls, with START/STOP_TIMER. File is tests/data/fate/flac-16-lpc-cholesky.err.
      Hardware: x86-64, Haswell, GNU/Linux.
      
      new:
        17291 decicycles in solve_lls, 2096706 runs,    446 skips
        17255 decicycles in solve_lls, 4193657 runs,    647 skips
        17231 decicycles in solve_lls, 8384997 runs,   3611 skips
        17189 decicycles in solve_lls,16771010 runs,   6206 skips
        17132 decicycles in solve_lls,33544757 runs,   9675 skips
        17092 decicycles in solve_lls,67092404 runs,  16460 skips
        17058 decicycles in solve_lls,134188213 runs,  29515 skips
      
      old:
        18009 decicycles in solve_lls, 2096665 runs,    487 skips
        17805 decicycles in solve_lls, 4193320 runs,    984 skips
        17779 decicycles in solve_lls, 8386855 runs,   1753 skips
        18289 decicycles in solve_lls,16774280 runs,   2936 skips
        18158 decicycles in solve_lls,33548104 runs,   6328 skips
        18420 decicycles in solve_lls,67091793 runs,  17071 skips
        18310 decicycles in solve_lls,134187219 runs,  30509 skips
      Reviewed-by: 's avatarMichael Niedermayer <michael@niedermayer.cc>
      Signed-off-by: 's avatarGanesh Ajjanagadde <gajjanagadde@gmail.com>
      68e79b27
  5. 28 Aug, 2015 1 commit
  6. 28 Sep, 2014 1 commit
  7. 09 Aug, 2014 1 commit
  8. 17 Nov, 2013 2 commits
  9. 29 Jun, 2013 3 commits
  10. 04 May, 2013 1 commit
  11. 01 Mar, 2013 2 commits
  12. 28 Feb, 2013 2 commits
  13. 11 Jul, 2011 1 commit
  14. 08 Jul, 2011 1 commit
  15. 19 Mar, 2011 1 commit
  16. 20 Apr, 2010 1 commit
  17. 10 Aug, 2009 1 commit
  18. 20 Mar, 2009 1 commit
  19. 01 Feb, 2009 1 commit
  20. 07 Aug, 2008 1 commit
  21. 31 Jan, 2008 1 commit
  22. 08 Jan, 2008 1 commit
  23. 23 Nov, 2007 1 commit
  24. 05 Jul, 2007 1 commit
  25. 07 Oct, 2006 1 commit
  26. 26 Jul, 2006 1 commit
  27. 15 Jul, 2006 1 commit
  28. 14 Jul, 2006 2 commits