1. 07 May, 2014 1 commit
  2. 10 Apr, 2014 1 commit
  3. 30 Mar, 2014 1 commit
  4. 18 Mar, 2014 1 commit
  5. 05 Mar, 2014 1 commit
  6. 31 Oct, 2013 1 commit
  7. 26 Oct, 2013 1 commit
  8. 30 Aug, 2013 1 commit
  9. 06 Jul, 2013 2 commits
  10. 04 Jul, 2013 1 commit
  11. 22 Jan, 2013 1 commit
  12. 17 Sep, 2012 1 commit
  13. 13 Aug, 2012 1 commit
    • Mans Rullgard's avatar
      x86: cabac: allow building with suncc · 8ec0204e
      Mans Rullgard authored
      This fixes two issues preventing suncc from building this code.
      
      The undocumented 'a' operand modifier, causing gcc to omit a $ in
      front of immediate operands (as required in addresses), is not
      supported by suncc.  Luckily, the also undocumented 'c' modifer
      has the same effect and is supported.
      
      On some asm statements with a large number of operands, suncc for no
      obvious reason fails to correctly substitute some of the operands.
      Fortunately, some of the operands in these statements are plain
      numbers which can be inserted directly into the code block instead
      of passed as operands.
      
      With these changes, the code builds correctly with both gcc and
      suncc.
      Signed-off-by: 's avatarMans Rullgard <mans@mansr.com>
      8ec0204e
  14. 08 Aug, 2012 1 commit
  15. 25 Jun, 2012 1 commit
  16. 28 Apr, 2012 6 commits
  17. 21 Apr, 2012 1 commit
  18. 20 Apr, 2012 1 commit
    • Roland Scheidegger's avatar
      h264: assembly version of get_cabac for x86_64 with PIC (v4) · a812b599
      Roland Scheidegger authored
      This adds a hand-optimized assembly version for get_cabac much like the
      existing one, but it works if the table offsets are RIP-relative.
      Compared to the non-RIP-relative version this adds 2 lea instructions
      and it needs one extra register.
      There is a surprisingly large performance improvement over the c version (more
      so than the generated assembly seems to suggest) just in get_cabac, I measured
      roughly 40% faster for get_cabac on a K8. However, overall the difference is
      not that big, I measured roughly 5% on a test clip on a K8 and a Core2.
      Hopefully it still compiles on x86 32bit...
      v2: incorporated feedback from Loren Merritt to avoid rip-relative movs
      for every table, and got rid of unnecessary @GOTPCREL.
      v3: apply similar fixes to the the decode_significance functions, and use
      same macro arguments for non-pic case.
      v4: prettify inline asm arguments, add a non-fast-cmov version (as I expect
      the c code to be faster otherwise since both cmov and sbb suck hard on a
      Prescott, even can't construct the mask with a 64bit shift as that's just as
      terrible - it's quite difficult to find usable instructions on that chip...).
      This is tested to work but not on a P4, in theory it _should_ be fast there.
      Signed-off-by: 's avatarMichael Niedermayer <michaelni@gmx.at>
      a812b599
  19. 28 Mar, 2012 6 commits
  20. 11 Jan, 2012 1 commit
  21. 09 Jan, 2012 3 commits
  22. 06 Jan, 2012 1 commit
  23. 12 Dec, 2011 2 commits
  24. 11 Dec, 2011 1 commit
  25. 08 Nov, 2011 1 commit
  26. 03 Aug, 2011 1 commit