- 30 Nov, 2016 1 commit
-
-
James Darnley authored
Yorkfield: - sse2: - complex: 4.13x faster (1514 vs. 367 cycles) - simple: 4.38x faster (1836 vs. 419 cycles) Skylake: - sse2: - complex: 3.61x faster ( 936 vs. 260 cycles) - simple: 3.97x faster (1126 vs. 284 cycles) - avx (versus sse2): - complex: 1.07x faster (260 vs. 244 cycles) - simple: 1.03x faster (284 vs. 274 cycles)
-
- 16 Jun, 2016 1 commit
-
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 03 Oct, 2015 1 commit
-
-
Ronald S. Bultje authored
-
- 07 Feb, 2015 1 commit
-
-
Christophe Gisquet authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 13 Mar, 2014 2 commits
-
-
Diego Biurrun authored
This helps grepping for functions, among other things.
-
Diego Biurrun authored
-
- 30 Aug, 2013 1 commit
-
-
Thilo Borgmann authored
-
- 10 Apr, 2013 1 commit
-
-
Ronald S. Bultje authored
The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster. Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 19 Feb, 2013 1 commit
-
-
Ronald S. Bultje authored
The non-intra-pcm branch in hl_decode_mb (simple, 8bpp) goes from 700 to 672 cycles, and the complete loop of decode_mb_cabac and hl_decode_mb (in the decode_slice loop) goes from 1759 to 1733 cycles on the clip tested (cathedral), i.e. almost 30 cycles per mb faster. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 13 Nov, 2012 1 commit
-
-
Diego Biurrun authored
-
- 11 Nov, 2012 1 commit
-
-
Diego Biurrun authored
An assembler able to cope with AVX instructions is now required.
-
- 30 Oct, 2012 2 commits
-
-
Diego Biurrun authored
This is more consistent with the way we handle C #includes and it simplifies the build system.
-
Diego Biurrun authored
This is necessary to allow refactoring some x86util macros with cpuflags.
-
- 30 Aug, 2012 1 commit
-
-
Diego Biurrun authored
-
- 07 Aug, 2012 1 commit
-
-
Mans Rullgard authored
nasm prints a warning if the colon is missing. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 28 Jul, 2012 1 commit
-
-
Ronald S. Bultje authored
-
- 11 Apr, 2012 1 commit
-
-
Henrik Gramner authored
Add support for all x86-64 registers Prefer caller-saved register over callee-saved on WIN64 Support up to 15 function arguments Also (by Ronald S. Bultje) Fix up our asm to work with new x86inc.asm. Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com> Signed-off-by:
Justin Ruggles <justin.ruggles@gmail.com>
-
- 07 Feb, 2012 1 commit
-
-
Michael Kostylev authored
This fixes XMM register clobber problems on Win64. Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com>
-
- 27 Jan, 2012 1 commit
-
-
Ronald S. Bultje authored
This allows combining multiple conditionals in a single statement.
-
- 19 Oct, 2011 1 commit
-
-
Kieran Kunhya authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 15 Aug, 2011 1 commit
-
-
Dave Yeo authored
Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com>
-
- 12 Aug, 2011 2 commits
-
-
Ronald S. Bultje authored
This allows using it in swscale also.
-
Ronald S. Bultje authored
This allows using it in libswscale/ also.
-
- 14 Jun, 2011 1 commit
-
-
Jason Garrett-Glaser authored
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
-
- 13 Jun, 2011 2 commits
-
-
Jason Garrett-Glaser authored
Needs some ARM/PPC asm modifications.
-
Jason Garrett-Glaser authored
Note: this is 4:4:4 from the 2007 spec revision, not the previous (now deprecated) 4:4:4 mode in H.264.
-
- 02 Jun, 2011 2 commits
-
-
Loren Merritt authored
Removes redundant dword tags and whitespace changes. Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com>
-
Loren Merritt authored
Signed-off-by:
Ronald S. Bultje <rsbultje@gmail.com>
-
- 31 May, 2011 1 commit
-
-
Daniel Kang authored
Ports the majority of IDCT functions for 10-bit H.264. Parts are inspired from 8-bit IDCT code in Libav; other parts ported from x264 with relicensing permission from author. Signed-off-by:
Ronald S. Bultje <rbultje@google.com>
-