- 04 Aug, 2015 1 commit
-
-
Henrik Gramner authored
The .text section is already 16-byte aligned by default on all supported platforms so `SECTION_TEXT` isn't any different from `SECTION .text`.
-
- 26 Jul, 2015 1 commit
-
-
James Almer authored
Silences warnings with Nasm Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 17 Feb, 2015 1 commit
-
-
Christophe Gisquet authored
11 xmm regs seem only required for avx2. Reviewed-by: Mickaël Raulet <mraulet@insa-rennes.fr Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 16 Feb, 2015 1 commit
-
-
Christophe Gisquet authored
The 3*stride value stored in r3src can be loaded much later, so use r3src instead of a dedicated gpr when possible. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 14 Feb, 2015 1 commit
-
-
Christophe Gisquet authored
The second stride is always the internal buffer one, MAX_PB_SIZE (times 2 to get the value in bytes). Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 12 Feb, 2015 1 commit
-
-
James Almer authored
Before 40766 decicycles in ff_hevc_put_hevc_qpel_h64_8_avx2, 8192 runs, 0 skips After 37975 decicycles in ff_hevc_put_hevc_qpel_h64_8_avx2, 8192 runs, 0 skips Reviewed-by:
Christophe Gisquet <christophe.gisquet@gmail.com> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 08 Feb, 2015 2 commits
-
-
Christophe Gisquet authored
The second parameter to the macro is always an immediate address, so no lea is needed. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
In that case, it's just to load my, but mx/r3src is not used. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 07 Feb, 2015 2 commits
-
-
Christophe Gisquet authored
The width parameter is now completely at the back, and actually never used. This helps understanding the actual parameter list. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 06 Feb, 2015 4 commits
-
-
Christophe Gisquet authored
Reviewed-by:
"Ronald S. Bultje" <rsbultje@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Mickaël Raulet authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Mickaël Raulet authored
Conflicts: libavcodec/x86/hevc_mc.asm Reviewed-by:
James Almer <jamrial@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Pierre Edouard Lepere authored
before 33304 decicycles in luma_bi_1, 523066 runs, 1222 skips 38138 decicycles in luma_bi_2, 523427 runs, 861 skips 13490 decicycles in luma_uni, 516138 runs, 8150 skips after 20185 decicycles in luma_bi_1, 519970 runs, 4318 skips 24620 decicycles in luma_bi_2, 521024 runs, 3264 skips 10397 decicycles in luma_uni, 515715 runs, 8573 skips Conflicts: libavcodec/x86/hevc_mc.asm libavcodec/x86/hevcdsp_init.c Reviewed-by:
James Almer <jamrial@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 11 Dec, 2014 1 commit
-
-
Michael Niedermayer authored
These fix failures of --enable-xmm-clobber-test It would be better to change the code to use fewer registers, but until someone does the used register count must not be too small Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 10 Dec, 2014 1 commit
-
-
Michael Niedermayer authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 04 Oct, 2014 1 commit
-
-
Mickaël Raulet authored
cherry picked from commit df8ebe304df453f26c28ff8f11d607f49b90a4c2 Fixes out of array access Fixes: asan_stack-oob_1046454_9_asan_stack-oob_15a9e7c_170_WP_MAIN10_B_Toshiba_3.bit Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 24 Aug, 2014 1 commit
-
-
Christophe Gisquet authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 22 Aug, 2014 4 commits
-
-
Christophe Gisquet authored
The stride for some buffers is known. Reviewed-by:
Mickaël Raulet <mraulet@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
The dststride parameter is always MAX_PB_SIZE. Reviewed-by:
Mickaël Raulet <mraulet@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
Reviewed-by: Mickaël Raulet <mraulet@gmail.com Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 04 Aug, 2014 2 commits
-
-
James Almer authored
Signed-off-by:
James Almer <jamrial@gmail.com> Reviewed-by:
Christophe Gisquet <christophe.gisquet@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
James Almer authored
Signed-off-by:
James Almer <jamrial@gmail.com> Reviewed-by:
Mickaël Raulet <mraulet@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 29 Jul, 2014 1 commit
-
-
Christophe Gisquet authored
A macro was using a fixed register, causing too many GPRs to be declared as used. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 27 Jul, 2014 2 commits
-
-
Christophe Gisquet authored
Before: 5679 decicycles in epel_bi, 2059976 runs, 37176 skips 3468 decicycles in epel_uni, 1040886 runs, 7690 skips After: 5323 decicycles in epel_bi, 2059493 runs, 37659 skips 3262 decicycles in epel_uni, 1040871 runs, 7705 skips Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
Christophe Gisquet authored
Should have been in 036f11bd. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 26 Jul, 2014 1 commit
-
-
Christophe Gisquet authored
lea is detrimental for those simple cases. No impact overall to the change though. Before: 15017 decicycles in q, 1016152 runs, 32424 skips 15382 decicycles in q_bi, 1013673 runs, 34903 skips 3713 decicycles in e, 2074534 runs, 22618 skips 3901 decicycles in e_bi, 2065509 runs, 31643 skips 7852 decicycles in q_uni, 520165 runs, 4123 skips 2398 decicycles in e_uni, 1043339 runs, 5237 skips After: 14898 decicycles in q, 1016295 runs, 32281 skips 15119 decicycles in q_bi, 1015392 runs, 33184 skips 3682 decicycles in e, 20732246 runs, 23928 skips 3720 decicycles in e_bi, 2065043 runs, 32109 skips 7643 decicycles in q_uni, 520280 runs, 4008 skips 2363 decicycles in e_uni, 1043780 runs, 4796 skips Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 25 Jul, 2014 1 commit
-
-
Mickaël Raulet authored
cherry picked from commit 3fcb7a4595a6f40100a22110a5805e3b7510c0fd Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 01 Jun, 2014 1 commit
-
-
Christophe Gisquet authored
The immediate value may be 0. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 28 May, 2014 1 commit
-
-
Christophe Gisquet authored
The xmm reg count was incorrect, and manual loading of the gprs furthermore allows to noticeable reduce the number needed. The modified functions are used in weighted prediction, so only a few samples like WP_* exhibit a change. For this one and Win64 (some widths removed because of too few occurrences): WP_A_Toshiba_3.bit, ff_hevc_put_hevc_uni_w 16 32 before: 2194 3872 after: 2119 3767 WP_B_Toshiba_3.bit, ff_hevc_put_hevc_bi_w 16 32 64 before: 2819 4960 9396 after: 2617 4788 9150 Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 18 May, 2014 1 commit
-
-
Christophe Gisquet authored
By default, macro EPEL_FILTER loads the coefficients inconditionally into m14/m15. This forces an unneeded higher register count. Reduce that count by making them parameters of EPEL_FILTER. Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 12 May, 2014 1 commit
-
-
Hendrik Leppkes authored
Fixes FATE on Windows. Reviewed-by:
"Ronald S. Bultje" <rsbultje@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-
- 06 May, 2014 1 commit
-
-
plepere authored
pretty print x86 Reviewed-by:
"Ronald S. Bultje" <rsbultje@gmail.com> Signed-off-by:
Michael Niedermayer <michaelni@gmx.at>
-