- 03 Jan, 2019 1 commit
-
-
Manoj Gupta authored
Some of the assembly functions e.g. ff_h264_idct_dc_add_neon has code like: movrel x14, X(ff_h264_idct_add_neon) Linker cannot resolve them fully at link time and emits dynamic relocations. Use explicit labels instead so that no dynamic relocations are needed at all. This avoids lld complains about text relocations. For background, see https://crbug.com/917919Signed-off-by: Manoj Gupta <manojgupta@chromium.org> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
-
- 08 Dec, 2016 1 commit
-
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st> (cherry picked from commit 6a62795d) Cherry pick Suggested-by: Martin Storsjö This should fix the build failure on macosx Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
-
- 10 Nov, 2016 1 commit
-
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 10 Oct, 2016 1 commit
-
-
Martin Storsjö authored
Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 26 Mar, 2016 1 commit
-
-
Martin Storsjö authored
Previously, ff_h264_idct_add_neon (originally in the arm version) used a non-regular transpose in order to be able to use more instructions that deal with registers as 128 bit register pairs. The aarch64 translation doesn't do it to the same extent, but brought along the same structure since it was a straight translation. This reshuffles ff_h264_idct_add_neon, bringing it closer to the C implementation, making the transpose_4x4H macro do a regular transpose, usable for other algorithms as well. Previously, the third and fourth output from transpose_4x4H were swapped, and prior to cc29d96d, the same inputs as well. In addition to just swapping the outputs, also renumber the intermediate registers for better readability (making the register order match transpose_4x8B). This runs with the same number of cycles as before. Signed-off-by: Martin Storsjö <martin@martin.st>
-
- 21 Dec, 2015 1 commit
-
-
Janne Grunau authored
Fix related register order issue in ff_h264_idct_add_neon. Found-by: zjh8890 <243186085@qq.com>
-
- 19 Dec, 2015 1 commit
-
-
Janne Grunau authored
Fix related register order issue in ff_h264_idct_add_neon. Found-by: zjh8890 <243186085@qq.com> Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
-
- 20 Feb, 2014 1 commit
-
-
Janne Grunau authored
Based on e3fec3f0 for arm.
-
- 15 Jan, 2014 1 commit
-
-
Janne Grunau authored
Ported from ARMv7 NEON.
-