- 21 Sep, 2012 2 commits
-
-
Mans Rullgard authored
The Apple assembler refuses to assemble the 3-operand form in Thumb2 even though it is valid syntax. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
When building Thumb2 code, the end of a function, where the PIC offsets are placed, need not be aligned. Although the values are only accessed with instructions allowing unaligned addresses, keeping them aligned is preferable. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 20 Sep, 2012 1 commit
-
-
Mans Rullgard authored
This allows using a 16-bit opcode when generating Thumb2 code. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 15 Sep, 2012 1 commit
-
-
Mans Rullgard authored
-
- 12 Sep, 2012 1 commit
-
-
Tim Walker authored
Can be used by DTS-HD, TrueHD and E-AC-3, among others. Signed-off-by:
Justin Ruggles <justin.ruggles@gmail.com>
-
- 09 Sep, 2012 1 commit
-
-
Mans Rullgard authored
Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 08 Sep, 2012 1 commit
-
-
Diego Biurrun authored
This separates code relying on inline from that relying on external assembly and fixes instances where the coalesced check was incorrect.
-
- 07 Sep, 2012 1 commit
-
-
Justin Ruggles authored
The SWAP macro does not work for explicit xmm/ymm usage, so instead just move the scalar value from xmm2 to xmm0.
-
- 05 Sep, 2012 2 commits
-
-
Michael Niedermayer authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Ronald S. Bultje authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 04 Sep, 2012 6 commits
-
-
Martin Storsjö authored
Also mention this change in APIchanges. Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Luca Barbato authored
-
- 03 Sep, 2012 1 commit
-
-
Diego Biurrun authored
-
- 30 Aug, 2012 2 commits
-
-
Diego Biurrun authored
-
Diego Biurrun authored
-
- 26 Aug, 2012 1 commit
-
-
Loren Merritt authored
13% faster on penryn, 16% on sandybridge, 15% on bulldozer Not simd; a compiler should have generated this, but gcc didn't.
-
- 24 Aug, 2012 2 commits
-
-
Anton Khirnov authored
Double does not have enough precision to represent all int64 numbers exactly.
-
Justin Ruggles authored
-
- 22 Aug, 2012 1 commit
-
-
Mans Rullgard authored
GCC 4.3 and later do the right thing with the plain C code. Earlier versions in 32-bit mode generate one extra instruction, needlessly zeroing what would be the high half of the shifted value. At least two gcc configurations miscompile the inline asm in some situations. In 64-bit mode, all gcc versions generate imul r64, r64 followed by shr. On Intel i7 and later, this imul is faster 32-bit mul. On older Intel and all AMD, it is slightly slower. On Atom it is much slower. Considering where the FASTDIV macro is used, any overall negative performance impact of this change should be negligible. If anyone cares, they should file a bug against gcc and get the instruction selection fixed. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 21 Aug, 2012 1 commit
-
-
Diego Biurrun authored
There is no point in having the user disable any fastdiv macros. Besides the condition implementation was broken and only disabled the C implementation, but no platform specific assembly versions.
-
- 16 Aug, 2012 1 commit
-
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 15 Aug, 2012 2 commits
-
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
Martin Storsjö authored
Signed-off-by:
Martin Storsjö <martin@martin.st>
-
- 13 Aug, 2012 2 commits
-
-
Mans Rullgard authored
This avoids having the compiler redundantly mask the values to the smaller size. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
Fixed-point audio codecs often use saturating arithmetic, and special instructions for these operations are common. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 10 Aug, 2012 3 commits
-
-
Mans Rullgard authored
This makes struct AVDictionary fully opaque now that nothing needs to access it directly any more. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
This adds a function to retrieve the number of entries in a dictionary and updates the places directly accessing what should be an opaque struct to use this new function instead. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 09 Aug, 2012 4 commits
-
-
Mans Rullgard authored
Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
The only compiler I have that does not define the standard offsetof() macro is "Bruce's C Compiler", a simple compiler for producing 8/16-bit 8086 code, usually for use in early stages of PC booting. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
This list is incomplete (we also use UINT16_MAX), so there does not appear to be any system we care about that needs these. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Diego Biurrun authored
-
- 08 Aug, 2012 3 commits
-
-
Mans Rullgard authored
This macro is only used in two places, both in libavcodec, so this is a more sensible place for it. Two small tweaks to the macro are made: - removing the trailing semicolon - dropping unnecessary 'volatile' from the x86 asm Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
These x86-specific macros do not belong in generic code. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
Mans Rullgard authored
This puts x86-specific things in the x86/ subdirectory where they belong. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-
- 07 Aug, 2012 1 commit
-
-
Mans Rullgard authored
Some compilers do not support the Q/R modifiers used to access the low/high parts of a 64-bit register pair. Check for this and disable all uses of it when not supported. Fixes bug #337. Signed-off-by:
Mans Rullgard <mans@mansr.com>
-