Commits · 4d1f7e8bc7516e6b7b15f754af4a665b3f8af79e · Linshizhi / ffmpeg.wasm-core

01 Mar, 2017 5 commits

build: Skip generating .version files when cleaning · 4d1f7e8b
Diego Biurrun authored Feb 28, 2017

4d1f7e8b
configure: Fix typo in objcc default setting · 58407b4d
Diego Biurrun authored Feb 28, 2017
```
Also drop stray duplicate OBJCC config.mak entry.
```
58407b4d

x86: hevc: Add missing colons after assembly labels · fde7ee87

Diego Biurrun authored Feb 28, 2017

This fixes several warnings of the sort
warning: label alone on a line without a colon might be in error

fde7ee87

build: Fine-grained link-time dependency settings · 7cb1d9e2

Diego Biurrun authored Jan 22, 2017

Previously, all link-time dependencies were added for all libraries,
resulting in bogus link-time dependencies since not all dependencies
are shared across libraries. Also, in some cases like libavutil, not
all dependencies were taken into account, resulting in some cases of
underlinking.

To address this, machinery is added for tracking which dependency
belongs to which library component. It is then used to determine
the correct dependencies for each individual library.

7cb1d9e2

configure: Simplify dlopen check · d154bdd3
Diego Biurrun authored Jan 24, 2017

d154bdd3

28 Feb, 2017 5 commits

h264_sei: Check actual presence of picture timing SEI message · d7b2bb53

Michael Niedermayer authored Feb 15, 2017

Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>

d7b2bb53

build: Explicitly disable external libraries when not explicitly enabled · 21cca00d

Diego Biurrun authored Feb 24, 2017

Leaving those variables in an undefined state allows them to be implicitly
enabled when they are declared as weak dependencies of other components.
In that case, the library check is not run and required linker flags are not
added, resulting in a failing build.

Fixes linking when enabling libfreetype without libfontconfig.

21cca00d

fate: Rename WMV8_DRM decoder tests to WMV3_DRM · e1a6d63c
Diego Biurrun authored Oct 18, 2012
```
The codec used in those files is WMV3/WMV9, not WMV2/WMV8.
```
e1a6d63c
rtsp: Lazily set up the pollfd array once · 79331df3
Luca Barbato authored Feb 20, 2017

79331df3

nvenc: Fix the preset mapping list · d8f36a6a

Ben Chang authored Feb 24, 2017

The map is a sparse array and does not need an empty element to terminate
it.

The empty element is stored after the last one inserted in the list,
overwriting whichever element was next with zeros.

Bug-Id: 1029
Signed-off-by: Luca Barbato <lu_zero@gentoo.org>

d8f36a6a
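
The overwrite described in the nvenc commit above follows from how C treats designated initializers: an initializer without a designator applies to the element after the one initialized just before it, and a later initializer overrides an earlier one for the same element. The following is a minimal sketch of that effect only; the preset names and table contents are hypothetical and are not the actual nvenc table.

```c
#include <assert.h>

/* Hypothetical preset identifiers; the real nvenc enum differs. */
enum { PRESET_DEFAULT, PRESET_SLOW, PRESET_FAST, PRESET_NB };

typedef struct PresetEntry {
    int id;      /* hypothetical payload */
    int flags;   /* hypothetical payload */
} PresetEntry;

/* Sparse table: entries are addressed by index, so no terminator is needed. */
static const PresetEntry good_map[PRESET_NB] = {
    [PRESET_DEFAULT] = { 10, 1 },
    [PRESET_FAST]    = { 30, 1 },
    [PRESET_SLOW]    = { 20, 1 },
};

/* Same table with a trailing "terminator". It lands at PRESET_SLOW + 1,
 * which is PRESET_FAST here, and replaces that entry with zeros. */
static const PresetEntry bad_map[PRESET_NB] = {
    [PRESET_DEFAULT] = { 10, 1 },
    [PRESET_FAST]    = { 30, 1 },
    [PRESET_SLOW]    = { 20, 1 },
    { 0 },
};

/* Look up the id stored for a preset in either table. */
static int preset_id(const PresetEntry *map, int preset)
{
    return map[preset].id;
}
```

Some compilers can warn about the overridden entry (for example GCC with `-Woverride-init`), which is one way to catch this class of mistake.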

27 Feb, 2017 7 commits
- fate: Make null comparison method more useful · 698ac8f9
  Diego Biurrun authored Oct 15, 2012
```
This allows dropping /dev/null as reference value when no output is generated.
```
  698ac8f9
- build: Drop DOC_ prefix from EXAMPLES-related variables · c483398b
  Diego Biurrun authored Feb 22, 2017
  
  c483398b
- rtsp: Lazily allocate the pollfd array · 5263f464
  Luca Barbato authored Feb 20, 2017
```
And use av_malloc_array.
```
  5263f464
- rtsp: Move the pollfd setup out of the for loop · b9b82151
  Luca Barbato authored Feb 19, 2017
  
  b9b82151
- rtsp: Factor out packet reading · 150e99d6
  Luca Barbato authored Feb 19, 2017
  
  150e99d6
- Use modern avconv syntax for codec selection in documentation and tests · 4141a5a2
  Diego Biurrun authored Oct 18, 2012
  
  4141a5a2
- fate: Use bitexact optimizations in the svq3-2 test · da8093f7
  Diego Biurrun authored Feb 25, 2017
```
This fixes the test with mmxext disabled because the current reference
frame hashes correspond to the non-bitexact mmxext optimizations.
```
  da8093f7
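
The "rtsp: Lazily allocate the pollfd array" entry above combines two ideas: allocate the array only on first use, and use an overflow-checked array allocator. The sketch below illustrates both under stated assumptions: `malloc_array` is a local stand-in written for this example (not libavutil's `av_malloc_array` itself), and the `Poller` context and its fields are hypothetical.

```c
#include <assert.h>
#include <stdint.h>
#include <stdlib.h>

/* Local stand-in for an overflow-checked array allocator:
 * fails instead of wrapping around when nmemb * size overflows. */
static void *malloc_array(size_t nmemb, size_t size)
{
    if (size && nmemb > SIZE_MAX / size)
        return NULL;
    return malloc(nmemb * size);
}

/* Hypothetical context that owns the lazily allocated array. */
typedef struct Poller {
    int    *fds;
    size_t  nb_fds;
    int     allocations;   /* counts how often we really allocated */
} Poller;

/* Allocate the array on first use only; later calls are no-ops.
 * Returns 0 on success, -1 on allocation failure. */
static int poller_ensure(Poller *p, size_t nb_fds)
{
    if (p->fds)
        return 0;
    p->fds = malloc_array(nb_fds, sizeof(*p->fds));
    if (!p->fds)
        return -1;
    p->nb_fds = nb_fds;
    p->allocations++;
    return 0;
}
```

Calling `poller_ensure` from inside a read loop is then cheap after the first iteration, which is the point of setting the array up lazily.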
25 Feb, 2017 4 commits

lavc: make sure not to return EAGAIN from codecs · 984736dd
Anton Khirnov authored Feb 14, 2017
```
This error is treated specially by the API.

CC: libav-stable@libav.org
```
984736dd

apetag: account for header size if present when returning the start position · 4cc02270

James Almer authored Feb 10, 2017

The size field in the header/footer accounts for the entire APE tag
structure except the 32 bytes from header, for compatibility with
APEv1.
Signed-off-by: James Almer <jamrial@gmail.com>

CC: libav-stable@libav.org
Signed-off-by: Anton Khirnov <anton@khirnov.net>

4cc02270
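
The size accounting in the apetag commit above can be sketched as follows. The 32-byte header size comes from the commit message; the helper names and the "offset one past the end of the tag" convention are assumptions made for this example and are not taken from the actual demuxer code.

```c
#include <assert.h>
#include <stdint.h>

#define APE_TAG_HEADER_BYTES 32   /* header size stated in the commit message */

/* Hypothetical helper: total bytes occupied by the tag. The size field
 * covers everything except the header, so add it when one is present. */
static int64_t ape_tag_total_size(uint32_t size_field, int has_header)
{
    int64_t total = size_field;
    if (has_header)
        total += APE_TAG_HEADER_BYTES;
    return total;
}

/* Hypothetical helper: start offset of the tag, given the offset
 * one past its end. */
static int64_t ape_tag_start(int64_t tag_end, uint32_t size_field, int has_header)
{
    return tag_end - ape_tag_total_size(size_field, has_header);
}
```

Without the header adjustment, the reported start position would point 32 bytes into the tag whenever a header is present.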

apetag: fix flag value to signal footer presence · 3f258f5e

James Almer authored Feb 10, 2017

According to the spec[1], a value of 0 means the footer is present and a value
of 1 means it's absent, the exact opposite of header presence flag where 1
means present and 0 absent.
The reason for this is compatibility with APEv1 tags, where there's no header,
footer presence was mandatory for all files, and the flags field was a zeroed
reserved field.

[1] http://wiki.hydrogenaud.io/index.php?title=Ape_Tags_Flags

Signed-off-by: James Almer <jamrial@gmail.com>

CC: libav-stable@libav.org
Signed-off-by: Anton Khirnov <anton@khirnov.net>

3f258f5e
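
The inverted polarity described in the apetag flag commit above is easy to get wrong, so here is a minimal sketch. The macro and function names are hypothetical, and the bit positions (31 for the header, 30 for the footer) are assumptions for illustration that should be checked against the linked spec.

```c
#include <assert.h>
#include <stdint.h>

/* Bit positions are assumptions for this example; verify against the spec. */
#define FLAG_CONTAINS_HEADER (1U << 31)   /* 1 = header present */
#define FLAG_LACKS_FOOTER    (1U << 30)   /* 1 = footer absent  */

/* Header presence: a set bit means the header is present. */
static int has_header(uint32_t flags)
{
    return (flags & FLAG_CONTAINS_HEADER) != 0;
}

/* Footer presence: the polarity is inverted, a clear bit means present. */
static int has_footer(uint32_t flags)
{
    return (flags & FLAG_LACKS_FOOTER) == 0;
}
```

With this polarity, a zeroed flags field (as in APEv1 tags) reads as "no header, footer present", which matches the compatibility reasoning in the commit message.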

svq3: fix the slice size check · b2788fe9

Anton Khirnov authored Feb 01, 2017

Currently it incorrectly compares bits with bytes.

Also, move the check right before where it's relevant, so that the
correct number of remaining bits is used.

CC: libav-stable@libav.org

b2788fe9
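
The unit mismatch named in the svq3 commit above can be illustrated with a small sketch. Both helpers are hypothetical and only show the difference between comparing bytes against bits and converting first; they are not the decoder's actual check.

```c
#include <assert.h>

/* Buggy variant: compares a byte count directly against a bit count,
 * so it accepts slices up to eight times too large. */
static int slice_fits_buggy(int slice_bytes, int bits_left)
{
    return slice_bytes <= bits_left;
}

/* Fixed variant: convert bytes to bits before comparing. The wider type
 * avoids overflow in the multiplication. */
static int slice_fits(int slice_bytes, int bits_left)
{
    return (long long)slice_bytes * 8 <= bits_left;
}
```

For a buffer with 800 bits left, a 101-byte slice passes the buggy check even though it needs 808 bits.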

24 Feb, 2017 3 commits
- asfdec: fix reading files larger than 2GB · cd7a2e15
  John Stebbins authored Feb 23, 2017
```
avio_skip returns file position and overflows int
```
  cd7a2e15
- h264dec: fix dropped initial SEI recovery point · 248dc5c1
  John Stebbins authored Feb 23, 2017
  
  248dc5c1
- fate: Add another SVQ3 test to increase coverage · 8e4d4efc
  Diego Biurrun authored Apr 06, 2013
  
  8e4d4efc
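
The asfdec entry above notes that a skip call returning a file position overflows `int` on files larger than 2GB. The sketch below shows the safer pattern of keeping the result in a 64-bit variable. `fake_skip` is a hypothetical stand-in for such a call, and the example assumes a platform where `int` is 32 bits wide.

```c
#include <assert.h>
#include <limits.h>
#include <stdint.h>

/* Hypothetical stand-in for a skip call that returns the new 64-bit
 * file position, or a negative error code. */
static int64_t fake_skip(int64_t pos, int64_t offset)
{
    return pos + offset;
}

/* Returns 1 if the position cannot be represented in an int. */
static int position_overflows_int(int64_t pos)
{
    return pos > INT_MAX || pos < INT_MIN;
}

/* Keep the result in int64_t; only narrow it when it is an error code,
 * which is assumed here to be a small negative value. */
static int skip_checked(int64_t *pos, int64_t offset)
{
    int64_t ret = fake_skip(*pos, offset);
    if (ret < 0)
        return (int)ret;
    *pos = ret;
    return 0;
}
```

Storing the same return value in an `int` would lose the upper bits once the position passes `INT_MAX`, which is the failure the commit title describes.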
23 Feb, 2017 10 commits

aarch64: vp9itxfm: Reorder iadst16 coeffs · b8f66c08

Martin Storsjö authored Dec 31, 2016

This matches the order they are in the 16 bpp version.

There they are in this order, to make sure we access them in the
same order they are declared, easing loading only half of the
coefficients at a time.

This makes the 8 bpp version match the 16 bpp version better.
Signed-off-by: Martin Storsjö <martin@martin.st>

b8f66c08

arm: vp9itxfm: Reorder iadst16 coeffs · 08074c09

Martin Storsjö authored Dec 31, 2016

This matches the order they are in the 16 bpp version.

There they are in this order, to make sure we access them in the
same order they are declared, easing loading only half of the
coefficients at a time.

This makes the 8 bpp version match the 16 bpp version better.
Signed-off-by: Martin Storsjö <martin@martin.st>

08074c09

aarch64: vp9itxfm: Reorder the idct coefficients for better pairing · 09eb88a1

Martin Storsjö authored Dec 31, 2016

All elements are used pairwise, except for the first one.
Previously, the 16th element was unused. Move the unused element
to the second slot, to make the later element pairs not split
across registers.

This simplifies loading only parts of the coefficients,
reducing the difference to the 16 bpp version.
Signed-off-by: Martin Storsjö <martin@martin.st>

09eb88a1

arm: vp9itxfm: Reorder the idct coefficients for better pairing · de06bdfe

Martin Storsjö authored Dec 31, 2016

All elements are used pairwise, except for the first one.
Previously, the 16th element was unused. Move the unused element
to the second slot, to make the later element pairs not split
across registers.

This simplifies loading only parts of the coefficients,
reducing the difference to the 16 bpp version.
Signed-off-by: Martin Storsjö <martin@martin.st>

de06bdfe

aarch64: vp9itxfm: Avoid reloading the idct32 coefficients · 65aa002d

Martin Storsjö authored Jan 02, 2017

The idct32x32 function actually pushed d8-d15 onto the stack even
though it didn't clobber them; there are plenty of registers that
can be used to allow keeping all the idct coefficients in registers
without having to reload different subsets of them at different
stages in the transform.

After this, we still can skip pushing d12-d15.

Before:
vp9_inv_dct_dct_32x32_sub32_add_neon: 8128.3
After:
vp9_inv_dct_dct_32x32_sub32_add_neon: 8053.3
Signed-off-by: Martin Storsjö <martin@martin.st>

65aa002d

arm: vp9itxfm: Avoid reloading the idct32 coefficients · 402546a1

Martin Storsjö authored Jan 02, 2017

The idct32x32 function actually pushed q4-q7 onto the stack even
though it didn't clobber them; there are plenty of registers that
can be used to allow keeping all the idct coefficients in registers
without having to reload different subsets of them at different
stages in the transform.

Since the idct16 core transform avoids clobbering q4-q7 (but clobbers
q2-q3 instead, to avoid needing to back up and restore q4-q7 at all
in the idct16 function), and the lanewise vmul needs a register in
the q0-q3 range, we move the stored coefficients from q2-q3 into q4-q5
while doing idct16.

While keeping these coefficients in registers, we still can skip pushing
q7.

Before:                              Cortex A7       A8       A9      A53
vp9_inv_dct_dct_32x32_sub32_add_neon:  18553.8  17182.7  14303.3  12089.7
After:
vp9_inv_dct_dct_32x32_sub32_add_neon:  18470.3  16717.7  14173.6  11860.8
Signed-off-by: Martin Storsjö <martin@martin.st>

402546a1

arm: vp9lpf: Implement the mix2_44 function with one single filter pass · 575e31e9

Martin Storsjö authored Jan 14, 2017

For this case, with 8 inputs but only changing 4 of them, we can fit
all 16 input pixels into a q register, and still have enough temporary
registers for doing the loop filter.

The wd=8 filters would require too many temporary registers for
processing all 16 pixels at once though.

Before:                          Cortex A7      A8     A9     A53
vp9_loop_filter_mix2_v_44_16_neon:   289.7   256.2  237.5   181.2
After:
vp9_loop_filter_mix2_v_44_16_neon:   221.2   150.5  177.7   138.0
Signed-off-by: Martin Storsjö <martin@martin.st>

575e31e9

aarch64: vp9lpf: Use dup+rev16+uzp1 instead of dup+lsr+dup+trn1 · 3bf9c483

Martin Storsjö authored Feb 23, 2017

This is one cycle faster in total, and three instructions fewer.

Before:
vp9_loop_filter_mix2_v_44_16_neon: 123.2
After:
vp9_loop_filter_mix2_v_44_16_neon: 122.2
Signed-off-by: Martin Storsjö <martin@martin.st>

3bf9c483

arm/aarch64: vp9lpf: Keep the comparison to E within 8 bit · c582cb85

Martin Storsjö authored Jan 14, 2017

The theoretical maximum value of E is 193, so we can just
saturate the addition to 255.

Before:                        Cortex A7      A8      A9     A53  A53/AArch64
vp9_loop_filter_v_4_8_neon:        143.0   127.7   114.8    88.0         87.7
vp9_loop_filter_v_8_8_neon:        241.0   197.2   173.7   140.0        136.7
vp9_loop_filter_v_16_8_neon:       497.0   419.5   379.7   293.0        275.7
vp9_loop_filter_v_16_16_neon:      965.2   818.7   731.4   579.0        452.0
After:
vp9_loop_filter_v_4_8_neon:        136.0   125.7   112.6    84.0         83.0
vp9_loop_filter_v_8_8_neon:        234.0   195.5   171.5   136.0        133.7
vp9_loop_filter_v_16_8_neon:       490.0   417.5   377.7   289.0        271.0
vp9_loop_filter_v_16_16_neon:      951.2   814.7   732.3   571.0        446.7
Signed-off-by: Martin Storsjö <martin@martin.st>

c582cb85
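
The reasoning in the vp9lpf commit above (E never exceeds 193, so the sum can saturate at 255) can be checked in scalar C. This sketch assumes the comparison has the form `2 * d0 + d1 / 2 <= E`, with `d0` and `d1` being 8-bit absolute pixel differences; the function names are hypothetical and the code is a scalar model, not the NEON implementation.

```c
#include <assert.h>
#include <stdint.h>

/* 8-bit saturating add, a scalar model of a SIMD saturating add. */
static uint8_t sat_add_u8(uint8_t a, uint8_t b)
{
    unsigned sum = (unsigned)a + b;
    return sum > 255 ? 255 : (uint8_t)sum;
}

/* Reference comparison computed in a wide type. */
static int mask_wide(uint8_t d0, uint8_t d1, uint8_t e)
{
    return 2 * (unsigned)d0 + (d1 >> 1) <= e;
}

/* Same comparison kept entirely within 8 bits using saturation. */
static int mask_sat8(uint8_t d0, uint8_t d1, uint8_t e)
{
    uint8_t twice = sat_add_u8(d0, d0);
    return sat_add_u8(twice, d1 >> 1) <= e;
}

/* Returns 1 if both variants agree for all inputs with e <= max_e. */
static int variants_agree(unsigned max_e)
{
    for (unsigned e = 0; e <= max_e; e++)
        for (unsigned d0 = 0; d0 < 256; d0++)
            for (unsigned d1 = 0; d1 < 256; d1++)
                if (mask_wide((uint8_t)d0, (uint8_t)d1, (uint8_t)e) !=
                    mask_sat8((uint8_t)d0, (uint8_t)d1, (uint8_t)e))
                    return 0;
    return 1;
}
```

The two variants only diverge when E reaches 255, because a saturated sum of 255 would then compare as "within the limit". Any bound below 255, including the stated maximum of 193, keeps the results identical.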

Place attribute_deprecated in the right position for struct declarations · ed6a891c

Diego Biurrun authored Feb 22, 2017

libavcodec/vaapi.h:58:1: warning: attribute 'deprecated' is ignored, place it after "struct" to apply attribute to type declaration [-Wignored-attributes]

ed6a891c

22 Feb, 2017 2 commits
- mkv: Update the seek test to match 5d3953a5 · 04d2afa9
  Luca Barbato authored Feb 22, 2017
  
  04d2afa9
- fate: Update fate-lavf-mkv after commit 5d3953a5 · fec3456c
  John Stebbins authored Feb 21, 2017
  
  fec3456c
21 Feb, 2017 4 commits

fate: Add webp alpha test · 156bc019
Mark Thompson authored Feb 17, 2017

156bc019

matroskaenc: factor ts_offset into block timecode computation · 5d3953a5

John Stebbins authored Feb 15, 2017

ts_offset was added to the cluster timecode, but then effectively subtracted
back off the block timecode.

When setting initial_padding for an audio stream, the timestamps are
written incorrectly to the mkv file.  cluster timecode gets written
as pts0 + ts_offset which is correct, but then block timecode gets
written as pts - cluster timecode which expanded is
pts - (pts0 + ts_offset).  Adding cluster and block tc back together:
cluster + block = (pts0 + ts_offset) + (pts - (pts0 + ts_offset)) = pts
But the result should be pts + ts_offset since demux will subtract the
CodecDelay element from pts and set initial_padding to CodecDelay.
This patch gives the correct result.

5d3953a5
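
The arithmetic in the matroskaenc commit message above can be worked through with concrete numbers. The sketch below follows the formulas in the message; the function names are hypothetical, and `block_tc_new` is one way to "factor ts_offset into the block timecode", not necessarily the exact expression used in the patch.

```c
#include <assert.h>
#include <stdint.h>

/* Cluster timecode as described: first pts of the cluster plus ts_offset. */
static int64_t cluster_tc(int64_t pts0, int64_t ts_offset)
{
    return pts0 + ts_offset;
}

/* Old behaviour: block timecode relative to the cluster, without ts_offset. */
static int64_t block_tc_old(int64_t pts, int64_t cluster)
{
    return pts - cluster;
}

/* Sketch of the fix: include ts_offset before subtracting the cluster. */
static int64_t block_tc_new(int64_t pts, int64_t ts_offset, int64_t cluster)
{
    return pts + ts_offset - cluster;
}
```

With `pts0 = 0`, `ts_offset = 7` and `pts = 40`, the old computation yields `cluster + block = 7 + 33 = 40`, which is just `pts`. The adjusted computation yields `7 + 40 = 47`, which is `pts + ts_offset`, the value the message says the demuxer expects.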

build: Move cli tool sources to a separate subdirectory · c95169f0
Diego Biurrun authored Jan 04, 2017
```
This unclutters the top-level directory and groups related files together.
```
c95169f0
build: Separate logic for building examples from that for building avtools · ab566cc9
Diego Biurrun authored Feb 14, 2017

ab566cc9