Commits · 87552d54d3337c3241e8a9e1a05df16eaa821496 · Linshizhi / ffmpeg.wasm-core

17 Jul, 2014 7 commits

armv6: Accelerate ff_fft_calc for general case (nbits != 4) · 87552d54

Ben Avison authored Jul 16, 2014

The previous implementation targeted DTS Coherent Acoustics, which only
requires nbits == 4 (fft16()). This case was (and still is) linked directly
rather than being indirected through ff_fft_calc_vfp(), but now the full
range from radix-4 up to radix-65536 is available. This benefits other codecs
such as AAC and AC3.

The implementation is based upon the C version, with each routine larger than
radix-16 calling a hierarchy of smaller FFT functions, then performing a
post-processing pass. This pass benefits a lot from loop unrolling to
counter the long pipelines in the VFP. A relaxed calling standard also
reduces the overhead of the call hierarchy, and avoiding the excessive
inlining performed by GCC probably helps with I-cache utilisation too.
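
As a rough illustration of the structure described above (larger transforms calling smaller ones, then a combining pass), here is a minimal C sketch. It is a stand-in under stated assumptions: it uses a plain radix-2 decomposition capped at 8 points with a fixed twiddle table, and the names `cplx`, `twiddle8`, `demo_pass` and `fft_sketch` are hypothetical. It is not the VFP assembly, and it does not reproduce the decomposition used by the C version.

```c
#include <assert.h>
#include <stddef.h>

typedef struct { double re, im; } cplx;   /* illustrative type, not from the source */

#define MAX_N 8

/* Precomputed twiddles exp(-2*pi*i*k/8) for k = 0..3 (assumed table layout). */
static const cplx twiddle8[MAX_N / 2] = {
    {  1.0,                  0.0                 },
    {  0.70710678118654752, -0.70710678118654752 },
    {  0.0,                 -1.0                 },
    { -0.70710678118654752, -0.70710678118654752 },
};

/* Post-processing pass: combine two half-size transforms into one of size n. */
static void demo_pass(cplx *z, const cplx *even, const cplx *odd, size_t n)
{
    size_t stride = MAX_N / n;            /* smaller n skips table entries */
    for (size_t k = 0; k < n / 2; k++) {
        cplx w = twiddle8[k * stride];
        cplx t = { w.re * odd[k].re - w.im * odd[k].im,
                   w.re * odd[k].im + w.im * odd[k].re };
        z[k].re         = even[k].re + t.re;
        z[k].im         = even[k].im + t.im;
        z[k + n / 2].re = even[k].re - t.re;
        z[k + n / 2].im = even[k].im - t.im;
    }
}

/* In-place forward transform for n in {1, 2, 4, 8}. */
void fft_sketch(cplx *z, size_t n)
{
    assert(n >= 1 && n <= MAX_N && (n & (n - 1)) == 0);
    if (n == 1)
        return;

    cplx even[MAX_N / 2], odd[MAX_N / 2];
    for (size_t k = 0; k < n / 2; k++) {
        even[k] = z[2 * k];
        odd[k]  = z[2 * k + 1];
    }
    fft_sketch(even, n / 2);              /* hierarchy of smaller transforms */
    fft_sketch(odd,  n / 2);
    demo_pass(z, even, odd, n);           /* then the combining pass */
}
```

The loop in `demo_pass` is the part the message says benefits from unrolling; the sketch leaves it rolled for readability.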

I benchmarked the result by measuring the number of gperftools samples that
hit anywhere in the AAC decoder (starting from aac_decode_frame()) or
specifically in the FFT routines (fft4() to fft512() and pass()) for the
same sample AAC stream:

| | Before (Mean) | Before (StdDev) | After (Mean) | After (StdDev) | Confidence | Change |
|---|---|---|---|---|---|---|
| Audio decode | 2245.5 | 53.1 | 1599.6 | 43.8 | 100.0% | +40.4% |
| FFT routines | 940.6 | 22.0 | 348.1 | 20.8 | 100.0% | +170.2% |

Signed-off-by: Martin Storsjö <martin@martin.st>

87552d54
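
The Change column in the benchmark above appears consistent with `before / after - 1` applied to the mean sample counts. That is an inference from the figures, not something the commit message states, and the function name below is hypothetical. A minimal C sketch of that arithmetic:

```c
#include <assert.h>

/* Relative speed-up in percent, ASSUMING Change = before/after - 1. */
double change_percent(double before_mean, double after_mean)
{
    assert(after_mean > 0.0);
    return (before_mean / after_mean - 1.0) * 100.0;
}
```

Under that assumption, `change_percent(2245.5, 1599.6)` comes out at roughly 40.4 and `change_percent(940.6, 348.1)` at roughly 170.2, matching the two rows.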

armv6: Accelerate ff_imdct_half for general case (mdct_bits != 6) · 5c22e8e4

Ben Avison authored Jul 10, 2014

The previous implementation targeted DTS Coherent Acoustics, which only
requires mdct_bits == 6. This relatively small size lent itself to
unrolling the loops a small number of times, and encoding offsets
calculated at assembly time within the load/store instructions of each
iteration.

In the more general case (codecs such as AAC and AC3) much larger arrays
are used - mdct_bits == [8, 9, 11]. The old method does not scale for
these cases, so more integer registers are used with non-unrolled versions
of the loops (and with some stack spillage). The postrotation filter loop
is still unrolled by a factor of 2 to permit the double-buffering of some
VFP registers to facilitate overlap of neighbouring iterations.

I benchmarked the result by measuring the number of gperftools samples
that hit anywhere in the AAC decoder (starting from aac_decode_frame())
or specifically in ff_imdct_half_c / ff_imdct_half_vfp, for the same
example AAC stream:

| | Before (Mean) | Before (StdDev) | After (Mean) | After (StdDev) | Confidence | Change |
|---|---|---|---|---|---|---|
| aac_decode_frame | 2368.1 | 35.8 | 2117.2 | 35.3 | 100.0% | +11.8% |
| ff_imdct_half_* | 457.5 | 22.4 | 251.2 | 16.2 | 100.0% | +82.1% |

Signed-off-by: Martin Storsjö <martin@martin.st>

5c22e8e4

dsputil: Split motion estimation compare bits off into their own context · 2d604443
Diego Biurrun authored Feb 08, 2014

2d604443
configure: Assume runtime cpu detection on arm on --target-os=android as well · a578b040
Martin Storsjö authored Jul 16, 2014
```
Signed-off-by: Martin Storsjö <martin@martin.st>
```
a578b040
x86: dsputil: Coalesce all init files · c23ce454
Diego Biurrun authored Jan 04, 2014
```
This makes the init files match the structure of the dsputil split.
```
c23ce454
avpacket: Check for and return errors in ff_interleave_add_packet() · 324ff594
Nidhi Makhijani authored Jul 14, 2014
```
Signed-off-by: Diego Biurrun <diego@biurrun.de>
```
324ff594

h264: K&R formatting cosmetics · 2db953f8

Luca Barbato authored Jul 16, 2014

Signed-off-by: Diego Biurrun <diego@biurrun.de>
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>

2db953f8

16 Jul, 2014 3 commits
- h264: Remove some commented-out, broken cruft · a11ef610
  Diego Biurrun authored Jul 16, 2014
  
  a11ef610
- arm: dsputil: Coalesce all init files · adff0a81
  Diego Biurrun authored Jul 08, 2014
  
  adff0a81
- g2meet: allow size changes within original sizes · 14b4e64e
  Vittorio Giovara authored Jul 15, 2014
  
  14b4e64e
14 Jul, 2014 1 commit

fate: Use the correct, local path to samples for opus reference files · f9900822

Martin Storsjö authored Jul 12, 2014

This fixes running fate in configs where the samples are located
in a different path on the target.

Signed-off-by: Martin Storsjö <martin@martin.st>

f9900822

13 Jul, 2014 2 commits
- x86: dsputil: Avoid pointless CONFIG_ENCODERS indirection · acf91215
  Diego Biurrun authored Jul 08, 2014
```
The remaining dsputil bits are encoding-specific anyway.
```
  acf91215
- ppc: dsputil: Coalesce all init files · a8552ee3
  Diego Biurrun authored Jul 08, 2014
  
  a8552ee3
11 Jul, 2014 8 commits
- examples/output: Remove unused variable · 6cc1409b
  Diego Biurrun authored Jul 11, 2014
```
doc/examples/output.c:460:9: warning: unused variable ‘i’
```
  6cc1409b
- dsputil: Drop unused bit_depth parameter from all init functions · 11733202
  Diego Biurrun authored Jul 08, 2014
  
  11733202
- mov: Clarify tkhd flag settings · df2aa222
  Luca Barbato authored Jul 04, 2014
  
  df2aa222
- mov: Do not group tracks if more than one is enabled per type · f9072969
  Luca Barbato authored Jul 04, 2014
```
The specification requires at most 1 track enabled per alternate group.
```
  f9072969
- hevc: implement pic_output_flag handling · 458e7c94
  Gildas Cocherel authored Jul 04, 2014
```
Sample-Id: OPFLAG_B_Qualcomm_1.bit, OPFLAG_C_Qualcomm_1.bit
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
Signed-off-by: Anton Khirnov <anton@khirnov.net>
```
  458e7c94
- hevc: set the keyframe flag on output frames · f43789b7
  Mickaël Raulet authored Jul 04, 2014
```
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
Signed-off-by: Anton Khirnov <anton@khirnov.net>
```
  f43789b7
- hevc: Replace nal type check with equivalent IS_IRAP macro · 1493b237
  Mickaël Raulet authored Jul 04, 2014
```
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
Signed-off-by: Anton Khirnov <anton@khirnov.net>
```
  1493b237
- hevc_ps: remove a write-only variable · 17e9d52c
  Anton Khirnov authored Jul 11, 2014
  
  17e9d52c
10 Jul, 2014 2 commits

cdg: Forward error from avio_size() in read_header() function · 44386aaa
Nidhi Makhijani authored Jul 10, 2014
```
Signed-off-by: Diego Biurrun <diego@biurrun.de>
```
44386aaa
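
The commit above forwards an error from a size query instead of ignoring it. A minimal sketch of that pattern in C follows. The stub `demo_size()`, the error code `DEMO_ERR_IO` and the function `demo_read_header()` are hypothetical stand-ins, not the actual cdg demuxer code:

```c
#include <assert.h>
#include <stdint.h>

#define DEMO_ERR_IO (-5)                  /* hypothetical negative error code */

/* Hypothetical stand-in for a size query that can fail. */
static int64_t demo_size(int fail)
{
    return fail ? DEMO_ERR_IO : 4096;
}

/* Returns 0 on success, or the negative error from the size query. */
int demo_read_header(int fail, int64_t *size_out)
{
    int64_t size = demo_size(fail);
    if (size < 0)
        return (int)size;                 /* forward the error unchanged */

    assert(size_out != 0);
    *size_out = size;
    return 0;
}
```

The point of the pattern is that the caller sees the original failure code rather than a later, less specific error caused by using a negative size.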

mpegts: pass MpegTSContext ptr explicitly · 5adcef9c

Alexander V. Lukyanov authored Jul 08, 2014

AVFormatContext->priv_data is not always a MpegTSContext, it can be
RTSPState when decoding a RTP stream. So it is necessary to pass
MpegTSContext pointer explicitly.

Within libav, the write_section_data function doesn't actually use
the MpegTSContext at all, so this doesn't change anything at the
moment (no memory was corrupted before), but it reduces the risk of
anybody trying to touch the MpegTSContext via AVFormatContext->priv_data
in the future.

Signed-off-by: Martin Storsjö <martin@martin.st>

5adcef9c
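
To illustrate why passing the context explicitly is safer than reaching through `priv_data`, here is a small C sketch. All types and names (`DemoFormatContext`, `DemoRTSPState`, `DemoTSContext`, `demo_handle_section`) are hypothetical simplifications, not the libav structures:

```c
#include <assert.h>
#include <stddef.h>

typedef struct DemoTSContext     { int sections_seen; }            DemoTSContext;
typedef struct DemoRTSPState     { int dummy; DemoTSContext *ts; } DemoRTSPState;
typedef struct DemoFormatContext { void *priv_data; }              DemoFormatContext;

/*
 * Takes the TS context as an explicit argument. Casting fmt->priv_data to
 * DemoTSContext* here would be wrong whenever priv_data actually points to
 * a DemoRTSPState, as in the RTP case the commit message describes.
 */
void demo_handle_section(DemoFormatContext *fmt, DemoTSContext *ts)
{
    assert(fmt != NULL && ts != NULL);
    (void)fmt;                            /* not used to find the TS context */
    ts->sections_seen++;
}
```

A caller in the RTP-style setup would pass `rtsp.ts` directly, so the function works regardless of what `priv_data` holds.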

09 Jul, 2014 16 commits
- dsputil: Split off pixel block routines into their own context · f46bb608
  Diego Biurrun authored Feb 03, 2014
  
  f46bb608
- hevc: parse display orientation SEI message · 0569a7e0
  Vittorio Giovara authored Jul 02, 2014
```
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
```
  0569a7e0
- h264: parse display orientation SEI message · 18e3d61e
  Vittorio Giovara authored Jul 02, 2014
```
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
```
  18e3d61e
- display: add matrix flip api · a54f03bf
  Vittorio Giovara authored Jun 18, 2014
  
  a54f03bf
- doc: mention option to mix shared/static libraries · 33a7b453
  Andrew Kelley authored Jul 03, 2014
```
Signed-off-by: Vittorio Giovara <vittorio.giovara@gmail.com>
```
  33a7b453
- rtpdec: pass an AVFormatContext to ff_parse_fmtp() · 0307cc22
  Anton Khirnov authored Jul 05, 2014
```
Use it for logging, instead of NULL or the stream codec context.
```
  0307cc22
- yuv4mpegenc: do not access AVCodecContext.coded_frame · 650d3840
  Anton Khirnov authored Jul 05, 2014
```
Its contents are meaningful only if the stream codec context is the one
actually used for encoding, which is often not the case (and is
discouraged).

Use AVCodecContext.field_order instead.
```
  650d3840
- nsvdec: remove commented out cruft · 27c1f82f
  Anton Khirnov authored Jun 19, 2014
  
  27c1f82f
- mov: free the dv demux context with avformat_free_context() · edb1af7c
  Anton Khirnov authored Jun 19, 2014
  
  edb1af7c
- mtv: do not set sample_rate for video · a14b6165
  Anton Khirnov authored Jun 15, 2014
  
  a14b6165
- oggparsecelt: do not set AVCodecContext.frame_size · b8604a97
  Anton Khirnov authored Jun 01, 2014
```
It is supposed to be set by decoders only.
```
  b8604a97
- adxdec: get rid of an avpriv function · d5cf5afa
  Anton Khirnov authored Jul 03, 2014
```
The only thing the demuxer needs is the sample rate to set the timebase,
which can be simply read with AV_RB32.
```
  d5cf5afa
- lavc: export DV profile API used by muxer/demuxer as public · f6ee61fb
  Anton Khirnov authored Jul 05, 2014
  
  f6ee61fb
- avconv: set the output stream timebase · 3f3232a3
  Anton Khirnov authored Jul 09, 2014
```
This is required by the new API.
```
  3f3232a3
- avformat: update muxing doxy · c9c1265c
  Anton Khirnov authored Jul 09, 2014
```
The callers should now set the stream timebase, not the codec one.
```
  c9c1265c
- cdg: set the keyframe flag on the first packet · abda15a9
  Anton Khirnov authored Jul 04, 2014
```
Bug-Id: 55
```
  abda15a9
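
The adxdec entry in the list above notes that the sample rate can simply be read with AV_RB32, that is, as a big-endian 32-bit value. A portable C sketch of such a read is shown below; `demo_rb32` is a hypothetical stand-in for illustration, not the libavutil macro itself, and the byte buffer in the usage note is made up:

```c
#include <assert.h>
#include <stdint.h>

/* Read an unsigned 32-bit big-endian value from four bytes. */
uint32_t demo_rb32(const uint8_t *p)
{
    assert(p != 0);
    return ((uint32_t)p[0] << 24) |
           ((uint32_t)p[1] << 16) |
           ((uint32_t)p[2] <<  8) |
            (uint32_t)p[3];
}
```

For example, the bytes `00 00 AC 44` decode to 44100 regardless of the host's byte order.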
08 Jul, 2014 1 commit
- mov: Remove a variable that is set but never used · 18fb38fb
  Martin Storsjö authored Jul 07, 2014
```
This silences a warning with gcc.
Signed-off-by: Martin Storsjö <martin@martin.st>
```
  18fb38fb