Commits · 3d4dd9195f6892b3e9d04dc543d6211095248d22 · Linshizhi / ffmpeg.wasm-core

28 Mar, 2017 1 commit

vp9: re-split the decoder/format/dsp interface header files. · f8c01994

The advantage here is that the internal software decoder interface is
not exposed to the DSP functions or the hardware accelerations.

f8c01994

27 Mar, 2017 1 commit
- lavc/vp9: split into vp9{block,data,mvs} · 1c9f4b50
  Clément Bœsch authored 7 years ago
```
This is following Libav layout to ease merges.
```
  1c9f4b50
24 Jan, 2017 2 commits

aarch64: Add NEON optimizations for 10 and 12 bit vp9 MC · 638eceed

Martin Storsjö authored 8 years ago

This work is sponsored by, and copyright, Google.

This has mostly got the same differences to the 8 bit version as
in the arm version. For the horizontal filters, we do 16 pixels
in parallel as well. For the 8 pixel wide vertical filters, we can
accumulate 4 rows before storing, just as in the 8 bit version.

Examples of runtimes vs the 32 bit version, on a Cortex A53:
                                           ARM   AArch64
vp9_avg4_10bpp_neon:                      35.7      30.7
vp9_avg8_10bpp_neon:                      93.5      84.7
vp9_avg16_10bpp_neon:                    324.4     296.6
vp9_avg32_10bpp_neon:                   1236.5    1148.2
vp9_avg64_10bpp_neon:                   4639.6    4571.1
vp9_avg_8tap_smooth_4h_10bpp_neon:       130.0     128.0
vp9_avg_8tap_smooth_4hv_10bpp_neon:      440.0     440.5
vp9_avg_8tap_smooth_4v_10bpp_neon:       114.0     105.5
vp9_avg_8tap_smooth_8h_10bpp_neon:       327.0     314.0
vp9_avg_8tap_smooth_8hv_10bpp_neon:      918.7     865.4
vp9_avg_8tap_smooth_8v_10bpp_neon:       330.0     300.2
vp9_avg_8tap_smooth_16h_10bpp_neon:     1187.5    1155.5
vp9_avg_8tap_smooth_16hv_10bpp_neon:    2663.1    2591.0
vp9_avg_8tap_smooth_16v_10bpp_neon:     1107.4    1078.3
vp9_avg_8tap_smooth_64h_10bpp_neon:    17754.6   17454.7
vp9_avg_8tap_smooth_64hv_10bpp_neon:   33285.2   33001.5
vp9_avg_8tap_smooth_64v_10bpp_neon:    16066.9   16048.6
vp9_put4_10bpp_neon:                      25.5      21.7
vp9_put8_10bpp_neon:                      56.0      52.0
vp9_put16_10bpp_neon/armv8:              183.0     163.1
vp9_put32_10bpp_neon/armv8:              678.6     563.1
vp9_put64_10bpp_neon/armv8:             2679.9    2195.8
vp9_put_8tap_smooth_4h_10bpp_neon:       120.0     118.0
vp9_put_8tap_smooth_4hv_10bpp_neon:      435.2     435.0
vp9_put_8tap_smooth_4v_10bpp_neon:       107.0      98.2
vp9_put_8tap_smooth_8h_10bpp_neon:       303.0     290.0
vp9_put_8tap_smooth_8hv_10bpp_neon:      893.7     828.7
vp9_put_8tap_smooth_8v_10bpp_neon:       305.5     263.5
vp9_put_8tap_smooth_16h_10bpp_neon:     1089.1    1059.2
vp9_put_8tap_smooth_16hv_10bpp_neon:    2578.8    2452.4
vp9_put_8tap_smooth_16v_10bpp_neon:     1009.5     933.5
vp9_put_8tap_smooth_64h_10bpp_neon:    16223.4   15918.6
vp9_put_8tap_smooth_64hv_10bpp_neon:   32153.0   31016.2
vp9_put_8tap_smooth_64v_10bpp_neon:    14516.5   13748.1

These are generally about as fast as the corresponding ARM
routines on the same CPU (at least on the A53), in most cases
marginally faster.

The speedup vs C code is around 4-9x.

Signed-off-by: Martin Storsjö <martin@martin.st>

638eceed

arm: Add NEON optimizations for 10 and 12 bit vp9 MC · a4d4bad7

Martin Storsjö authored 8 years ago

This work is sponsored by, and copyright, Google.

The plain pixel put/copy functions are used from the 8 bit version,
for the double size (e.g. put16 uses ff_vp9_copy32_neon), and a new
copy128 is added.
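
As an illustration of why the 8 bit copy routines can be reused at double size: a row of 16 pixels stored as 16 bit samples occupies 32 bytes, so a plain copy of 16 high bit depth pixels is the same operation as a 32 byte copy. The sketch below is scalar C, not the NEON assembly from this commit; `copy_bytes` and `put16_hbd` are hypothetical names, and strides are assumed to be given in bytes.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Byte-wise block copy, standing in for an 8 bit copy routine
 * (hypothetical helper, not an FFmpeg function). */
static void copy_bytes(uint8_t *dst, ptrdiff_t dst_stride,
                       const uint8_t *src, ptrdiff_t src_stride,
                       size_t width_bytes, int height)
{
    assert(height >= 0);
    for (int y = 0; y < height; y++) {
        memcpy(dst, src, width_bytes);
        dst += dst_stride;   /* strides are in bytes (assumption) */
        src += src_stride;
    }
}

/* Copy a block 16 pixels wide where each pixel is a 16 bit sample:
 * 16 pixels * 2 bytes = 32 bytes per row, i.e. a "copy32" in byte terms. */
static void put16_hbd(uint16_t *dst, ptrdiff_t dst_stride,
                      const uint16_t *src, ptrdiff_t src_stride,
                      int height)
{
    copy_bytes((uint8_t *)dst, dst_stride,
               (const uint8_t *)src, src_stride,
               16 * sizeof(uint16_t), height);
}
```

By the same arithmetic, a 64 pixel wide block needs a 128 byte copy, which has no 8 bit counterpart and would explain the new copy128.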

Compared with the 8 bit version, the filters can no longer use the
trick to accumulate in 16 bit with only saturation at the end, but now
the accumulators need to be 32 bit. This avoids the need to keep track
of which filter index is the largest though, reducing the size of the
executable code for these filters.
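
To make the accumulator width concrete, here is a minimal scalar sketch in C (not the NEON code from this commit). It assumes filter taps that sum to 128 with 7 bit rounding; the function name `filter_8tap` and the calling convention are hypothetical. With 12 bit input, a single product such as 78 * 4095 = 319410 already exceeds the signed 16 bit range, so the sum is kept in 32 bits and clipped only at the end.

```c
#include <assert.h>
#include <stdint.h>

/* Apply one 8-tap filter to eight consecutive high bit depth samples.
 * Assumptions: taps sum to 128 (7 bit fixed point), bit_depth is 10 or 12.
 * filter_8tap is an illustrative name, not an FFmpeg function. */
static uint16_t filter_8tap(const uint16_t src[8], const int16_t taps[8],
                            int bit_depth)
{
    int32_t max = (1 << bit_depth) - 1;
    int32_t sum = 64;                     /* rounding offset for >> 7 */

    assert(bit_depth > 8 && bit_depth <= 12);

    /* 32 bit accumulation: tap * pixel can reach about 128 * 4095. */
    for (int i = 0; i < 8; i++)
        sum += (int32_t)taps[i] * src[i];

    if (sum < 0)                          /* clip before shifting so no   */
        sum = 0;                          /* negative value is shifted    */
    sum >>= 7;
    if (sum > max)
        sum = max;
    return (uint16_t)sum;
}
```

For a flat block of 12 bit samples at 4095 and the illustrative taps `{-1, 6, -19, 78, 78, -19, 6, -1}`, the function returns 4095, even though the intermediate sum is far outside what a 16 bit accumulator can hold.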

For the horizontal filters, we only do 4 or 8 pixels wide in parallel
(while doing two rows at a time), since we don't have enough register
space to filter 16 pixels wide.

For the vertical filters, we still do 4 and 8 pixels in parallel just
as in the 8 bit case, but we need to store the output after every 2
rows instead of after every 4 rows.

Examples of relative speedup compared to the C version, from checkasm:
Cortex                                    A7      A8      A9     A53
vp9_avg4_10bpp_neon:                    2.25    2.44    3.05    2.16
vp9_avg8_10bpp_neon:                    3.66    8.48    3.86    3.50
vp9_avg16_10bpp_neon:                   3.39    8.26    3.37    2.72
vp9_avg32_10bpp_neon:                   4.03   10.20    4.07    3.42
vp9_avg64_10bpp_neon:                   4.15   10.01    4.13    3.70
vp9_avg_8tap_smooth_4h_10bpp_neon:      3.38    6.22    3.41    4.75
vp9_avg_8tap_smooth_4hv_10bpp_neon:     3.89    6.39    4.30    5.32
vp9_avg_8tap_smooth_4v_10bpp_neon:      5.32    9.73    6.34    7.31
vp9_avg_8tap_smooth_8h_10bpp_neon:      4.45    9.40    4.68    6.87
vp9_avg_8tap_smooth_8hv_10bpp_neon:     4.64    8.91    5.44    6.47
vp9_avg_8tap_smooth_8v_10bpp_neon:      6.44   13.42    8.68    8.79
vp9_avg_8tap_smooth_64h_10bpp_neon:     4.66    9.02    4.84    7.71
vp9_avg_8tap_smooth_64hv_10bpp_neon:    4.61    9.14    4.92    7.10
vp9_avg_8tap_smooth_64v_10bpp_neon:     6.90   14.13    9.57   10.41
vp9_put4_10bpp_neon:                    1.33    1.46    2.09    1.33
vp9_put8_10bpp_neon:                    1.57    3.42    1.83    1.84
vp9_put16_10bpp_neon:                   1.55    4.78    2.17    1.89
vp9_put32_10bpp_neon:                   2.06    5.35    2.14    2.30
vp9_put64_10bpp_neon:                   3.00    2.41    1.95    1.66
vp9_put_8tap_smooth_4h_10bpp_neon:      3.19    5.81    3.31    4.63
vp9_put_8tap_smooth_4hv_10bpp_neon:     3.86    6.22    4.32    5.21
vp9_put_8tap_smooth_4v_10bpp_neon:      5.40    9.77    6.08    7.21
vp9_put_8tap_smooth_8h_10bpp_neon:      4.22    8.41    4.46    6.63
vp9_put_8tap_smooth_8hv_10bpp_neon:     4.56    8.51    5.39    6.25
vp9_put_8tap_smooth_8v_10bpp_neon:      6.60   12.43    8.17    8.89
vp9_put_8tap_smooth_64h_10bpp_neon:     4.41    8.59    4.54    7.49
vp9_put_8tap_smooth_64hv_10bpp_neon:    4.43    8.58    5.34    6.63
vp9_put_8tap_smooth_64v_10bpp_neon:     7.26   13.92    9.27   10.92

For the larger 8tap filters, the speedup vs C code is around 4-14x.

Signed-off-by: Martin Storsjö <martin@martin.st>

a4d4bad7

05 May, 2013 1 commit
- Fix type of shared flac table ff_flac_blocksize_table[]. · a07ac1f7
  Carl Eugen Hoyos authored 11 years ago
```
Fixes ticket #2533.
```
  a07ac1f7
19 Mar, 2011 1 commit
- Replace FFmpeg with Libav in licence headers · 2912e87a
  Mans Rullgard authored 13 years ago
```
Signed-off-by: Mans Rullgard <mans@mansr.com>
```
  2912e87a
21 Mar, 2009 1 commit
- share sample rate and blocksize tables between the FLAC encoder and FLAC decoder · d4df4e50
  Justin Ruggles authored 15 years ago
```
Originally committed as revision 18089 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  d4df4e50
26 Feb, 2009 1 commit
- Share the function to write a raw FLAC header and use it in the Matroska muxer. · 2578326f
  Justin Ruggles authored 15 years ago
```
Originally committed as revision 17606 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  2578326f
17 Feb, 2009 1 commit
- use new metadata API in rm (de)muxer · 7379d5bc
  Aurelien Jacobs authored 15 years ago
```
Originally committed as revision 17396 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  7379d5bc
31 Aug, 2008 1 commit

Globally rename the header inclusion guard names. · 98790382

Stefano Sabatini authored 16 years ago

Consistently apply this rule: the guard name is obtained from the
filename by stripping the leading "lib", converting '/' and '.'  to
'_' and uppercasing the resulting name. Guard names in the root
directory have to be prefixed by "FFMPEG_".
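
The rule can be sketched as a small C helper. This is an illustration only: `guard_name` is a hypothetical function, not something from the FFmpeg tree, and it assumes that a path without a '/' denotes a header in the root directory.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Derive an inclusion guard name from a header path, following the rule
 * in the commit message above. Hypothetical helper for illustration. */
static void guard_name(const char *path, char *out, size_t out_size)
{
    /* Assumption: no '/' in the path means a root-directory header. */
    const char *prefix = strchr(path, '/') ? "" : "FFMPEG_";
    size_t n = 0;

    assert(out_size > 0);

    /* Strip the leading "lib", e.g. "libavcodec/..." -> "avcodec/...". */
    if (strncmp(path, "lib", 3) == 0)
        path += 3;

    for (; *prefix && n + 1 < out_size; prefix++)
        out[n++] = *prefix;

    /* Convert '/' and '.' to '_' and uppercase everything else. */
    for (; *path && n + 1 < out_size; path++) {
        unsigned char c = (unsigned char)*path;
        out[n++] = (c == '/' || c == '.') ? '_' : (char)toupper(c);
    }
    out[n] = '\0';
}
```

With this sketch, `libavcodec/g729.h` maps to `AVCODEC_G729_H`, and a root-directory header such as `cmdutils.h` maps to `FFMPEG_CMDUTILS_H`.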

Originally committed as revision 15120 to svn://svn.ffmpeg.org/ffmpeg/trunk

98790382

23 Aug, 2008 2 commits
- Remove unnecessary header inclusion from g729.h · 6bf8b3ef
  Vladimir Voroshilov authored 16 years ago
```
Originally committed as revision 14916 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  6bf8b3ef
- Move from g729.h all definitions which are used only in g729dec.c · fe3a80d6
  Vladimir Voroshilov authored 16 years ago
```
Originally committed as revision 14915 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  fe3a80d6
17 Aug, 2008 1 commit

G.729 decoder main code · 52098468

Vladimir Voroshilov authored 16 years ago

(just skeleton, contains only parts, explicitly ok'ed by Michael)

Originally committed as revision 14800 to svn://svn.ffmpeg.org/ffmpeg/trunk

52098468

30 Oct, 2007 1 commit
- Mark the source buffer as "const" · e76e2bbc
  Luca Abeni authored 17 years ago
```
Originally committed as revision 10877 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  e76e2bbc
17 Oct, 2007 1 commit
- Add FFMPEG_ prefix to all multiple inclusion guards. · 5b21bdab
  Diego Biurrun authored 17 years ago
```
Originally committed as revision 10765 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  5b21bdab
17 Jun, 2007 2 commits
- add a comment to indicate which #endif belong to which #define · efb77577
  Guillaume Poirier authored 17 years ago
```
Originally committed as revision 9356 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  efb77577
- add multiple inclusion guards to headers · 699b3f99
  Måns Rullgård authored 17 years ago
```
Originally committed as revision 9345 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  699b3f99
16 Jun, 2007 1 commit
- include all prerequisites in header files · 99545457
  Måns Rullgård authored 17 years ago
```
Originally committed as revision 9344 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  99545457
19 Mar, 2007 1 commit
- expose av_base64_decode and av_base64_encode · bd03c380
  Luca Barbato authored 17 years ago
```
Originally committed as revision 8448 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  bd03c380
28 Feb, 2007 1 commit
- Reverting stray commit part II, r8156 had the base64 export patch mixed with the nutdec patch · 558b86a5
  Luca Barbato authored 17 years ago
```
Originally committed as revision 8158 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  558b86a5
07 Oct, 2006 1 commit
- Change license headers to say 'FFmpeg' instead of 'this program/this library' and fix GPL/LGPL version mismatches. · b78e7197
  Diego Biurrun authored 18 years ago
```
Originally committed as revision 6577 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  b78e7197
12 Jan, 2006 1 commit
- Update licensing information: The FSF changed postal address. · 5509bffa
  Diego Biurrun authored 18 years ago
```
Originally committed as revision 4842 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  5509bffa
17 Dec, 2005 1 commit
- COSMETICS: Remove all trailing whitespace. · 115329f1
  Diego Biurrun authored 19 years ago
```
Originally committed as revision 4749 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  115329f1
25 Oct, 2003 1 commit
- * adding integer/floating point AAN implementations for DCT 2-4-8 · 48b1f800
  Roman Shaposhnik authored 21 years ago
```
Originally committed as revision 2430 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  48b1f800
23 Oct, 2003 1 commit
- optionally merge postscale into quantization table for the float aan dct · b4c3816c
  Michael Niedermayer authored 21 years ago
```
Originally committed as revision 2420 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  b4c3816c
22 Oct, 2003 1 commit
- floating point AAN DCT · 65e4c8c9
  Michael Niedermayer authored 21 years ago
```
Originally committed as revision 2415 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  65e4c8c9
03 Mar, 2003 1 commit
- MpegEncContext.(i)dct_* -> DspContext.(i)dct_* · b0368839
  Michael Niedermayer authored 21 years ago
```
bitexact cleanup

Originally committed as revision 1617 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  b0368839
11 Feb, 2003 1 commit
- * UINTX -> uintx_t INTX -> intx_t · 0c1a9eda
  Zdenek Kabelac authored 21 years ago
```
Originally committed as revision 1578 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  0c1a9eda
20 Nov, 2002 1 commit

* cut&paste fix · bb285683

Zdenek Kabelac authored 22 years ago

Originally committed as revision 1249 to svn://svn.ffmpeg.org/ffmpeg/trunk

bb285683

19 Nov, 2002 2 commits

* oops fixed bad initialization of ff vals. · 59402627

Zdenek Kabelac authored 22 years ago

  - put FF_LIBMPEG2_IDCT_PERM into CVS - so it will work for now

Originally committed as revision 1227 to svn://svn.ffmpeg.org/ffmpeg/trunk

59402627

* compilation fix (ARM users please check) · 83f238cb

Zdenek Kabelac authored 22 years ago

Originally committed as revision 1225 to svn://svn.ffmpeg.org/ffmpeg/trunk

83f238cb

25 Oct, 2002 1 commit
- idct_permutation_type variable, so the permutation type can quickly be identified · 50eb9cbc
  Michael Niedermayer authored 22 years ago
```
Originally committed as revision 1071 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  50eb9cbc
06 Oct, 2002 1 commit
- trying to fix the non-x86 IDCTs (untested) · 676e200c
  Michael Niedermayer authored 22 years ago
```
Originally committed as revision 1006 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  676e200c
25 May, 2002 1 commit
- license/copyright change · ff4ec49e
  Fabrice Bellard authored 22 years ago
```
Originally committed as revision 599 to svn://svn.ffmpeg.org/ffmpeg/trunk
```
  ff4ec49e
13 Aug, 2001 1 commit

arm specific code · 92651f67

Fabrice Bellard authored 23 years ago

Originally committed as revision 79 to svn://svn.ffmpeg.org/ffmpeg/trunk

92651f67