Commits · 7be8f7ac8143f4ae144c4951ddc5d42d466a9e23 · Linshizhi / ffmpeg.wasm-core

11 Apr, 2019 3 commits

avcodec/agm: add support for non-dct coding · 7be8f7ac
Paul B Mahol authored Apr 09, 2019

7be8f7ac
avcodec/agm: add support for higher compression · 0f283559
Paul B Mahol authored Mar 29, 2019

0f283559

swscale/ppc: VSX-optimize non-full-chroma yuv2rgb_2 · ce92ee4b

Lauri Kasanen authored Apr 05, 2019

./ffmpeg -f lavfi -i yuvtestsrc=duration=1:size=1200x1440 -sws_flags fast_bilinear \
        -s 1200x720 -f null -vframes 100 -pix_fmt $i -nostats \
        -cpuflags 0 -v error -

32-bit mul, power8 only.

~2x speedup:

rgb24
  24431 UNITS in yuv2packed2,   16384 runs,      0 skips
  13783 UNITS in yuv2packed2,   16383 runs,      1 skips
bgr24
  24396 UNITS in yuv2packed2,   16384 runs,      0 skips
  14059 UNITS in yuv2packed2,   16384 runs,      0 skips
rgba
  26815 UNITS in yuv2packed2,   16383 runs,      1 skips
  12797 UNITS in yuv2packed2,   16383 runs,      1 skips
bgra
  27060 UNITS in yuv2packed2,   16384 runs,      0 skips
  13138 UNITS in yuv2packed2,   16384 runs,      0 skips
argb
  26998 UNITS in yuv2packed2,   16384 runs,      0 skips
  12728 UNITS in yuv2packed2,   16381 runs,      3 skips
bgra
  26651 UNITS in yuv2packed2,   16384 runs,      0 skips
  13124 UNITS in yuv2packed2,   16384 runs,      0 skips

This is a low speedup, but the x86 mmx version also gets only ~2x. The mmx version
is also heavily inaccurate, while the vsx version has high accuracy.

ce92ee4b

10 Apr, 2019 5 commits
- avcodec/pnm_parser: Factor out next/index compensation · 3fe37033
  Michael Niedermayer authored Apr 06, 2019
```
Reviewed-by: Paul B Mahol <onemda@gmail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  3fe37033
- avcodec/pnm_parser: Factor next initialization out · 1d43d72b
  Michael Niedermayer authored Apr 06, 2019
```
Reviewed-by: Paul B Mahol <onemda@gmail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  1d43d72b
- avcodec/pnm_parser: Support concatenated ASCII images · 7f3d39b2
  Michael Niedermayer authored Apr 06, 2019
```
Fixes: Timeout (8sec -> 0.1sec)
Fixes: 13864/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_PAM_fuzzer-5737860621139968

Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpegSigned-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  7f3d39b2
- avdevice/opengl_enc: fix build error using msvc compiler · 2be7c388
  Don C. Bigler authored Apr 09, 2019
  
  2be7c388
- libavformat/dashenc : Prevent writing manifest files multiple times · 951561b6
  joepadmiraal authored Apr 08, 2019
  
  951561b6
09 Apr, 2019 5 commits

aarch64/opusdsp: implement NEON accelerated postfilter and deemphasis · 4d2f6215

Lynne authored Mar 15, 2019

153372 UNITS in postfilter_c,   65536 runs,      0 skips
73164 UNITS in postfilter_neon,   65536 runs,      0 skips -> 2.1x speedup

80591 UNITS in deemphasis_c,  131072 runs,      0 skips
43969 UNITS in deemphasis_neon,  131072 runs,      0 skips -> 1.83x speedup

Total decoder speedup: ~15% on a Raspberry Pi 3 (from 28.1x to 33.5x realtime)

Deemphasis SIMD based on the following unrolling:
const float c1 = CELT_EMPH_COEFF, c2 = c1*c1, c3 = c2*c1, c4 = c3*c1;
float state = coeff;

for (int i = 0; i < len; i += 4) {
    y[0] = x[0] + c1*state;
    y[1] = x[1] + c2*state + c1*x[0];
    y[2] = x[2] + c3*state + c1*x[1] + c2*x[0];
    y[3] = x[3] + c4*state + c1*x[2] + c2*x[1] + c3*x[0];

    state = y[3];
    y += 4;
    x += 4;
}

Unlike the x86 version, duplication is used instead of pslldq so
the structure and tables are different.

4d2f6215

libavutil/hwcontext_opencl: Fix channel order in format support check · 1c50d61a

Jarek Samic authored Apr 08, 2019

The `opencl_get_plane_format` function was incorrectly determining the
value used to set the image channel order. This resulted in all RGB
pixel formats being set to the `CL_RGBA` pixel format, regardless of
whether or not they actually *were* RGBA.

This patch fixes the issue by using the `offset` and depth of components
rather than the loop index to determine the value of `order`.
Signed-off-by: Jarek Samic <cldfire3@gmail.com>
Signed-off-by: Mark Thompson <sw@jkqxz.net>

1c50d61a

avformat/matroskaenc: fix leak on error · 1ec777dc
Tristan Matthews authored Apr 04, 2019
```
Signed-off-by: James Almer <jamrial@gmail.com>
```
1ec777dc
lavf/movenc: Pass correct pointer to av_log(). · d6a83922
Carl Eugen Hoyos authored Apr 07, 2019

d6a83922

lavf/matroskaenc: Fix memory leak after write trailer · 0a347ff4

Jun Zhao authored Apr 04, 2019

Fix memory leak after write trailer for #7827, only store a audio
packet whose buffer has size greater than zero in cur_audio_pkt.

Audio packets with size zero, but with side-data currently lead to
memleaks, in the Matroska muxer, because they are not properly freed:

They are currently put into an AVPacket in the MatroskaMuxContext to
ensure that the necessary audio is always available for a new cluster,
but are only written and freed when their size is > 0.

As the only use we have for such packets consists in updating the
CodecPrivate it makes no sense to store these packets at all and this
is how this commit solves the memleak.
Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@googlemail.com>
Signed-off-by: Jun Zhao <barryjzhao@tencent.com>

0a347ff4

08 Apr, 2019 1 commit

avformat/av1: Initialize padding in ff_isom_write_av1c · bb5efd17

Jeremy Dorfman via ffmpeg-devel authored Apr 08, 2019

Otherwise, AV1 encodes with FFmpeg trigger use-of-uninitialized-value
warnings under MemorySanitizer, and the output buffer potentially
changes from run to run.
Signed-off-by: James Almer <jamrial@gmail.com>

bb5efd17

07 Apr, 2019 8 commits

avfilter/af_asetnsamples: use correct function · ecdaa4b4
Paul B Mahol authored Apr 07, 2019

ecdaa4b4
avformat/riffdec: pass correct pointer to av_log · 3a2adeed
Paul B Mahol authored Apr 07, 2019

3a2adeed

avfilter/af_asetnsamples: fix sample queuing. · 4c8e3725

Nikolas Bowe via ffmpeg-devel authored Apr 06, 2019

When asetnsamples uses output samples < input samples, remaining samples build up in the fifo over time.
Fix this by marking the filter as ready again if there are enough samples.

Regression since ef3babb2Reviewed-by: Paul B Mahol <onemda@gmail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>

4c8e3725

swscale/ppc: VSX-optimize yuv2rgb_full_X · 8607e29f

Lauri Kasanen authored Apr 01, 2019

./ffmpeg -f lavfi -i yuvtestsrc=duration=1:size=1200x1440 \
                -s 1200x720 -f null -vframes 100 -pix_fmt $i -nostats \
                -cpuflags 0 -v error -

32-bit mul, power8 only.

~6.4x speedup:

rgb24
 214278 UNITS in yuv2packedX,   16384 runs,      0 skips
  33249 UNITS in yuv2packedX,   16384 runs,      0 skips
bgr24
 214616 UNITS in yuv2packedX,   16384 runs,      0 skips
  33233 UNITS in yuv2packedX,   16384 runs,      0 skips
rgba
 214517 UNITS in yuv2packedX,   16384 runs,      0 skips
  33271 UNITS in yuv2packedX,   16384 runs,      0 skips
bgra
 214973 UNITS in yuv2packedX,   16384 runs,      0 skips
  33397 UNITS in yuv2packedX,   16384 runs,      0 skips
argb
 214613 UNITS in yuv2packedX,   16384 runs,      0 skips
  33310 UNITS in yuv2packedX,   16384 runs,      0 skips
bgra
 214637 UNITS in yuv2packedX,   16384 runs,      0 skips
  33330 UNITS in yuv2packedX,   16384 runs,      0 skips

8607e29f

swscale/ppc: VSX-optimize yuv2rgb_full_2 · 3256e949

Lauri Kasanen authored Apr 01, 2019

./ffmpeg -f lavfi -i yuvtestsrc=duration=1:size=1200x1440 -sws_flags area \
            -s 1200x720 -f null -vframes 100 -pix_fmt $i -nostats \
            -cpuflags 0 -v error -

32-bit mul, power8 only.

~4x speedup:

rgb24
  52763 UNITS in yuv2packed2,   16384 runs,      0 skips
  13453 UNITS in yuv2packed2,   16384 runs,      0 skips
bgr24
  53144 UNITS in yuv2packed2,   16384 runs,      0 skips
  13616 UNITS in yuv2packed2,   16384 runs,      0 skips
rgba
  52796 UNITS in yuv2packed2,   16384 runs,      0 skips
  12904 UNITS in yuv2packed2,   16384 runs,      0 skips
bgra
  52732 UNITS in yuv2packed2,   16384 runs,      0 skips
  13262 UNITS in yuv2packed2,   16384 runs,      0 skips
argb
  52661 UNITS in yuv2packed2,   16384 runs,      0 skips
  12879 UNITS in yuv2packed2,   16384 runs,      0 skips
bgra
  52662 UNITS in yuv2packed2,   16384 runs,      0 skips
  12932 UNITS in yuv2packed2,   16384 runs,      0 skips

3256e949

swscale/ppc: VSX-optimize non-full-chroma yuv2rgb_1 · 50e672bc

Lauri Kasanen authored Mar 31, 2019

./ffmpeg -f lavfi -i yuvtestsrc=duration=1:size=1200x1440 -sws_flags fast_bilinear \
        -s 1200x1440 -f null -vframes 100 -pix_fmt $i -nostats \
        -cpuflags 0 -v error -

32-bit mul, power8 only.

1.8-2.3x speedup:

rgb24
  18192 UNITS in yuv2packed1,   32767 runs,      1 skips
   9983 UNITS in yuv2packed1,   32760 runs,      8 skips
bgr24
  18665 UNITS in yuv2packed1,   32766 runs,      2 skips
   9925 UNITS in yuv2packed1,   32763 runs,      5 skips
rgba
  20239 UNITS in yuv2packed1,   32767 runs,      1 skips
   8794 UNITS in yuv2packed1,   32759 runs,      9 skips
bgra
  20354 UNITS in yuv2packed1,   32768 runs,      0 skips
   8770 UNITS in yuv2packed1,   32761 runs,      7 skips
argb
  20185 UNITS in yuv2packed1,   32768 runs,      0 skips
   8761 UNITS in yuv2packed1,   32761 runs,      7 skips
bgra
  20360 UNITS in yuv2packed1,   32766 runs,      2 skips
   8759 UNITS in yuv2packed1,   32764 runs,      4 skips

This is a low speedup, but the x86 mmx version also gets only ~2x. The mmx version
is also heavily inaccurate, while the vsx version has high accuracy.

50e672bc

doc/examples/metadata: fix the example can't dump FLV metadata · 7c187514
Jun Zhao authored Apr 03, 2019
```
fix the example can't dump FLV metadata.
Signed-off-by: Jun Zhao <barryjzhao@tencent.com>
```
7c187514
lavf/Makefile: Fix kux demuxer standalone compilation. · d234ed76
Carl Eugen Hoyos authored Apr 07, 2019

d234ed76

06 Apr, 2019 2 commits

lavf/flvdec: added support for KUX container · 208ae228

Swaraj Hota authored Apr 06, 2019

Fixes ticket #4519.

The metadata starting at 0xe00004 is encrypted
with the password "meta" but zlib does not
support decryption, so no kux metadata is read.

208ae228

lavd/x11grab: fix vertical repositioning · f4f40cbb

Octavio Alvarez authored Mar 28, 2019

There is a calculation error in xcbgrab_reposition() that breaks
vertical repositioning on follow_mouse. It made the bottom
reposition occur when moving the mouse lower than N pixels after
the capture bottom edge, instead of before.

This commit fixes the calculation to match the documentation.

follow_mouse: centered or number of pixels. The documentation says:

When it is specified with "centered", the grabbing region follows
the mouse pointer and keeps the pointer at the center of region;
otherwise, the region follows only when the mouse pointer reaches
within PIXELS (greater than zero) to the edge of region.

f4f40cbb

05 Apr, 2019 3 commits

FATE: Add test for HEVC files that claim to have two first slices · 772c73e6
Derek Buitenhuis authored Mar 18, 2019
```
This makes sure we don't regress on 70c8c8a8.
Signed-off-by: Derek Buitenhuis <derek.buitenhuis@gmail.com>
```
772c73e6

avcodec/agm: Fix integer overflow with w/h · 2169a3f2

Michael Niedermayer authored Apr 04, 2019

Fixes: negation of -2147483648 cannot be represented in type 'int'; cast to an unsigned type to negate this value to itself
Fixes: 13999/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_AGM_fuzzer-5644405991538688

Found-by: continuous fuzzing process https://github.com/google/oss-fuzz/tree/master/projects/ffmpegReviewed-by: Paul B Mahol <onemda@gmail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>

2169a3f2

avformat/matroskadec: Improve length check · 18a851ac

Andreas Rheinhardt via ffmpeg-devel authored Mar 27, 2019

The earlier code had three flaws:

1. The case of an unknown-sized element inside a finite-sized element
(which is against the specifications) was not caught.

2. The error message wasn't helpful: It compared the length of the child
with the offset of the end of the parent and claimed that the first
exceeds the latter, although that is not necessarily true.

3. Unknown-sized elements that are not parsed can't be skipped. Given
that according to the Matroska specifications only the segment and the
clusters can be of unknown-size, this is handled by not allowing any
other units to have infinite size whereas the earlier code would seek
back by 1 byte upon encountering an infinite-size element that ought
to be skipped.
Signed-off-by: Andreas Rheinhardt <andreas.rheinhardt@googlemail.com>
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>

18a851ac

04 Apr, 2019 1 commit

avcodec/agm: More completely check size before using it · 8e3b01e2

Michael Niedermayer authored Apr 03, 2019

Fixes: out of array access
Fixes: 13997/clusterfuzz-testcase-minimized-ffmpeg_AV_CODEC_ID_AGM_fuzzer-5701427252428800

8e3b01e2

03 Apr, 2019 7 commits

avcodec/av1_metadata: add an option to remove Padding OBUs · ee16d14b
James Almer authored Mar 25, 2019
```
Reviewed-by: Mark Thompson <sw@jkqxz.net>
Signed-off-by: James Almer <jamrial@gmail.com>
```
ee16d14b

lavc/qsvenc: enable hevc gpb option · 1125277b

Zhong Li authored Jan 11, 2019

GPB is the default type, just contains forward references but the
slice_type is B slice with higher encoding efficiency than regular P
slice, but lower performance.

Add an option to allow user to set regular P slice.

Fix ticket#6870

Test data on Intel Kabylake (i7-7567U CPU @ 3.50GHz):
1. ffmpeg -hwaccel qsv -c:v h264_qsv -i bbb_sunflower_1080p_30fps_normal.mp4 -vsync passthrough
-vframes 1000  -c:v hevc_qsv -gpb 0 -bf 0 -q 25 test_gpb_off_bf0_kbl.mp4

transcoding fps: 85
encoded file size of test_gpb_off_bf0_kbl.mp4: 21960100 (bytes)

2. ffmpeg -hwaccel qsv -c:v h264_qsv -i bbb_sunflower_1080p_30fps_normal.mp4 -vsync passthrough
-vframes 1000  -c:v hevc_qsv -gpb 1 -bf 0 -q 25 test_gpb_on_bf0_kbl.mp4

transcoding fps: 79
encoded file size oftest_gpb_on_bf0_kbl.mp4:  21211449 (bytes)

In this case, enable gpb can bring about 7% performance drop but 3.4% encoding efficiency improvment.
Signed-off-by: Zhong Li <zhong.li@intel.com>

1125277b

lavc/qsvenc: enable hevc coding options configuration · c745bedd
Zhong Li authored Mar 27, 2019
```
Signed-off-by: Zhong Li <zhong.li@intel.com>
```
c745bedd
lavc/qsvenc: no need to include h264.h for jpeg encoder · 6f9d7c55
Zhong Li authored Apr 01, 2019
```
Signed-off-by: Zhong Li <zhong.li@intel.com>
```
6f9d7c55

lavf/movenc: fix tmcd writing for non-MP4/MOV modes · 8161ac29

Gyan Doshi authored Mar 30, 2019

write_tmcd allows tmcd track to be created with any mode but in
mov_write_header, index for first tmcd track is only set for modes
MP4 or MOV, causing a crash if tmcd creation is attempted with other
modes.

8161ac29

fate: unbreak fate with custom binary names · b131a07e
Gyan Doshi authored Apr 02, 2019

b131a07e

lavf/hashenc: Correct the hash/MD5 muxer class name · ecb4398d

Jun Zhao authored Mar 29, 2019

Follow the name style to correct the hash/md5 muxer class name
Signed-off-by: Jun Zhao <barryjzhao@tencent.com>

ecb4398d

02 Apr, 2019 5 commits

avcodec/libaomenc: fix range of values for enable-intrabc option · 0e1ea034
James Almer authored Apr 02, 2019
```
Signed-off-by: James Almer <jamrial@gmail.com>
```
0e1ea034

avcodec/cbs_av1: fix parsing spatial_id · 461303f9

James Almer authored Mar 25, 2019

Reviewed-by: Mark Thompson <sw@jkqxz.net>
Signed-off-by: James Almer <jamrial@gmail.com>

461303f9

libavcodec/zmbvenc: add support for 24-bit encoding, using pix_fmt BGR24. · b97a7dd0
Matthew Fearnley authored Mar 26, 2019
```
Support is #ifdef'd out at this stage, using ZMBV_ENABLE_24BPP (like in
the zmbv.c decoder)
```
b97a7dd0

libavcodec/zmbv: change 24-bit decoder channel order, from RGB24 to BGR24 · 1046e880

Matthew Fearnley authored Mar 29, 2019

This brings the channel order in line with that used in 32-bit mode (BGR0).

24-bit decoding is disabled by default (#ifdef ZMBV_ENABLE_24BPP), and no
prior encoders or sample videos are known to exist for this bit depth, so
I consider this change in implementation is unlikely to affect anyone.

The decision has been made in agreement with the DOSBox Development Team
(dosbox.crew@gmail.com), specifically with harekiet, who wrote the original
codec.

1046e880

libavcodec/zmbv: use PTRDIFF_SPECIFIER for `src - c->decomp_buf`. · 5dcc63c1
Matthew Fearnley authored Mar 26, 2019
```
Other bit depths saw this change in ced0d6c1, but this instance was
presumably missed because of the #ifdef block.
```
5dcc63c1