Commits · 93d1756af2908150f7c8c0590b9ed246951d474a · Linshizhi / ffmpeg.wasm-core

09 May, 2018 14 commits

avcodec/cuviddec: explicitly synchronize cuMemcpy calls · 93d1756a
Timo Rothenpieler authored May 08, 2018

93d1756a
avutil/hwcontext_cuda: explicitly synchronize cuMemcpy calls · 9b82e333
Timo Rothenpieler authored May 08, 2018

9b82e333
avcodec/nvdec: pass CUstream in vpp parameters · 880236e8
Timo Rothenpieler authored May 07, 2018

880236e8
avutil/hwcontext_cuda: add CUstream in cuda hwctx · c8556834
Timo Rothenpieler authored May 07, 2018

c8556834

avcodec/nvdec: avoid needless copy of output frame · baabd3c2

Timo Rothenpieler authored May 07, 2018

Replaces the data pointers with the mapped cuvid ones.
Adds buffer_refs to the frame to ensure the needed contexts stay alive
and the cuvid idx stays allocated.
Adds another buffer_ref to unmap the frame when it's unreferenced itself.

baabd3c2
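
The lifetime scheme in the message above can be illustrated with a minimal, self-contained sketch. This is not FFmpeg's AVBufferRef implementation: `RefBuf`, `refbuf_create`, `refbuf_ref`, `refbuf_unref` and `unmap_frame` are hypothetical names chosen for illustration, and the counter is a plain `int` rather than an atomic. It only shows the idea of attaching a release callback that runs once the last reference goes away.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical reference-counted handle with a release callback. */
typedef struct RefBuf {
    int refcount;                   /* not atomic: single-threaded sketch */
    void (*free_cb)(void *opaque);  /* runs when the last reference is dropped */
    void *opaque;                   /* user data handed to free_cb */
} RefBuf;

static RefBuf *refbuf_create(void (*free_cb)(void *), void *opaque)
{
    RefBuf *b = malloc(sizeof(*b));
    if (!b)
        return NULL;
    b->refcount = 1;
    b->free_cb  = free_cb;
    b->opaque   = opaque;
    return b;
}

/* Take an additional reference to the same handle. */
static RefBuf *refbuf_ref(RefBuf *b)
{
    assert(b && b->refcount > 0);
    b->refcount++;
    return b;
}

/* Drop one reference; the callback fires only for the last one. */
static void refbuf_unref(RefBuf **pb)
{
    RefBuf *b = *pb;
    *pb = NULL;
    if (!b)
        return;
    if (--b->refcount == 0) {
        if (b->free_cb)
            b->free_cb(b->opaque);
        free(b);
    }
}

/* Hypothetical stand-in for unmapping a decoder surface:
 * it just clears a "mapped" flag. */
static void unmap_frame(void *opaque)
{
    *(int *)opaque = 0;
}
```

With one reference held by the frame and another taken elsewhere, the mapped flag stays set until both have been released, at which point `unmap_frame` runs exactly once.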

Revert "avcodec/nvenc: make hw_frames_ctx fully optional" · 2e700b08

Timo Rothenpieler authored May 07, 2018

This reverts commit 7d4e1f7c.

Accidentally pushed this with a batch of other patches, and it didn't
seem to break anything, so I went with it.
Except it does, so reverting it it is.

2e700b08

avformat/mpegts: clean up whitespace · 07d9c310
Aman Gupta authored May 09, 2018
```
Signed-off-by: Aman Gupta <aman@tmm1.net>
```
07d9c310
avformat/mpegts: use MAX_SECTION_SIZE instead of hardcoded value · 1a14e391
Aman Gupta authored May 09, 2018
```
Signed-off-by: Aman Gupta <aman@tmm1.net>
```
1a14e391

avformat/mpegts: skip non-PMT tids earlier · 2c500f50

Aman Gupta authored May 08, 2018

This mimics the logic flow in all the other callbacks
(pat_cb, sdt_cb, m4sl_cb), and avoids calling skip_identical()
for non PMT_TID packets.

Since skip_identical modifies internal state like
MpegTSSectionFilter.last_ver, this change prevents unnecessary
reprocessing on some streams which contain multiple tables in
the PMT pid. This can be observed with streams from certain US
cable providers, which include both tid=0x2 and another unspecified
tid=0xc0.

Signed-off-by: Aman Gupta <aman@tmm1.net>

2c500f50
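
The ordering issue described above can be sketched in isolation. The code below is a simplified illustration, not the mpegts.c source: `SectionFilter` is a hypothetical stand-in for MpegTSSectionFilter, and this `skip_identical` only compares a version number, which is an assumption made to keep the example short. It shows why the table id should be checked before a helper that updates state.

```c
#include <assert.h>
#include <stdint.h>

#define PMT_TID 0x02

/* Hypothetical, reduced stand-in for MpegTSSectionFilter. */
typedef struct SectionFilter {
    int last_ver;   /* version of the last accepted section, -1 if none */
} SectionFilter;

/* Returns 1 if the section repeats the last seen version.
 * Side effect: otherwise records the new version in the filter. */
static int skip_identical(SectionFilter *f, int version)
{
    if (f->last_ver == version)
        return 1;
    f->last_ver = version;
    return 0;
}

/* Returns 1 if the section would be parsed as a PMT, 0 if it is skipped. */
static int pmt_cb(SectionFilter *f, uint8_t tid, int version)
{
    assert(f);
    /* Reject foreign table ids first, so they never touch last_ver. */
    if (tid != PMT_TID)
        return 0;
    if (skip_identical(f, version))
        return 0;
    /* ... PMT parsing would happen here ... */
    return 1;
}
```

If the two checks were swapped, a tid=0xc0 section interleaved on the same pid would overwrite `last_ver`, and the next unchanged PMT would be parsed again instead of being skipped.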

ffprobe: fix SEGV when new streams are added · 12ceaf0f
Aman Gupta authored May 08, 2018
```
Signed-off-by: Aman Gupta <aman@tmm1.net>
```
12ceaf0f

avcodec/hevc: remove videotoolbox hack · a19bac8f

Aman Gupta authored May 04, 2018

No longer required since 63d87577. The equivalent hack
for h264 was removed in that commit, but this one was missed.

Signed-off-by: Aman Gupta <aman@tmm1.net>

a19bac8f

avcodec/videotoolbox: split h264/hevc callbacks · 07d175d0

Aman Gupta authored May 04, 2018

Previously the shared callbacks were trying to interpret
avctx->priv_data as H264Context*.

Signed-off-by: Aman Gupta <aman@tmm1.net>

07d175d0

avcodec/videotoolbox: cleanups · dd77cca1
Aman Gupta authored May 04, 2018
```
No functional changes.
Signed-off-by: Aman Gupta <aman@tmm1.net>
```
dd77cca1

avcodec/cbs_h2645: use AVBufferRef to store list of active parameter sets · c6a63e11

James Almer authored May 08, 2018

Removes unnecessary data copies, and partially fixes potential issues
with dangling references held in said lists.

Reviewed-by: Mark Thompson <sw@jkqxz.net>
Signed-off-by: James Almer <jamrial@gmail.com>

c6a63e11

08 May, 2018 26 commits
- avformat/mxfenc: Write transfer characteristic · 293a6e83
  Michael Niedermayer authored Mar 21, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  293a6e83
- avformat/mxfenc: Add Stored F2 Offset / Image Start/End Offset for D10 · c35ca7e0
  Michael Niedermayer authored Apr 17, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  c35ca7e0
- avformat/mxfenc: Write Audio Ref Level for D10 · 530ac1e5
  Michael Niedermayer authored Apr 06, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  530ac1e5
- avformat/mxfenc: Add Padding Bits · 1246754c
  Michael Niedermayer authored Apr 05, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  1246754c
- avformat/mxfenc: add white/black ref /color range · 6d033909
  Michael Niedermayer authored Apr 05, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  6d033909
- avformat/mxfenc: Add vertical subsampling support · 2bee43b6
  Michael Niedermayer authored Mar 21, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  2bee43b6
- avformat/mxfenc: Fix stored width · 77cbe698
  Michael Niedermayer authored Mar 17, 2018
```
This fixes the width to have computations matching the height
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  77cbe698
- avformat/mxfenc: Add object model version · 1b6c89ca
  Michael Niedermayer authored Mar 17, 2018
```
Other tools (XFConvert at least) write this as well.
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  1b6c89ca
- avformat/mxfenc: Add Product Version, Toolkit version and Platform · 86c92509
  Michael Niedermayer authored Mar 17, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  86c92509
- avformat/mxfenc: Bump minor versions for S377-1-2009 · 3ba1bbb4
  Michael Niedermayer authored May 08, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  3ba1bbb4
- avformat/mxfenc: Correct KAG alignment of preface · 5c705134
  Michael Niedermayer authored Apr 30, 2018
```
Signed-off-by: Michael Niedermayer <michael@niedermayer.cc>
```
  5c705134
- lavfi/vf_srcnn: use avio_check instead of access · 8007a863
  Hendrik Leppkes authored May 08, 2018
```
The filter uses avio for file access already, and avio_check is
portable.

Fixes trac #7192.
```
  8007a863
- lavc/cfhd: use AV_CEIL_RSHIFT instead of deprecated FF_CEIL_RSHIFT · 6876a633
  Clément Bœsch authored May 08, 2018
  
  6876a633
- lavfi/swaprect: use AV_CEIL_RSHIFT instead of deprecated FF_CEIL_RSHIFT · 1eb4e731
  Clément Bœsch authored May 08, 2018
  
  1eb4e731
- lavfi/nlmeans: use AV_CEIL_RSHIFT instead of deprecated FF_CEIL_RSHIFT · 8d6354aa
  Clément Bœsch authored May 08, 2018
  
  8d6354aa
- fate/hapenc : remove tests due to inconsistent result · 6ebc7184
  Martin Vignali authored May 08, 2018
  
  6ebc7184
- lavfi/nlmeans: inline integral patch value function · e6114d21
  Clément Bœsch authored May 07, 2018
```
This prevents redundant position computation and makes the code faster
(1.1x faster overall).
```
  e6114d21
- lavfi/nlmeans: use unsigned for the integral patch value · 4278f79e
  Clément Bœsch authored May 07, 2018
```
This value cannot be negative.
```
  4278f79e
- lavfi/nlmeans: reorder memory accesses in get_integral_patch_value · de956198
  Clément Bœsch authored May 06, 2018
```
This doesn't seem to make much of a difference but it can't hurt.
```
  de956198
- lavfi/nlmeans: move final weighted averaging out of nlmeans_plane · 34e1e53e
  Clément Bœsch authored May 06, 2018
```
This helps figuring out where the filter is slow:

  70.53%  ffmpeg_g  ffmpeg_g          [.] nlmeans_slice
  25.73%  ffmpeg_g  ffmpeg_g          [.] compute_safe_ssd_integral_image_c
   1.74%  ffmpeg_g  ffmpeg_g          [.] compute_unsafe_ssd_integral_image
   0.82%  ffmpeg_g  ffmpeg_g          [.] ff_mjpeg_decode_sos
   0.51%  ffmpeg_g  [unknown]         [k] 0xffffffff91800a80
   0.24%  ffmpeg_g  ffmpeg_g          [.] weight_averages

(Tested with a large image that takes several seconds to process)

Since weight_averages is irrelevant speed-wise, the file's TODO is
updated.
```
  34e1e53e
- lavfi/nlmeans: switch from double to float · 667503ef
  Clément Bœsch authored May 06, 2018
```
Overall speed appears to be 1.1x faster with no noticeable quality
impact.
```
  667503ef
- lavfi/nlmeans: make compute_safe_ssd_integral_image_c faster · 43d16aef
  Clément Bœsch authored May 06, 2018
```
before:  ssd_integral_image_c: 49204.6
after:   ssd_integral_image_c: 44272.8

Unrolling by 4 made the biggest difference on odroid-c2 (aarch64);
unrolling by 2 or 8 both took about 46k cycles, versus 44k for 4.

Additionally, this is a much better reference when writing SIMD (SIMD
vectorization will just target 16 instead of 4).
```
  43d16aef
- checkasm: add vf_nlmeans test for ssd_integral_image · f679711c
  Clément Bœsch authored May 06, 2018
  
  f679711c
- lavfi/nlmeans: add AArch64 SIMD for compute_safe_ssd_integral_image · 5a71bce3
  Clément Bœsch authored May 06, 2018
```
ssd_integral_image_c: 49204.6
ssd_integral_image_neon: 28346.8
```
  5a71bce3
- lavfi/nlmeans: use ptrdiff_t for linesizes · 5ba14f4f
  Clément Bœsch authored May 06, 2018
```
Similarly to the previous commit, this will help when writing SIMD code
by avoiding manual zero-extension there.
```
  5ba14f4f
- lavfi/nlmeans: add SIMD-friendly assumptions for compute_safe_ssd_integral_image · 26f02c51
  Clément Bœsch authored May 06, 2018
```
SIMD code will not have to deal with padding itself. Overwriting in that
function may have been possible but would involve large overreading of the
sources. Instead, we simply make sure the width to process is always a
multiple of 16. Additionally, there must be some actual area to process
so the SIMD code can have its boundary checks after processing the first
pixels.
```
  26f02c51
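
The nlmeans commits above mention an inner loop unrolled by 4, `ptrdiff_t` linesizes, and a width that is always a multiple of 16 with a non-empty area. The following is a minimal sketch of how those pieces can fit together. It is not the code from these commits: `ssd_integral_unroll4` and `align_up16` are hypothetical names, and the buffer layout (one zeroed row above and one zeroed column to the left of `dst`) is an assumption of this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Round a width up to the next multiple of 16. */
static int align_up16(int w)
{
    return (w + 15) & ~15;
}

/* Integral image of squared differences between s1 and s2, with the
 * inner loop unrolled by 4.
 *
 * Assumptions of this sketch:
 *  - dst points at element (0,0) of a buffer that has one zeroed row
 *    above and one zeroed column to the left, so dst[-1] and
 *    dst[-dst_stride - 1] are valid reads;
 *  - w is a multiple of 16 and at least 16, and h is at least 1;
 *  - strides are given in elements and use ptrdiff_t, so no manual
 *    widening is needed for pointer arithmetic. */
static void ssd_integral_unroll4(uint32_t *dst, ptrdiff_t dst_stride,
                                 const uint8_t *s1, ptrdiff_t stride1,
                                 const uint8_t *s2, ptrdiff_t stride2,
                                 int w, int h)
{
    assert(!(w & 15) && w >= 16 && h >= 1);

    for (int y = 0; y < h; y++) {
        const uint32_t *top = dst - dst_stride;   /* previous output row */

        for (int x = 0; x < w; x += 4) {
            const int d0 = s1[x    ] - s2[x    ];
            const int d1 = s1[x + 1] - s2[x + 1];
            const int d2 = s1[x + 2] - s2[x + 2];
            const int d3 = s1[x + 3] - s2[x + 3];

            const uint32_t t0 = top[x    ];
            const uint32_t t1 = top[x + 1];
            const uint32_t t2 = top[x + 2];
            const uint32_t t3 = top[x + 3];

            /* ii(x,y) = ii(x-1,y) - ii(x-1,y-1) + ii(x,y-1) + d*d */
            dst[x    ] = dst[x - 1] - top[x - 1] + t0 + d0 * d0;
            dst[x + 1] = dst[x    ] - t0         + t1 + d1 * d1;
            dst[x + 2] = dst[x + 1] - t1         + t2 + d2 * d2;
            dst[x + 3] = dst[x + 2] - t2         + t3 + d3 * d3;
        }
        s1  += stride1;
        s2  += stride2;
        dst += dst_stride;
    }
}
```

A caller would round the processed width with `align_up16` before invoking the function, so the loop body never needs a scalar tail. No timing claim is made for this sketch; the cycle counts quoted in the commit messages belong to the original code.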