1. 19 Feb, 2019 4 commits
    • Martin Storsjö's avatar
      2eeac799
    • Martin Storsjö's avatar
      aarch64: vp8: Fix assembling with clang · 26d7af4c
      Martin Storsjö authored
      This also partially fixes assembling with MS armasm64 (via
      gas-preprocessor).
      
      The movrel macro invocations need to pass the offset via a separate
      parameter. Mach-o and COFF relocations don't allow a negative
      offset to a symbol, which is handled properly if the offset is passed
      via the parameter. If no offset parameter is given, the macro
      evaluates to something like "adrp x17, subpel_filters-16+(0)", which
      older clang versions also fail to parse (the older clang versions
      only support one single offset term, although it can be a parenthesis.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      26d7af4c
    • Magnus Röös's avatar
      libavcodec: vp8 neon optimizations for aarch64 · 0801853e
      Magnus Röös authored
      Partial port of the ARM Neon for aarch64.
      
      Benchmarks from fate:
      
      benchmarking with Linux Perf Monitoring API
      nop: 58.6
      checkasm: using random seed 1760970128
      NEON:
       - vp8dsp.idct       [OK]
       - vp8dsp.mc         [OK]
       - vp8dsp.loopfilter [OK]
      checkasm: all 21 tests passed
      vp8_idct_add_c: 201.6
      vp8_idct_add_neon: 83.1
      vp8_idct_dc_add_c: 107.6
      vp8_idct_dc_add_neon: 33.8
      vp8_idct_dc_add4y_c: 426.4
      vp8_idct_dc_add4y_neon: 59.4
      vp8_loop_filter8uv_h_c: 688.1
      vp8_loop_filter8uv_h_neon: 216.3
      vp8_loop_filter8uv_inner_h_c: 649.3
      vp8_loop_filter8uv_inner_h_neon: 195.3
      vp8_loop_filter8uv_inner_v_c: 544.8
      vp8_loop_filter8uv_inner_v_neon: 131.3
      vp8_loop_filter8uv_v_c: 706.1
      vp8_loop_filter8uv_v_neon: 141.1
      vp8_loop_filter16y_h_c: 668.8
      vp8_loop_filter16y_h_neon: 242.8
      vp8_loop_filter16y_inner_h_c: 647.3
      vp8_loop_filter16y_inner_h_neon: 224.6
      vp8_loop_filter16y_inner_v_c: 647.8
      vp8_loop_filter16y_inner_v_neon: 128.8
      vp8_loop_filter16y_v_c: 721.8
      vp8_loop_filter16y_v_neon: 154.3
      vp8_loop_filter_simple_h_c: 387.8
      vp8_loop_filter_simple_h_neon: 187.6
      vp8_loop_filter_simple_v_c: 384.1
      vp8_loop_filter_simple_v_neon: 78.6
      vp8_put_epel8_h4v4_c: 3971.1
      vp8_put_epel8_h4v4_neon: 855.1
      vp8_put_epel8_h4v6_c: 5060.1
      vp8_put_epel8_h4v6_neon: 989.6
      vp8_put_epel8_h6v4_c: 4320.8
      vp8_put_epel8_h6v4_neon: 1007.3
      vp8_put_epel8_h6v6_c: 5449.3
      vp8_put_epel8_h6v6_neon: 1158.1
      vp8_put_epel16_h6_c: 6683.8
      vp8_put_epel16_h6_neon: 831.8
      vp8_put_epel16_h6v6_c: 11110.8
      vp8_put_epel16_h6v6_neon: 2214.8
      vp8_put_epel16_v6_c: 7024.8
      vp8_put_epel16_v6_neon: 799.6
      vp8_put_pixels8_c: 112.8
      vp8_put_pixels8_neon: 78.1
      vp8_put_pixels16_c: 131.3
      vp8_put_pixels16_neon: 129.8
      
      This contains a fix to include guards by Carl Eugen Hoyos.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      0801853e
    • Luca Barbato's avatar
      Unbreak travis on macos · 899ee030
      Luca Barbato authored
      899ee030
  2. 16 Feb, 2019 11 commits
  3. 12 Feb, 2019 1 commit
    • Sven Dueking's avatar
      srt: Set srto_sender flag to sender srt socket · 90b15f60
      Sven Dueking authored
      SRT API Documentation:
      This flag is superfluous if both parties are at least version 1.3.0
      (this shall be enforced by setting this value to SRTO_MINVERSION if
      you expect that it be true) and therefore support HSv5 handshake,
      where the SRT extended handshake is done with the overall handshake
      process.
      
      This flag is however obligatory if at least one party may be using
      SRT below version 1.3.0 and does not support HSv5.
      90b15f60
  4. 27 Jan, 2019 1 commit
  5. 26 Jan, 2019 5 commits
    • Martin Storsjö's avatar
      libopenh264dec: Use a newer decoding entry point function · eec93e57
      Martin Storsjö authored
      The "new" entry point actually has existed since OpenH264 1.4 in
      2015 and is the the recommended decoding entry point.
      
      The name of this function, DecodeFrameNoDelay, is rather backwards
      considering that it doesn't return the latest decoded frame immediately,
      but actually does proper delaying and reordering of frames.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      eec93e57
    • Janne Grunau's avatar
      h264/aarch64: add intra loop filter neon asm · 28a8b541
      Janne Grunau authored
      Add my neon asm from x264 relicensed under the LGPL 2.1 or later. Ported
      (x264 uses nv12 chroma) and optimized.
      
      Cycle count for checkasm --bench on a Snapdragon 820e:
      h264_h_loop_filter_luma_intra_8bpp_c: 60.0
      h264_h_loop_filter_luma_intra_8bpp_neon: 54.2
      h264_v_loop_filter_luma_intra_8bpp_c: 148.3
      h264_v_loop_filter_luma_intra_8bpp_neon: 73.8
      h264_h_loop_filter_chroma_intra_8bpp_c: 27.8
      h264_h_loop_filter_chroma_intra_8bpp_neon: 21.4
      h264_h_loop_filter_chroma_mbaff_intra_8bpp_c: 15.8
      h264_h_loop_filter_chroma_mbaff_intra_8bpp_neon: 15.7
      h264_v_loop_filter_chroma_intra_8bpp_c: 45.8
      h264_v_loop_filter_chroma_intra_8bpp_neon: 17.3
      28a8b541
    • Janne Grunau's avatar
      h264/aarch64: optimize neon loop filter · 846c3d6a
      Janne Grunau authored
      Exit as soon as possible if no filtering will be done.
      
      Improves the checkasm --bench cycle count on a Snapdragon 820e:
      h264_h_loop_filter_luma_8bpp_c:      72.4 ->  72.5
      h264_h_loop_filter_luma_8bpp_neon:   97.1 ->  56.3
      h264_v_loop_filter_luma_8bpp_c:     174.0 -> 173.5
      h264_v_loop_filter_luma_8bpp_neon:   62.9 ->  60.9
      h264_h_loop_filter_chroma_8bpp_c:    30.2 ->  30.3
      h264_h_loop_filter_chroma_8bpp_neon: 51.6 ->  25.7
      h264_v_loop_filter_chroma_8bpp_c:    57.3 ->  57.3
      h264_v_loop_filter_chroma_8bpp_neon: 28.0 ->  24.0
      846c3d6a
    • Janne Grunau's avatar
      checkasm/h264: add loop filter tests · d7f4f5c4
      Janne Grunau authored
      d7f4f5c4
    • Janne Grunau's avatar
      bb515e3a
  6. 25 Jan, 2019 1 commit
    • Martin Storsjö's avatar
      arm: Create proper .rdata sections for COFF · 41cf3e3b
      Martin Storsjö authored
      As .rodata isn't one of the default created sections for COFF, it was
      created as a read-write data section. By using the default .rdata
      section name for COFF, it automatically becomes a read-only data section.
      The existing ".section .rodata" works as intended for ELF though.
      
      This is based on an original patch and diagnose by Tom Tan
      <Tom.Tan@microsoft.com>.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      41cf3e3b
  7. 23 Jan, 2019 1 commit
  8. 17 Jan, 2019 1 commit
  9. 12 Dec, 2018 1 commit
    • James Almer's avatar
      libdav1d: update API usage to the first stable release · 70ab2778
      James Almer authored
      The color fields were moved to another struct, and a way to propagate
      timestamps and other input metadata was introduced, so the packet
      fifo can be removed.
      
      Add support for 12bit streams, an option to disable film grain, and
      read the profile from the sequence header referenced by the ouput
      picture instead of guessing based on output pix_fmt.
      Signed-off-by: 's avatarJames Almer <jamrial@gmail.com>
      70ab2778
  10. 15 Nov, 2018 1 commit
  11. 13 Nov, 2018 1 commit
    • Linjie Fu's avatar
      qsvenc: Add VDENC support for H264 and HEVC · e716323f
      Linjie Fu authored
      Add VDENC(lowpower mode) support for QSV h264 and HEVC
      
      It's an experimental function(like lowpower in vaapi) with
      some limitations:
      - CBR/VBR require HuC which should be explicitly loaded via i915
      module parameter(i915.enable_guc=2 for linux kerner version >= 4.16)
      - HEVC VDENC was supported >= ICE LAKE
      
      use option "-low_power 1" to enable VDENC.
      Signed-off-by: 's avatarLinjie Fu <linjie.fu@intel.com>
      e716323f
  12. 06 Nov, 2018 3 commits
  13. 05 Nov, 2018 2 commits
  14. 27 Oct, 2018 2 commits
  15. 13 Oct, 2018 2 commits
  16. 12 Oct, 2018 1 commit
    • Martin Storsjö's avatar
      arm: Emit .thumb_func directives · 5584abf6
      Martin Storsjö authored
      Prior to Xcode 9.3, the clang built-in assembler didn't support
      altmacro, and gas-preprocessor was used for assembling for arm/darwin.
      
      For thumb functions, gas-preprocessor took care of adding the .thumb_func
      directives, but when now being able to assemble without gas-preprocessor,
      we need to add these directives ourselves.
      Signed-off-by: 's avatarMartin Storsjö <martin@martin.st>
      5584abf6
  17. 11 Oct, 2018 2 commits