1. 16 Apr, 2019 7 commits
  2. 15 Apr, 2019 10 commits
  3. 14 Apr, 2019 4 commits
  4. 13 Apr, 2019 3 commits
  5. 11 Apr, 2019 9 commits
  6. 10 Apr, 2019 5 commits
  7. 09 Apr, 2019 2 commits
    • Lynne's avatar
      aarch64/opusdsp: implement NEON accelerated postfilter and deemphasis · 4d2f6215
      Lynne authored
      153372 UNITS in postfilter_c,   65536 runs,      0 skips
      73164 UNITS in postfilter_neon,   65536 runs,      0 skips -> 2.1x speedup
      
      80591 UNITS in deemphasis_c,  131072 runs,      0 skips
      43969 UNITS in deemphasis_neon,  131072 runs,      0 skips -> 1.83x speedup
      
      Total decoder speedup: ~15% on a Raspberry Pi 3 (from 28.1x to 33.5x realtime)
      
      Deemphasis SIMD based on the following unrolling:
      const float c1 = CELT_EMPH_COEFF, c2 = c1*c1, c3 = c2*c1, c4 = c3*c1;
      float state = coeff;
      
      for (int i = 0; i < len; i += 4) {
          y[0] = x[0] + c1*state;
          y[1] = x[1] + c2*state + c1*x[0];
          y[2] = x[2] + c3*state + c1*x[1] + c2*x[0];
          y[3] = x[3] + c4*state + c1*x[2] + c2*x[1] + c3*x[0];
      
          state = y[3];
          y += 4;
          x += 4;
      }
      
      Unlike the x86 version, duplication is used instead of pslldq so
      the structure and tables are different.
      4d2f6215
    • Jarek Samic's avatar
      libavutil/hwcontext_opencl: Fix channel order in format support check · 1c50d61a
      Jarek Samic authored
      The `opencl_get_plane_format` function was incorrectly determining the
      value used to set the image channel order. This resulted in all RGB
      pixel formats being set to the `CL_RGBA` pixel format, regardless of
      whether or not they actually *were* RGBA.
      
      This patch fixes the issue by using the `offset` and depth of components
      rather than the loop index to determine the value of `order`.
      Signed-off-by: 's avatarJarek Samic <cldfire3@gmail.com>
      Signed-off-by: 's avatarMark Thompson <sw@jkqxz.net>
      1c50d61a