Commits · bd4d192081230b217ecab14d39daf06f40067191 · Linshizhi / ffmpeg.wasm-core

05 Apr, 2015 1 commit

swr/resample: use av_clip functions · 43482bd1

James Almer authored 9 years ago

Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

43482bd1

16 Feb, 2015 1 commit
- swresample/resample_template: Add () to protect the arguments of the OUT() macro · 0cb95f90
  Michael Niedermayer authored 9 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  0cb95f90
03 Jul, 2014 1 commit
- swr: initialize only the necessary resample dsp functions · 857cd1f3
  James Almer authored 10 years ago
```
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  857cd1f3
02 Jul, 2014 1 commit

Partially revert "swr: add prototypes for resample dsp functions" · 23a9edf5

James Almer authored 10 years ago

Prototypes are not needed anymore now that the x86 functions don't
include resample_template.c

The DO_RESAMPLE_ONE macro is removed for that same reason as well.
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

23a9edf5

01 Jul, 2014 1 commit

x86/swr: convert resample_{common, linear}_double_sse2 to yasm · dd2c9034

James Almer authored 10 years ago

Signed-off-by: James Almer <jamrial@gmail.com>

312531 -> 311528 dezicycles
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

dd2c9034

30 Jun, 2014 2 commits
- swr: convert resample_common/linear_int16_mmx2/sse2 to yasm. · 847bb638
  Ronald S. Bultje authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  847bb638
- swresample/resample_template: move division out of loop for float/double swri_resample_linear() · 418e5768
  Michael Niedermayer authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  418e5768
29 Jun, 2014 1 commit

swresample/resample_template: flip order of operations in swri_resample_linear() for 32bit · c5a405c4

Michael Niedermayer authored 10 years ago

Fixes integer overflow

Found-by: BBB
Reviewed-by: "Ronald S. Bultje" <rsbultje@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

c5a405c4

28 Jun, 2014 1 commit

swr: rewrite resample_common/linear_float_sse/avx in yasm. · faa1471f

Ronald S. Bultje authored 10 years ago

Linear interpolation goes from 63 (llvm) or 58 (gcc) to 48 (yasm)
cycles/sample on 64bit, or from 66 (llvm/gcc) to 52 (yasm) cycles/
sample on 32bit. Bon-linear goes from 43 (llvm) or 38 (gcc) to
32 (yasm) cycles/sample on 64bit, or from 46 (llvm) or 44 (gcc) to
38 (yasm) cycles/sample on 32bit (all testing on OSX 10.9.2, llvm
5.1 and gcc 4.8/9).
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

faa1471f

22 Jun, 2014 1 commit
- swr: remove another forgotten division in DSP function. · 0dae193d
  Ronald S. Bultje authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  0dae193d
18 Jun, 2014 1 commit

swr: remove div/mod from DSP functions. · cbf21628

Ronald S. Bultje authored 10 years ago

Also fix a bug with resample_compensation resetting dst_incr.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

cbf21628

15 Jun, 2014 1 commit
- swr: reindent. · edf93047
  Ronald S. Bultje authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  edf93047
14 Jun, 2014 4 commits

swr: add prototypes for resample dsp functions · 7f4dfbd0

James Almer authored 10 years ago

Should fix compilation failures with MSVC and any other compiler
without inline asm support.
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

7f4dfbd0

swr: split out DSP functions. · 7128a35f

Ronald S. Bultje authored 10 years ago

DSP bits of swri_resample go into their own mini-DSP functions; DSP
init goes from a per-call branch in multiple_resample to a proper
DSP init routine; x86 bits go into x86/; swri_resample() moves out of
resample_template.c into resample.c because it's independent of DSP
code or sample type; multiple_resample() is simplified.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

7128a35f

swr: handle initial negative sample index outside DSP function. · b785c626
Ronald S. Bultje authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
b785c626

swr: remove unnecessary assignment. · 6b9685de

Ronald S. Bultje authored 10 years ago

I don't see dst_incr/dst_incr_frac ever being changed from their
initial value (which is the inverse of this operation), so it seems
to me that this is a no-op.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

6b9685de

09 Jun, 2014 1 commit
- swr: handle 64bit overflow check in multiple_resample(). · f3413405
  Ronald S. Bultje authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  f3413405
02 Jun, 2014 1 commit

swr: move compensation_distance handling to swri_resample caller. · cdfd9717

Ronald S. Bultje authored 10 years ago

I think there's an off-by-one in terms of the switchpoint where we
switch from dst_incr to ideal_dst_incr, I don't think that's a massive
issue, but just be aware of that. It's probably trivial to prevent but
I don't care.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

I could not reproduce any off by 1 error, results are bit exact (michael)

cdfd9717

01 Jun, 2014 2 commits

swr/resample_template: prevent end_index from overflowing and add check for delta_frac overflow · 2c23f87c
Michael Niedermayer authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
2c23f87c

Rewrite main resampling loop (common and linear). · 9b538537

Ronald S. Bultje authored 10 years ago

This removes a branch at a performance-sensitive point (in the middle
of the loop). In fate-swr-resample-s32p-8000-2626, this makes the code
about 10% faster. It also simplifies the loops, allowing us to rewrite
it in yasm at some later point.

The compensation_distance != 0 code and index < 0 code are still kind
of hairy. For compensation_distance != 0, this should likely be handled
in the caller, so that it calls swri_resample twice (once until the
dst_incr switch-point, and once with the remainder of the samples). For
index < 0, the code should probably be rewritten to break out of the
loop once sample_index >= 0, and then resume (e.g. as a tail-call) to
the common or linear resampling loops.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

9b538537

16 May, 2014 1 commit

swresample: add swri_resample_float_avx · a9bf713d

James Almer authored 10 years ago

Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

a9bf713d

25 Apr, 2014 1 commit

swresample: add swri_resample_double_sse2 · cdac3ab5

James Almer authored 10 years ago

Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

cdac3ab5

15 Apr, 2014 1 commit

swresample/resample_template: try to consider src_size more exactly · 2b58c9c9

Michael Niedermayer authored 10 years ago

This should avoid slight differences in the output causes by input
size alignment differences between archs
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

2b58c9c9

14 Apr, 2014 2 commits
- swresample/resample: simplify index/consumed calculation for the filter = 1 case · 5e379cd3
  Michael Niedermayer authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  5e379cd3
- swresample/resample: Fix fractional part of index in the filter_size = 1 filters = 1 case · 6c8ee74a
  Michael Niedermayer authored 10 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  6c8ee74a
24 Mar, 2014 2 commits

swresample/resample: sse float linear interpolation · 63dbba65

James Almer authored 10 years ago

About two times faster
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

63dbba65

swresample/resample: mmx2/sse2 int16 linear interpolation · fa25c4c4

James Almer authored 10 years ago

About three times faster
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

fa25c4c4

20 Mar, 2014 1 commit

swresample: add swri_resample_float_sse · 32291ba6

James Almer authored 10 years ago

At least two times faster than the C version.
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

32291ba6

18 Mar, 2014 2 commits

swresample: reuse COMMON_CORE asm where possible · 3d48cbc5

James Almer authored 10 years ago

Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

3d48cbc5

swresample: change COMMON_CORE_INT16 asm from SSSE3 to SSE2 · 7c8bf09e

James Almer authored 10 years ago

pshuf+paddd is slightly faster than phaddd.
The real gain is in pre-ssse3 processors like AMD K8 and K10, which get
a big boost in performance compared to the mmxext version
Signed-off-by: James Almer <jamrial@gmail.com>
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

7c8bf09e

04 Feb, 2013 1 commit

swr/resample: fix integer overflow, add missing cast · b8c55590

Michael Niedermayer authored 11 years ago

The effects of this are limited to numeric errors in the output
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

b8c55590

06 Dec, 2012 1 commit
- resample: remove disabled debug code · b6a7f66f
  Michael Niedermayer authored 12 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  b6a7f66f
15 Nov, 2012 3 commits

swr/resample: move templating parameters to template itself. · 8ea88339

Clément Bœsch authored 12 years ago

It has various benefits such as allowing some refactoring, clarifying
the code in the inclusion part, and making the template understandable
in standalone.

This commit is based on the templating method used by Justin Ruggles for
libavresample.

8ea88339

swr: move if() block into the only branch where it can be true. · d53f4471
Michael Niedermayer authored 12 years ago
```
This should make the code a tiny tiny bit faster.
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
d53f4471

swr: reorder/redesign operations to avoid integer overflow. · 17da2d9e

Michael Niedermayer authored 12 years ago

This fixes a out of array read.

Found-by: Mateusz "j00ru" Jurczyk and Gynvael Coldwind
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>

17da2d9e

27 Jun, 2012 1 commit
- swr: MMX2 & SSSE3 int16 resample core · 4ccf6e39
  Michael Niedermayer authored 12 years ago
```
about 4 times faster
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  4ccf6e39
19 Jun, 2012 2 commits
- swr: introduce filter_alloc in preparation of SIMD resample optimisations · 0c142e4c
  Michael Niedermayer authored 12 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  0c142e4c
- swr/resample: optimize C code for the most common case · 80e857c9
  Michael Niedermayer authored 12 years ago
```
15% speedup
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  80e857c9
06 Jun, 2012 1 commit
- resample_template: use av_assert · 6e6dd999
  Michael Niedermayer authored 12 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  6e6dd999
10 Apr, 2012 1 commit
- swr: support float & int32 in the resampler · 7f1ae79d
  Michael Niedermayer authored 12 years ago
```
Signed-off-by: Michael Niedermayer <michaelni@gmx.at>
```
  7f1ae79d