Commits · 8f2aa89a5d4e0bc530af7dc7b4d9e9b2d61f7feb · Linshizhi / ffmpeg.wasm-core

07 Aug, 2012 1 commit
- Replace all CODEC_ID_* with AV_CODEC_ID_* · 36ef5369
  Anton Khirnov authored 12 years ago
  
  36ef5369
03 Aug, 2012 2 commits

ARMv6: vp8: fix stack allocation with Apple's assembler · e6cd6989

Mans Rullgard authored 12 years ago

In the GNU assembler, a relational expression, bizarrely, has the
value -1 if true, whereas in Apple's it is +1.  This patch makes
sure the correct expression is used in both cases.
Signed-off-by: Mans Rullgard <mans@mansr.com>

e6cd6989

ARM: vp56: allow inline asm to build with clang · 9829a81b

Mans Rullgard authored 12 years ago

The clang integrated assembler does not support pre-UAL syntax,
while gcc requires pre-UAL syntax for ARM code.  A patch[1] for
clang to support the old syntax as well has been ignored since
January.

This patch chooses the syntax appropriate for each compiler,
allowing both to build the code.  Notably, this change allows
building for iphone with the latest Apple Xcode update.

[1] http://llvm.org/bugs/show_bug.cgi?id=11855Signed-off-by: Mans Rullgard <mans@mansr.com>

9829a81b

01 Aug, 2012 2 commits

ARM: use =const syntax instead of explicit literal pools · faa78822
Mans Rullgard authored 12 years ago
```
Signed-off-by: Mans Rullgard <mans@mansr.com>
```
faa78822

ARM: use standard syntax for all LDRD/STRD instructions · 99817091

Mans Rullgard authored 12 years ago

The standard syntax requires two destination registers for
LDRD/STRD instructions.  Some versions of the GNU assembler
allow using only one with the second implicit, others are
more strict.
Signed-off-by: Mans Rullgard <mans@mansr.com>

99817091

18 Jul, 2012 2 commits

vp3: move idct and loop filter pointers to new vp3dsp context · 28f9ab70

Mans Rullgard authored 12 years ago

This moves all VP3-specific function pointers from dsputil to a
new vp3dsp context.  There is no reason to ever use the VP3 IDCT
where an MPEG2 IDCT is expected or vice versa.
Signed-off-by: Mans Rullgard <mans@mansr.com>

28f9ab70

build: add CONFIG_VP3DSP, reduce repetition in OBJS lists · ab9f9876
Mans Rullgard authored 12 years ago
```
Signed-off-by: Mans Rullgard <mans@mansr.com>
```
ab9f9876

01 Jul, 2012 1 commit

ARM: generate position independent code to access data symbols · 62634158

Mans Rullgard authored 12 years ago

This creates proper position independent code when accessing
data symbols if CONFIG_PIC is set.

References to external symbols should now use the movrelx macro.
Some additional code changes are required since this macro may
need a register to hold the GOT pointer.
Signed-off-by: Mans Rullgard <mans@mansr.com>

62634158

18 Jun, 2012 1 commit
- float_dsp: Move vector_fmac_scalar() from libavcodec to libavutil · cb5042d0
  Justin Ruggles authored 12 years ago
  
  cb5042d0
08 Jun, 2012 2 commits
- Add a float DSP framework to libavutil · d5a7229b
  Justin Ruggles authored 12 years ago
```
Move vector_fmul() from DSPContext to AVFloatDSPContext.
```
  d5a7229b
- ARM: Move asm.S from libavcodec to libavutil · 94d2b0d2
  Justin Ruggles authored 12 years ago
```
This will allow for easier implementation of ARM-optimized functions in
libraries other than libavcodec.
```
  94d2b0d2
10 May, 2012 3 commits

arm/neon: dsputil: use correct size specifiers on vld1/vst1 · e54e6f25

Mans Rullgard authored 12 years ago

Change the size specifiers to match the actual element sizes
of the data.  This makes no practical difference with strict
alignment checking disabled (the default) other than somewhat
documenting the code.  With strict alignment checking on, it
avoids trapping the unaligned loads.
Signed-off-by: Mans Rullgard <mans@mansr.com>

e54e6f25

arm: dsputil: prettify some conditional instructions in put_pixels macros · 2eba6898
Mans Rullgard authored 12 years ago
```
Signed-off-by: Mans Rullgard <mans@mansr.com>
```
2eba6898

arm: dsputil: fix overreads in put/avg_pixels functions · cbc7d60a

Mans Rullgard authored 12 years ago

The vertically interpolating variants of these functions read
ahead one line to optimise the loop.  On the last line processed,
this might be outside the buffer.  Fix these invalid reads by
processing the last line outside the loop.
Signed-off-by: Mans Rullgard <mans@mansr.com>

cbc7d60a

05 May, 2012 1 commit
- aacps: NEON optimisations · 96f7590e
  Mans Rullgard authored 13 years ago
```
Signed-off-by: Mans Rullgard <mans@mansr.com>
```
  96f7590e
25 Apr, 2012 4 commits

vp8: armv6: fix non-armv6t2 build · 3d11c2d7

Mans Rullgard authored 12 years ago

The assembler may fail to place literal pools close enough to
instructions referencing them.  An explicit .ltorg directive
fixes this.
Signed-off-by: Mans Rullgard <mans@mansr.com>

3d11c2d7

vp8: armv6 optimisations · e4ac0312

Mans Rullgard authored 12 years ago

Based on patch by Ronald S. Bultje <rsbultje@gmail.com>,
partially ported from libvpx.
Signed-off-by: Mans Rullgard <mans@mansr.com>

e4ac0312

vp8: arm: separate ARMv6 functions from NEON · b692d246

Mans Rullgard authored 12 years ago

This is a preparation for complete ARMv6 optimisations.
Signed-off-by: Mans Rullgard <mans@mansr.com>

b692d246

ARM: add some compatibility macros · dac78fd1

Mans Rullgard authored 13 years ago

This adds some macros simplifying Thumb and pre-v6T2 compatibility.
Signed-off-by: Mans Rullgard <mans@mansr.com>

dac78fd1

22 Apr, 2012 1 commit

ARM: allow runtime masking of CPU features · d526c533

Mans Rullgard authored 12 years ago

This allows masking CPU features with the -cpuflags avconv option
which is useful for testing different optimisations without rebuilding.
Signed-off-by: Mans Rullgard <mans@mansr.com>

d526c533

21 Apr, 2012 1 commit

Remove lowres video decoding · 2bcbd984

Mans Rullgard authored 12 years ago

This feature is complex, of questionable utility, and slows down
normal decoding.
Signed-off-by: Mans Rullgard <mans@mansr.com>

2bcbd984

12 Apr, 2012 1 commit
- build: Consistently handle conditional compilation for all optimization OBJS. · 7bb3a302
  Diego Biurrun authored 13 years ago
  
  7bb3a302
10 Apr, 2012 1 commit

rv40dsp: implement prescaled versions for biweight. · 272b252c

Christophe GISQUET authored 13 years ago

Quite often, the original weights are multiple of 512. By prescaling them
by 1/512 when they are computed (once per frame), no intermediate shifting
is needed, and no prescaling on each call either.

The x86 code already used that trick.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>

272b252c

04 Apr, 2012 1 commit
- cosmetics: Consistently place static, inline and av_cold attributes/keywords. · 3dde147f
  Diego Biurrun authored 13 years ago
  
  3dde147f
12 Mar, 2012 1 commit

remove iwmmxt optimizations · 363bd1c6

Janne Grunau authored 13 years ago

The were broken since August of 2010 without anyone noticing until
three weeks ago. Nobody cares about it anymore and hopefully Marvell
will support NEON like in the PXA978 from now on.

363bd1c6

07 Mar, 2012 1 commit

dsputil: remove shift parameter from scalarproduct_int16 · 7e1ce6a6

Christophe GISQUET authored 13 years ago

There is only one caller, which does not need the shifting. Other use cases
are situations where different roundings would be needed.

The x86 and neon versions are modified accordingly.
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>

7e1ce6a6

02 Mar, 2012 1 commit
- vp8: change int stride to ptrdiff_t stride. · bd66f073
  Ronald S. Bultje authored 13 years ago
```
On 64bit platforms with 32bit int, this means we won't have to sign-
extend the integer anymore.
```
  bd66f073
23 Feb, 2012 1 commit
- SBR DSP: use intptr_t for the ixh parameter. · 2e74a5ab
  Christophe GISQUET authored 13 years ago
```
Signed-off-by: Ronald S. Bultje <rsbultje@gmail.com>
```
  2e74a5ab
20 Feb, 2012 1 commit

rv34: change most "int stride" into "ptrdiff_t stride". · 3ab9a2a5

Ronald S. Bultje authored 13 years ago

This prevents having to sign-extend on 64-bit systems with 32-bit ints,
such as x86-64. Also fixes crashes on systems where we don't do it and
arguments are not in registers, such as Win64 for all weight functions.

3ab9a2a5

15 Feb, 2012 2 commits
- mpegvideo: Add ff_ prefix to nonstatic functions · efd29844
  Martin Storsjö authored 13 years ago
```
Signed-off-by: Martin Storsjö <martin@martin.st>
```
  efd29844
- dsputil: Add ff_ prefix to the dsputil*_init* functions · 9cf0841e
  Martin Storsjö authored 13 years ago
```
Signed-off-by: Martin Storsjö <martin@martin.st>
```
  9cf0841e
09 Feb, 2012 1 commit
- arm: Add missing #include to vp8.h to fix a make checkheaders warning. · aa06d656
  Diego Biurrun authored 13 years ago
  
  aa06d656
06 Feb, 2012 1 commit
- doxygen: Do not include license boilerplates in Doxygen comment blocks. · 32f3c541
  Diego Biurrun authored 13 years ago
  
  32f3c541
02 Feb, 2012 1 commit

ARM: ac3: fix ac3_bit_alloc_calc_bap_armv6 · cd2f98f3

Mans Rullgard authored 13 years ago

This function was broken when the start bin was not at the start
of a band.
Signed-off-by: Mans Rullgard <mans@mansr.com>

cd2f98f3

28 Jan, 2012 1 commit

aacsbr: ARM NEON optimised sbrdsp functions · be822d77

Mans Rullgard authored 13 years ago

Overall speedup of HE-AAC decoding 2.3x on Cortex-A8, 1.2x on A9.
Signed-off-by: Mans Rullgard <mans@mansr.com>

be822d77

20 Jan, 2012 1 commit

ARM: fix build with FFT enabled and MDCT disabled · c3d5e290

Felipe Contreras authored 13 years ago

Signed-off-by: Felipe Contreras <felipe.contreras@gmail.com>
Signed-off-by: Mans Rullgard <mans@mansr.com>

c3d5e290

16 Jan, 2012 2 commits

rv34: add NEON rv34_idct_add · 9e12002f

Janne Grunau authored 13 years ago

Overall almost 4% faster, idct_add down from 350 to 85 cycles, idct_dc_add
down from 83 to 30 cycles.

squash: rv34 idct rearrange partial register loads

9e12002f

rv34: 1-pass inter MB reconstruction · 9ba9c340
Christophe GISQUET authored 13 years ago
```
Implement 1-pass inverse transform and reconstruction for inter blocks.
```
9ba9c340

13 Jan, 2012 2 commits

ARM: fix Thumb-mode simple_idct_arm · 71b3a63e

Mans Rullgard authored 13 years ago

The alignment directive must obviously precede the label.
This was never noticed in ARM mode since the location is
already aligned there.
Signed-off-by: Mans Rullgard <mans@mansr.com>

71b3a63e

ARM: 4-byte align start of all asm functions · 5c5e1ea3

Mans Rullgard authored 13 years ago

Due to apprent bugs in the GNU assembler and/or linker, relocations
can be incorrectly processed if the alignment of a Thumb instruction
is changed in the output file compared to the input object.

This fixes crashes in h264 decoding with Thumb enabled. No effect in
ARM mode since everything is 4-byte aligned there.
Signed-off-by: Mans Rullgard <mans@mansr.com>

5c5e1ea3