- 02 Apr, 2018 1 commit
-
-
Kieran Kunhya authored
This is a profile supporting > 8-bit video and has a higher quality DCT
-
- 28 Jun, 2017 2 commits
-
-
James Darnley authored
They now match according to FATE, barring any further bugs with untested parts
-
James Darnley authored
Includes add/put functions Rounding contributed by Ronald S. Bultje
-
- 30 May, 2017 2 commits
-
-
James Darnley authored
-
James Darnley authored
-
- 06 Apr, 2017 2 commits
-
-
Ronald S. Bultje authored
-
Ronald S. Bultje authored
These use the mmx IDCT, but sse2 put/add_pixels_clamped implementations. This way we don't need to use the ff_put/add_pixels_clamped function pointers.
-
- 13 Oct, 2015 2 commits
-
-
Christophe Gisquet authored
On 12 frames of a 444p 12 bits DNxHR sequence, _put function: C: 78902 decicycles in idct, 262071 runs, 73 skips avx: 32478 decicycles in idct, 262045 runs, 99 skips Difference between the 2: stddev: 0.39 PSNR:104.47 MAXDIFF: 2 This is unavoidable and due to the scale factors used in the x86 version, which cannot match the C ones. In addition, the trick of adding an initial bias to the input of a pass can overflow, as the input coefficients are already 15bits, which is the maximum this function can handle. Overall, however, the omse on 12 bits samples goes from 0.16916 to 0.16883. Reducing rowshift by 1 improves to 0.0908, but causes overflows. Signed-off-by:
Michael Niedermayer <michael@niedermayer.cc>
-
Christophe Gisquet authored
Modeled from the prores version. Clips to [0;1023] and is bitexact. Bitexactness requires to add offsets in different places compared to prores or C, and makes the function approximately 2% slower. For 16 frames of a DNxHD 4:2:2 10bits test sequence: C: 60861 decicycles in idct, 1048205 runs, 371 skips sse2: 27567 decicycles in idct, 1048216 runs, 360 skips avx: 26272 decicycles in idct, 1048171 runs, 405 skips The add version is not implemented, so the corresponding dsp function is set to NULL to make it clear in a code executing it. Signed-off-by:
Michael Niedermayer <michael@niedermayer.cc>
-
- 25 Sep, 2014 1 commit
-
-
James Almer authored
Also add sse2 versions for both. put_pixels_clamped port and sse2 version originally written by Timothy Gu. Reviewed-by:
Michael Niedermayer <michaelni@gmx.at> Signed-off-by:
James Almer <jamrial@gmail.com>
-
- 08 Aug, 2014 1 commit
-
-
Diego Biurrun authored
This fixes cases where the scantable permuation would get overwritten by the general idctdsp initialization.
-
- 01 Aug, 2014 1 commit
-
-
Diego Biurrun authored
The Xvid IDCT is only required to decode some Xvid-encoded MPEG-4 files, so there is no point in having it as an unconditional part of idctdsp.
-
- 19 Jul, 2014 1 commit
-
-
Diego Biurrun authored
-
- 18 Jul, 2014 2 commits
-
-
Diego Biurrun authored
-
Diego Biurrun authored
Also rename the enum values to be consistent with other DCT permutations.
-
- 30 Jun, 2014 1 commit
-
-
Diego Biurrun authored
-