Commit 355903f5 authored by Michael Niedermayer's avatar Michael Niedermayer

make high precission synth filter 3 times faster on x86

Originally committed as revision 6046 to svn://svn.ffmpeg.org/ffmpeg/trunk
parent 0bd2483a
...@@ -784,8 +784,13 @@ static inline int round_sample(int64_t *sum) ...@@ -784,8 +784,13 @@ static inline int round_sample(int64_t *sum)
return sum1; return sum1;
} }
#ifdef ARCH_X86
/* ask gcc devels why this is 3 times faster then the generic code below */
#define MULS(ra, rb) \
({ int64_t rt; asm ("imull %2\n\t" : "=A"(rt) : "a" (ra), "g" (rb)); rt; })
#else
#define MULS(ra, rb) MUL64(ra, rb) #define MULS(ra, rb) MUL64(ra, rb)
#endif
#endif #endif
#define SUM8(sum, op, w, p) \ #define SUM8(sum, op, w, p) \
......
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment