make high precission synth filter 3 times faster on x86

Originally committed as revision 6046 to svn://svn.ffmpeg.org/ffmpeg/trunk

make high precission synth filter 3 times faster on x86
Originally committed as revision 6046 to svn://svn.ffmpeg.org/ffmpeg/trunk
355903f5 · Michael Niedermayer · 0bd2483a · 355903f5
Commit 355903f5 authored Aug 22, 2006 by Michael Niedermayer
Show whitespace changes
Inline Side-by-side

Showing with 6 additions and 1 deletion

mpegaudiodec.c libavcodec/mpegaudiodec.c +6 -1

No files found.
--- a/libavcodec/mpegaudiodec.c
+++ b/libavcodec/mpegaudiodec.c
@@ -784,8 +784,13 @@ static inline int round_sample(int64_t *sum)
    return sum1;
 }
+#ifdef ARCH_X86
+/* ask gcc devels why this is 3 times faster then the generic code below */
+#define MULS(ra, rb) \
+    ({ int64_t rt; asm ("imull %2\n\t" : "=A"(rt) : "a" (ra), "g" (rb)); rt; })
+#else
 #define MULS(ra, rb) MUL64(ra, rb)
+#endif
 #endif
 #define SUM8(sum, op, w, p) \