Commit e9da9a31 authored by Justin Ruggles

lavr: x86: improve non-SSE4 version of S16_TO_S32_SX macro

Removes a false dependency on the existing contents of the 2nd dst register,
giving better performance with out-of-order execution (OOE).
parent 743f0706
@@ -26,7 +26,8 @@
pmovsxwd m%1, m%1
SWAP %1, %2
%else
-punpckhwd m%2, m%1
+mova m%2, m%1
+punpckhwd m%2, m%2
punpcklwd m%1, m%1
psrad m%2, 16
psrad m%1, 16
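
The non-SSE4 branch sign-extends 16-bit samples by interleaving a register with itself and then shifting each 32-bit lane right arithmetically by 16. The following is a minimal sketch of that data transformation in C with SSE2 intrinsics, not the lavr code itself: the function names `s16_to_s32_sse2` and `s16_to_s32_sse2_selfcheck` are hypothetical, and the load/store layout is an assumption made for illustration. Because both unpack operands are the same value, the result depends only on the source, which is the property the patch restores for the high half.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2 intrinsics */
#include <stdint.h>

/* Hypothetical helper (not part of lavr): sign-extend eight int16 values
 * to int32 using only SSE2, mirroring the non-SSE4 branch of the macro. */
static void s16_to_s32_sse2(const int16_t *src, int32_t *dst)
{
    __m128i v = _mm_loadu_si128((const __m128i *)src);

    /* Interleave the register with itself: each 32-bit lane now holds the
     * same 16-bit word in both its upper and lower half. */
    __m128i hi = _mm_unpackhi_epi16(v, v);  /* punpckhwd: words 4..7 */
    __m128i lo = _mm_unpacklo_epi16(v, v);  /* punpcklwd: words 0..3 */

    /* Arithmetic shift right by 16 discards the lower copy and replicates
     * the sign bit of the upper copy into the top 16 bits. */
    hi = _mm_srai_epi32(hi, 16);            /* psrad */
    lo = _mm_srai_epi32(lo, 16);            /* psrad */

    _mm_storeu_si128((__m128i *)dst, lo);
    _mm_storeu_si128((__m128i *)(dst + 4), hi);
}

/* Hypothetical self-check: compare against a plain scalar conversion. */
static void s16_to_s32_sse2_selfcheck(void)
{
    const int16_t in[8] = { 0, 1, -1, 32767, -32768, 1234, -1234, 42 };
    int32_t out[8];

    s16_to_s32_sse2(in, out);
    for (int i = 0; i < 8; i++)
        assert(out[i] == (int32_t)in[i]);
}
```

With intrinsics the compiler picks the registers, so the false dependency itself cannot be reproduced here; the sketch only shows why `punpckhwd m%2, m%2` after `mova m%2, m%1` produces the same values as the old `punpckhwd m%2, m%1` once the shift discards the low half of each lane.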