Make register usage in macros explicit; change mulsub_2w_4x to use 2 instead of 3 temp registers.