• Mans Rullgard's avatar
    celp: optimise ff_celp_lp_synthesis_filter() · fddc5b9b
    Mans Rullgard authored
    Adding instead of subtracting the products in the loop allows the
    compiler to generate more efficient multiply-accumulate instructions
    when 16-bit multiply-subtract is not available. ARM has only
    multiply-accumulate for 16-bit operands.  In general, if only one
    variant exists, it is usually accumulate rather than subtract.
    
    In the same spirit, using the dedicated saturation function enables
    use of any special optimised versions of this.
    Signed-off-by: 's avatarMans Rullgard <mans@mansr.com>
    fddc5b9b
celp_filters.c 5.47 KB