• Janne Grunau's avatar
    arm64: int32_to_float_fmul neon asm · a0fc780a
    Janne Grunau authored
    3% faster dts decoding on a cortex-a57.
    
                                     cortex-a57   cortex-a53
    int32_to_float_fmul_array8_c:    1270.9       4475.6
    int32_to_float_fmul_array8_neon:  328.6        569.2
    int32_to_float_fmul_scalar_c:     928.5       4119.6
    int32_to_float_fmul_scalar_neon:  309.1        524.1
    a0fc780a
fmtconvert_neon.S 2.7 KB