SIMD benchmark

Iain Buclaw ibuclaw at ubuntu.com
Sun Jan 15 12:46:51 PST 2012


On 15 January 2012 19:01, bearophile <bearophileHUGS at lycos.com> wrote:
> Iain Buclaw:
>
>> Correction, 1.5x speed up without, 20x speed up with -O1, 30x speed up
>> with -O2 and above.  My oh my...
>
> Please, show me the assembly code produced, with its relative D source :-)
>
> Bye,
> bearophile

For those who can't read AT&T:
----
.LC5:
        .long   1067030938
        .long   1067030938
        .long   1067030938
        .long   1067030938
        .align 16

_D4test5test2FZNhG4f:
        .cfi_startproc
        mov     eax, 3
        cvtsi2ss        xmm0, eax
        mov     al, 7
        cvtsi2ss        xmm1, eax
        unpcklps        xmm0, xmm0
        unpcklps        xmm1, xmm1
        movlhps xmm0, xmm0
        movlhps xmm1, xmm1
        mulps   xmm0, XMMWORD PTR .LC5[rip]
        addps   xmm0, xmm1
        ret
        .cfi_endproc
----


-- 
Iain Buclaw

*(p < e ? p++ : p) = (c & 0x0f) + '0';


More information about the Digitalmars-d mailing list