SIMD benchmark
Iain Buclaw
ibuclaw at ubuntu.com
Sun Jan 15 12:46:51 PST 2012
On 15 January 2012 19:01, bearophile <bearophileHUGS at lycos.com> wrote:
> Iain Buclaw:
>
>> Correction, 1.5x speed up without, 20x speed up with -O1, 30x speed up
>> with -O2 and above. My oh my...
>
> Please, show me the assembly code produced, with its relative D source :-)
>
> Bye,
> bearophile
For those who can't read AT&T:
----
.LC5:
.long 1067030938
.long 1067030938
.long 1067030938
.long 1067030938
.align 16
_D4test5test2FZNhG4f:
.cfi_startproc
mov eax, 3
cvtsi2ss xmm0, eax
mov al, 7
cvtsi2ss xmm1, eax
unpcklps xmm0, xmm0
unpcklps xmm1, xmm1
movlhps xmm0, xmm0
movlhps xmm1, xmm1
mulps xmm0, XMMWORD PTR .LC5[rip]
addps xmm0, xmm1
ret
.cfi_endproc
----
--
Iain Buclaw
*(p < e ? p++ : p) = (c & 0x0f) + '0';
More information about the Digitalmars-d
mailing list