Performance issue with @fastmath and vectorization

Johan Engelen via digitalmars-d-ldc digitalmars-d-ldc at puremagic.com
Sat Nov 12 03:04:59 PST 2016


On Saturday, 12 November 2016 at 10:56:20 UTC, deXtoRious wrote:
> On Saturday, 12 November 2016 at 10:47:42 UTC, Johan Engelen 
> wrote:
>>
>> Does the C++ need `__restrict__` for the parameters to get the 
>> assembly you want?
>
> In this case, it doesn't seem to make any difference.

That's good news, because afaik there is currently no way to 
express `__restrict__` in LDC code.
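
For reference, a minimal sketch of what `__restrict__` on the 
parameters looks like on the C++ side. The function `axpy` is a 
made-up example, not the kernel from this thread, and 
`__restrict__` is the GCC/Clang extension spelling, not standard 
C++:

```cpp
#include <cstddef>

// Hypothetical kernel for illustration only (not from the original example).
// __restrict__ is a GCC/Clang extension: it promises the compiler that
// x and y never alias, which may let the vectorizer skip the runtime
// overlap check it would otherwise emit before the vectorized loop.
void axpy(float a, const float* __restrict__ x, float* __restrict__ y,
          std::size_t n)
{
    for (std::size_t i = 0; i < n; ++i)
        y[i] += a * x[i];
}
```

If the assembly is the same with and without the qualifier, as 
reported above, aliasing is probably not what makes the two 
compilers differ.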

I hope you can cut the example down further, so that it's easier 
to figure out why the generated code differs (e.g. is -Ofast 
needed, or is -O3 enough?).

Thanks!

cheers,
   Johan




