Performance issue with @fastmath and vectorization
Johan Engelen via digitalmars-d-ldc
digitalmars-d-ldc at puremagic.com
Sat Nov 12 03:04:59 PST 2016
On Saturday, 12 November 2016 at 10:56:20 UTC, deXtoRious wrote:
> On Saturday, 12 November 2016 at 10:47:42 UTC, Johan Engelen
> wrote:
>>
>> Does the C++ need `__restrict__` for the parameters to get the
>> assembly you want?
>
> In this case, it doesn't seem to make any difference.
That's good news, because as far as I know there is currently no
way to express `__restrict__` in D code compiled with LDC.
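For readers unfamiliar with the qualifier, here is a minimal sketch of what `__restrict__` on parameters looks like in C++. The function `axpy` and its signature are hypothetical and are not taken from the example under discussion; `__restrict__` is a GCC/Clang extension, not standard C++.

```cpp
#include <cstddef>

// Hypothetical kernel (not from the thread): y[i] += a * x[i].
// __restrict__ promises the compiler that x and y never overlap, so it
// may vectorize the loop without emitting a runtime aliasing check.
// Calling this with overlapping x and y would be undefined behaviour.
void axpy(float a, const float* __restrict__ x,
          float* __restrict__ y, std::size_t n) {
    for (std::size_t i = 0; i < n; ++i) {
        y[i] += a * x[i];
    }
}
```

Without the qualifier the compiler usually can still vectorize such a loop by guarding it with an overlap check, which may be why it makes no visible difference here.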
I hope you can cut the example down further, so it's easier to
figure out why the generated code differs (e.g. is -Ofast needed,
or is -O3 enough?).
Thanks!
cheers,
Johan