performance issues with SIMD function

Sergey kornburn at yandex.ru
Fri Nov 3 15:32:08 UTC 2023


On Friday, 3 November 2023 at 15:11:31 UTC, Bogdan wrote:
> Hi everyone,
>
> I was playing around with the intel-intrinsics library, trying 
> to improve the speed of a simple area function. I could not see 
> any performance improvements from the non-SIMD implementation. 
> The SIMD version is a little bit slower even with LDC2 and 
> --o3. Can anyone help me to understand what I am missing?
>
> Thanks!
> Bogdan

In your SIMD algorithm has not so many gain from using SIMD. The 
length of the loop is the same.
Also probably compiler applying some optimizations in regular 
versions, that doing almost the same.


More information about the Digitalmars-d-learn mailing list