toy windowing auto-vec miss

Bruce Carneal bcarneal at gmail.com
Mon Nov 7 01:59:03 UTC 2022


Here's a simple godbolt example of one area in which gdc solidly 
outperforms ldc wrt auto-vectorization: a simple but non-trivial 
operand gather.
https://godbolt.org/z/ox1vvxd8s


Compile-time, target-adaptive manual __vector-ization is an answer 
here if you have no access to SIMT, so this is not a show stopper, 
but the resulting code is less readable.
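
To make the readability cost concrete, here is a rough sketch of what that manual route can look like. It is not the code behind the godbolt link: the 3-tap window, the function names (window3Scalar, window3Manual) and the choice of 8 or 4 float lanes are all assumptions made for illustration. The vector width is picked at compile time with static if, and loads go through .array so no alignment is assumed.

```d
import core.simd;

// Widest float vector the compile target offers (assumption: try 8, then 4).
static if (is(Vector!(float[8])))
    enum size_t lanes = 8;
else static if (is(Vector!(float[4])))
    enum size_t lanes = 4;
else
    enum size_t lanes = 1; // no vector type available: scalar only

// Hypothetical 3-tap window, plain loop: dst[i] = src[i] + src[i+1] + src[i+2].
void window3Scalar(const(float)[] src, float[] dst)
{
    assert(src.length >= dst.length + 2);
    foreach (i; 0 .. dst.length)
        dst[i] = src[i] + src[i + 1] + src[i + 2];
}

// Same computation, written with explicit vectors where the target has them.
void window3Manual(const(float)[] src, float[] dst)
{
    assert(src.length >= dst.length + 2);
    size_t i = 0;
    static if (lanes > 1)
    {
        alias V = Vector!(float[lanes]);
        for (; i + lanes <= dst.length; i += lanes)
        {
            V a, b, c;
            // Three overlapping loads, copied through .array (unaligned-safe).
            a.array[] = src[i     .. i + lanes];
            b.array[] = src[i + 1 .. i + 1 + lanes];
            c.array[] = src[i + 2 .. i + 2 + lanes];
            V sum = a + b + c;
            dst[i .. i + lanes] = sum.array[];
        }
    }
    // Scalar tail (and the whole loop when lanes == 1).
    for (; i < dst.length; ++i)
        dst[i] = src[i] + src[i + 1] + src[i + 2];
}
```

The scalar version is three lines of intent; the manual one buries the same intent under lane bookkeeping and a tail loop. Which vector types is() accepts depends on the compiler and its target flags, so the selected width may differ between gdc and ldc for the same source.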

I'm not sure what the data parallel future should look like wrt 
language/IR but I'm pretty sure we can do better than praying 
that the auto vectorizer can dig patterns out of for loops, or 
throwing ourselves on the manual vectorization grenade, 
repeatedly.




More information about the digitalmars-d-ldc mailing list