Scientific computing and parallel computing C++23/C++26
rumpf_a at web.de
Thu Jan 20 04:01:09 UTC 2022
On Thursday, 20 January 2022 at 00:43:30 UTC, Nicholas Wilson
> I mean there are parametric attributes of the hardware, say for
> example cache size (or available registers for GPUs), that have
> a direct effect on how many times you can unroll the inner
> loop, say for a windowing function, and you want to ship
> optimised code for multiple configurations of hardware.
> You can much more easily create multiple copies for different
> sized cache (or register availability) in D than you can in
> C++, because static foreach and static if >>> if constexpr.
And you can do that even more easily with an AST macro system.
Which Julia has...
More information about the Digitalmars-d