ldc/dcompute and shared (programmer managed cache) access

Johan Engelen j at j.nl
Sat Mar 6 11:46:39 UTC 2021


On Friday, 5 March 2021 at 04:49:56 UTC, Bruce Carneal wrote:
>
> The code generation through standalone (uninitialized) 
> __shared__ pointers, which we *can* declare, already emits the 
> correct PTX instructions (ld.shared, st.shared) but we have no 
> way, at least that I've found, to declare __shared__ address 
> space storage that we can point to.  (clang cuda can do this, 
> nvvc can obviously do this, and we can do this currently only 
> if we manually modify the .PTX file)

I'm not very familiar with DCompute, but sounds to me that all 
that is needed is a proposed syntax for what you want. The 
compiler support is probably pretty easy to add.

-Johan



More information about the digitalmars-d-ldc mailing list