Reducing the cost of autodecoding

Andrei Alexandrescu via Digitalmars-d digitalmars-d at puremagic.com
Wed Oct 12 18:27:35 PDT 2016


On 10/12/2016 08:41 PM, safety0ff wrote:
> On Thursday, 13 October 2016 at 00:32:36 UTC, safety0ff wrote:
>>
>> It made little difference: LDC compiled into AVX2 vectorized addition
>> (vpmovzxbq & vpaddq.)
>
> Measurements without -mcpu=native:
> overhead 0.336s
> bytes    0.610s
> without branch hints 0.852s
> code pasted 0.766s

So we should be able to reduce overhead by means of proper code 
arrangement and interplay of inlining and outlining. The prize, however, 
would be to get the AVX instructions for ASCII going. Is that possible? 
-- Andrei


More information about the Digitalmars-d mailing list