Reducing the cost of autodecoding

safety0ff via Digitalmars-d digitalmars-d at puremagic.com
Wed Oct 12 17:41:16 PDT 2016


On Thursday, 13 October 2016 at 00:32:36 UTC, safety0ff wrote:
>
> It made little difference: LDC compiled into AVX2 vectorized 
> addition (vpmovzxbq & vpaddq.)

Measurements without -mcpu=native:
overhead               0.336s
bytes                  0.610s
without branch hints   0.852s
code pasted            0.766s

