How to deal with inline asm functions in Phobos/druntime?
Daniel Murphy via digitalmars-d-ldc
digitalmars-d-ldc at puremagic.com
Wed Apr 8 06:15:25 PDT 2015
"Johan Engelen" wrote in message
news:hqpjetsgaeoqkfyqexka at forum.dlang.org...
> About SSE: I can't vectorize the code for this one function with one real
> as argument! I had done a brief search for what instructions are available
> on xmm regs (argument is passed through xmm0), but it is mostly simple
> arithmetic I think, not the kind of stuff that is used in the original
> druntime asm code.
> (Btw, the pro-epilogues also consist of pushing/popping all XMM regs,
> quite a bit of data, but indeed no clue how slow/fast that is. Didn't
> measure a thing, but it just looked kind of wasteful :)
I don't think it's so much about vectorizing as it is about avoiding the x87
FPU, which you can do when 80-bit precision is not needed.
More information about the digitalmars-d-ldc