SIMD implementation of dot-product. Benchmarks

Manu turkeyman at gmail.com
Sat Aug 17 18:46:30 PDT 2013


It doesn't look like you account for alignment.
This is basically not-portable (I doubt unaligned loads in this context are
faster than performing scalar operations), and possibly inefficient on x86
too.
To make it account for potentially random alignment will be awkward, but it
might be possible to do efficiently.


On 18 August 2013 04:50, Ilya Yaroshenko <ilyayaroshenko at gmail.com> wrote:

> http://spiceandmath.blogspot.**ru/2013/08/simd-**
> implementation-of-dot-product_**17.html<http://spiceandmath.blogspot.ru/2013/08/simd-implementation-of-dot-product_17.html>
>
> Ilya
>
>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.puremagic.com/pipermail/digitalmars-d-announce/attachments/20130818/ca7bc5f2/attachment.html>


More information about the Digitalmars-d-announce mailing list