On Sunday, 7 September 2014 at 14:25:51 UTC, Ola Fosheim Grøstad wrote: > Modern SIMD can act on 32 bytes in parallel, so libraries that Actually, latest gen AVX-512 can work on 64 bytes per instruction…