dsimcha: > I see at least part of the problem. When you use such huge arrays, it ends up > being more a test of your memory bandwidth than of the vector ops. Right. Finding good benchmarks is not easy, and I have shown the code here for people to spot problems in it. I have added a C version too now. Bye, bearophile