I tested that, modulus is slower. The compiler is surely
converting it to something branchless like:
uint iter_next = (iter + 1) * !(iter + 1 > k);
I take your point but I think most people know that the equals
operators have the lowest associativity.