I was curious if anyone has tried implementing the GC mark phase with SIMD instrs? If so, what were the results, and why didn't it get accepted? At first thought, it seems like a simple project that could improve performance of big-memory programs by a significant margin, right?