This was causing 8-byte SSE-to-SSE moves involving registers xmm8-xmm15 to be misencoded on x86_64, leading to incorrect code generation in methods with lots of local variables of type double.