SPACE: Noise Contrastive Estimation Stabilizes Self-Play Fine-Tuning for Large Language Models | Read Paper on Bytez