Training Chain-of-Thought via Latent-Variable Inference | Read Paper on Bytez