Training Language Models to Reason Efficiently | Read Paper on Bytez