BackSlash: Rate Constrained Optimized Training of Large Language Models