Scaling Inference-Efficient Language Models | Read Paper on Bytez