Language Models with Transformers | Read Paper on Bytez