Character-Level Language Modeling with Deeper Self-Attention