Transformers without Normalization | Read Paper on Bytez