Transformers on Markov data: Constant depth suffices | Read Paper on Bytez