Separations in the Representational Capabilities of Transformers and Recurrent Architectures