ENAT: Rethinking Spatial-temporal Interactions in Token-based Image Synthesis