Vision Foundation Models as Effective Visual Tokenizers for Autoregressive Generation | Read Paper on Bytez