End-to-End Vision Tokenizer Tuning