Visual Perception by Large Language Model's Weights