Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs | Read Paper on Bytez