V?: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Read Paper on Bytez