Visually-Guided Policy Optimization for Multimodal Reasoning | Read Paper on Bytez