Multi-step Visual Reasoning with Visual Tokens Scaling and Verification