Retrieve, Integrate, and Synthesize: Spatial-Semantic Grounded Latent Visual Reasoning | Read Paper on Bytez