See&Trek: Training-Free Spatial Prompting for Multimodal Large Language Model