SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models | Read Paper on Bytez