SURDS: Benchmarking Spatial Understanding and Reasoning in Driving Scenarios with Vision Language Models