Loading paper
Seeing Across Views: Benchmarking Spatial Reasoning of Vision-Language Models in Robotic Scenes | Tomesphere