Loading paper
CityCube: Benchmarking Cross-view Spatial Reasoning on Vision-Language Models in Urban Environments | Tomesphere