Loading paper
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners | Tomesphere