Loading paper
SpatialBot: Precise Spatial Understanding with Vision Language Models | Tomesphere