Loading paper
Zero-Shot 3D Visual Grounding from Vision-Language Models | Tomesphere