Loading paper
Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding | Tomesphere