Loading paper
D3D-VLP: Dynamic 3D Vision-Language-Planning Model for Embodied Grounding and Navigation | Tomesphere