Loading paper
Environmental Understanding Vision-Language Model for Embodied Agent | Tomesphere