Prioritized Semantic Learning for Zero-shot Instance Navigation
Xinyu Sun, Lizhao Liu, Hongyan Zhi, Ronghe Qiu, Junwei Liang

TL;DR
This paper introduces a Prioritized Semantic Learning approach to enhance zero-shot instance navigation by improving semantic understanding, leading to significant performance gains on object and instance navigation tasks without object annotations.
Contribution
The paper proposes a novel semantic-enhanced training strategy and inference scheme for zero-shot navigation, and introduces the InstanceNav task for detailed object instance navigation.
Findings
Outperforms previous state-of-the-art by 66% success rate on zero-shot ObjectNav
Achieves superior results on the new InstanceNav task
Demonstrates improved semantic understanding in navigation agents
Abstract
We study zero-shot instance navigation, in which the agent navigates to a specific object without using object annotations for training. Previous object navigation approaches apply the image-goal navigation (ImageNav) task (go to the location of an image) for pretraining, and transfer the agent to achieve object goals using a vision-language model. However, these approaches lead to issues of semantic neglect, where the model fails to learn meaningful semantic alignments. In this paper, we propose a Prioritized Semantic Learning (PSL) method to improve the semantic understanding ability of navigation agents. Specifically, a semantic-enhanced PSL agent is proposed and a prioritized semantic training strategy is introduced to select goal images that exhibit clear semantic supervision and relax the reward function from strict exact view matching. At inference time, a semantic expansion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Robotic Path Planning Algorithms
