Prioritized Semantic Learning for Zero-shot Instance Navigation

Xinyu Sun; Lizhao Liu; Hongyan Zhi; Ronghe Qiu; Junwei Liang

arXiv:2403.11650 · cs.CV · July 18, 2024 · 1 citation

TL;DR

This paper introduces Prioritized Semantic Learning (PSL), an approach that improves the semantic understanding of zero-shot navigation agents trained without object annotations, leading to significant performance gains on object and instance navigation tasks.

Contribution

The paper proposes a semantic-enhanced training strategy and inference scheme for zero-shot navigation, and introduces the InstanceNav task, in which the agent must navigate to a specific object instance rather than to any object of a given category.

Findings

01. Outperforms the previous state of the art by 66% on zero-shot ObjectNav in terms of success rate

02. Also achieves superior results on the new InstanceNav task

03. Demonstrates improved semantic understanding in navigation agents

Abstract

We study zero-shot instance navigation, in which the agent navigates to a specific object without using object annotations for training. Previous object navigation approaches apply the image-goal navigation (ImageNav) task (go to the location of an image) for pretraining, and transfer the agent to achieve object goals using a vision-language model. However, these approaches lead to issues of semantic neglect, where the model fails to learn meaningful semantic alignments. In this paper, we propose a Prioritized Semantic Learning (PSL) method to improve the semantic understanding ability of navigation agents. Specifically, a semantic-enhanced PSL agent is proposed and a prioritized semantic training strategy is introduced to select goal images that exhibit clear semantic supervision and relax the reward function from strict exact view matching. At inference time, a semantic expansion…

Code & Models

Repositories

xinyusun/psl-instancenav
PyTorch · Official

Taxonomy

Topics: Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Robotic Path Planning Algorithms