Zero-shot object goal visual navigation

Qianfan Zhao, Lu Zhang, Bin He, Hong Qiao, and Zhiyong Liu

arXiv:2206.07423 · cs.CV · February 21, 2023

TL;DR

This paper introduces a zero-shot object goal visual navigation framework that lets a robot find targets from novel classes without any training samples of those classes. It uses object detection results and the semantic similarity between word embeddings to generalize to unseen classes.

Contribution

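The paper proposes the semantic similarity network (SSNet), a zero-shot object navigation framework that takes detection results and the cosine similarity between semantic word embeddings as input, with the aim of improving generalization to unseen object classes in visual navigation tasks.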

Findings

01. Outperforms baseline models in zero-shot navigation tasks

02. Demonstrates strong generalization to novel object classes

03. Validated on the AI2-THOR platform

Abstract

Object goal visual navigation is a challenging task that aims to guide a robot to find the target object based on its visual observation, with the target limited to the classes predefined in the training stage. However, in real households, there may be numerous target classes that the robot needs to deal with, and it is hard to cover all of these classes in the training stage. To address this challenge, we study the zero-shot object goal visual navigation task, which aims to guide robots to find targets belonging to novel classes without any training samples. To this end, we also propose a novel zero-shot object navigation framework called the semantic similarity network (SSNet). Our framework uses the detection results and the cosine similarity between semantic word embeddings as input. This type of input data has a weak correlation with classes and thus our framework has…
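As a rough illustration of the input described in the abstract, the sketch below computes the cosine similarity between a target word embedding and the embeddings of detected classes, then pairs each score with the detection's confidence and box. It is not SSNet's actual input encoding: the 3-dimensional vectors, the `TOY_EMBEDDINGS` table, the `similarity_features` helper, and the `(label, confidence, box)` detection layout are all assumptions made for this example.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # convention chosen here for zero vectors
    return dot / (norm_u * norm_v)


# Toy 3-d "word embeddings", invented for illustration only.
# Real word embeddings would have far more dimensions.
TOY_EMBEDDINGS = {
    "mug": [0.9, 0.1, 0.0],
    "cup": [0.8, 0.2, 0.1],
    "sofa": [0.0, 0.2, 0.9],
}


def similarity_features(target, detections, embeddings):
    """Replace each detection's class label with its similarity to the target.

    `detections` is a list of (label, confidence, box) tuples; this layout
    is hypothetical and not taken from the paper.
    """
    target_vec = embeddings[target]
    features = []
    for label, confidence, box in detections:
        sim = cosine_similarity(target_vec, embeddings[label])
        # The label itself is dropped, so the feature carries no class identity.
        features.append({"similarity": sim, "confidence": confidence, "box": box})
    return features


detections = [
    ("cup", 0.92, (10, 20, 50, 60)),
    ("sofa", 0.88, (100, 40, 300, 200)),
]
features = similarity_features("mug", detections, TOY_EMBEDDINGS)
for feature in features:
    print(round(feature["similarity"], 3), feature["confidence"], feature["box"])
```

In this toy setup the target "mug" does not need to appear among the detected labels: the "cup" detection simply receives a higher similarity score than the "sofa" detection. Because the feature keeps the score rather than the class name, it depends only weakly on which classes were seen in training, which is the property the abstract points to.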

Code & Models

Repositories

pioneer-innovation/zero-shot-object-navigation (PyTorch, official)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Advanced Neural Network Applications