Active Domain-Invariant Self-Localization Using Ego-Centric and   World-Centric Maps

Kanya Kurauchi; Kanji Tanaka; Ryogo Yamamoto; and Mitsuki Yoshida

arXiv:2204.10497·cs.RO·July 29, 2022

Active Domain-Invariant Self-Localization Using Ego-Centric and World-Centric Maps

Kanya Kurauchi, Kanji Tanaka, Ryogo Yamamoto, and Mitsuki Yoshida

PDF

Open Access

TL;DR

This paper introduces a domain-invariant next-best-view planner for visual place recognition in autonomous navigation, leveraging CNN features from both ego-centric and world-centric maps to improve robot localization without extensive data collection.

Contribution

It proposes a novel domain-invariant NBV planning framework that uses CNN intermediate and output layer cues, trained with deep reinforcement learning, to enhance robot self-localization.

Findings

01

Validated on NCLT dataset showing improved localization accuracy

02

Effective transfer of domain-invariant features to NBV planning

03

Reduced data collection costs for visual place recognition

Abstract

The training of a next-best-view (NBV) planner for visual place recognition (VPR) is a fundamentally important task in autonomous robot navigation, for which a typical approach is the use of visual experiences that are collected in the target domain as training data. However, the collection of a wide variety of visual experiences in everyday navigation is costly and prohibitive for real-time robotic applications. We address this issue by employing a novel {\it domain-invariant} NBV planner. A standard VPR subsystem based on a convolutional neural network (CNN) is assumed to be available, and its domain-invariant state recognition ability is proposed to be transferred to train the domain-invariant NBV planner. Specifically, we divide the visual cues that are available from the CNN model into two types: the output layer cue (OLC) and intermediate layer cue (ILC). The OLC is available at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications