LMD-PGN: Cross-Modal Knowledge Distillation from First-Person-View Images to Third-Person-View BEV Maps for Universal Point Goal Navigation
Riku Uemura, Kanji Tanaka, Kenta Tsukahara, Daiki Iwata

TL;DR
This paper introduces a cross-modal knowledge distillation framework for point goal navigation, enabling models trained on first-person views to be transferred to third-person view representations across diverse robot platforms.
Contribution
It proposes a novel knowledge transfer method that maps first-person view navigation models to third-person view representations, enhancing multi-robot generalizability in point goal navigation.
Findings
Effective transfer of FPV to TPV representations demonstrated in Habitat-Sim.
Minimal implementation effort required for cross-platform PGN.
Potential extension to 3D action spaces like drones.
Abstract
Point goal navigation (PGN) is a mapless navigation approach that trains robots to visually navigate to goal points without relying on pre-built maps. Despite significant progress in handling complex environments using deep reinforcement learning, current PGN methods are designed for single-robot systems, limiting their generalizability to multi-robot scenarios with diverse platforms. This paper addresses this limitation by proposing a knowledge transfer framework for PGN, allowing a teacher robot to transfer its learned navigation model to student robots, including those with unknown or black-box platforms. We introduce a novel knowledge distillation (KD) framework that transfers first-person-view (FPV) representations (view images, turning/forward actions) to universally applicable third-person-view (TPV) representations (local maps, subgoals). The state is redefined as reconstructed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInertial Sensor and Navigation · Robotics and Sensor-Based Localization · Target Tracking and Data Fusion in Sensor Networks
