Teacher-Student Reinforcement Learning for Mapless Navigation using a   Planetary Space Rover

Anton Bj{\o}rndahl Mortensen; Emil Tribler Pedersen; Laia Vives; Benedicto; Lionel Burg; Mads Rossen Madsen; Simon B{\o}gh

arXiv:2309.12807·cs.RO·September 25, 2023

Teacher-Student Reinforcement Learning for Mapless Navigation using a Planetary Space Rover

Anton Bj{\o}rndahl Mortensen, Emil Tribler Pedersen, Laia Vives, Benedicto, Lionel Burg, Mads Rossen Madsen, Simon B{\o}gh

PDF

Open Access

TL;DR

This paper presents a novel teacher-student reinforcement learning approach for mapless navigation of planetary rovers, improving real-world robustness and sim-to-real transfer in unpredictable terrains.

Contribution

Introduces a two-stage RL method using offline noisy data and a teacher-student paradigm to enhance rover navigation robustness and transferability.

Findings

01

Student policy outperforms teacher in real-world tests

02

Enhanced noise resilience in student policy

03

Improved sim-to-real transfer effectiveness

Abstract

We address the challenge of enhancing navigation autonomy for planetary space rovers using reinforcement learning (RL). The ambition of future space missions necessitates advanced autonomous navigation capabilities for rovers to meet mission objectives. RL's potential in robotic autonomy is evident, but its reliance on simulations poses a challenge. Transferring policies to real-world scenarios often encounters the "reality gap", disrupting the transition from virtual to physical environments. The reality gap is exacerbated in the context of mapless navigation on Mars and Moon-like terrains, where unpredictable terrains and environmental factors play a significant role. Effective navigation requires a method attuned to these complexities and real-world data noise. We introduce a novel two-stage RL approach using offline noisy data. Our approach employs a teacher-student policy learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModular Robots and Swarm Intelligence · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms