Teacher-Student Reinforcement Learning for Mapless Navigation using a Planetary Space Rover
Anton Bj{\o}rndahl Mortensen, Emil Tribler Pedersen, Laia Vives, Benedicto, Lionel Burg, Mads Rossen Madsen, Simon B{\o}gh

TL;DR
This paper presents a novel teacher-student reinforcement learning approach for mapless navigation of planetary rovers, improving real-world robustness and sim-to-real transfer in unpredictable terrains.
Contribution
Introduces a two-stage RL method using offline noisy data and a teacher-student paradigm to enhance rover navigation robustness and transferability.
Findings
Student policy outperforms teacher in real-world tests
Enhanced noise resilience in student policy
Improved sim-to-real transfer effectiveness
Abstract
We address the challenge of enhancing navigation autonomy for planetary space rovers using reinforcement learning (RL). The ambition of future space missions necessitates advanced autonomous navigation capabilities for rovers to meet mission objectives. RL's potential in robotic autonomy is evident, but its reliance on simulations poses a challenge. Transferring policies to real-world scenarios often encounters the "reality gap", disrupting the transition from virtual to physical environments. The reality gap is exacerbated in the context of mapless navigation on Mars and Moon-like terrains, where unpredictable terrains and environmental factors play a significant role. Effective navigation requires a method attuned to these complexities and real-world data noise. We introduce a novel two-stage RL approach using offline noisy data. Our approach employs a teacher-student policy learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModular Robots and Swarm Intelligence · Reinforcement Learning in Robotics · Robotic Path Planning Algorithms
