Transferable Deep Reinforcement Learning for Cross-Domain Navigation: from Farmland to the Moon
Shreya Santra, Thomas Robbins, Kazuya Yoshida

TL;DR
This paper demonstrates that deep reinforcement learning policies trained in terrestrial environments can effectively transfer to extraterrestrial terrains like the Moon, enabling zero-shot lunar navigation with minimal retraining.
Contribution
It shows the feasibility of cross-domain DRL policy transfer for planetary navigation, reducing the need for environment-specific tuning and retraining.
Findings
Policies trained in farmland achieve ~50% success in lunar simulations
DRL enables zero-shot transfer across visually and topographically distinct domains
Transfer reduces retraining costs for planetary exploration robots
Abstract
Autonomous navigation in unstructured environments is essential for field and planetary robotics, where robots must efficiently reach goals while avoiding obstacles under uncertain conditions. Conventional algorithmic approaches often require extensive environment-specific tuning, limiting scalability to new domains. Deep Reinforcement Learning (DRL) provides a data-driven alternative, allowing robots to acquire navigation strategies through direct interactions with their environment. This work investigates the feasibility of DRL policy generalization across visually and topographically distinct simulated domains, where policies are trained in terrestrial settings and validated in a zero-shot manner in extraterrestrial environments. A 3D simulation of an agricultural rover is developed and trained using Proximal Policy Optimization (PPO) to achieve goal-directed navigation and obstacle…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
