Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via LIDAR Altimetry and Reinforcement Meta-Learning
Brian Gaudet, Richard Linares, Roberto Furfaro

TL;DR
This paper presents a reinforcement meta-learning approach for six-degree-of-freedom hovering over unknown asteroids using LIDAR, enabling robust, shape-model-free control that generalizes to real asteroid shapes and conditions.
Contribution
The paper introduces a novel reinforcement meta-learning policy that enables asteroid hovering without prior shape or navigation data, adaptable to actuator failure and sensor noise.
Findings
Policy generalizes to real asteroid shapes (Bennu, Itokawa)
Operates without position/velocity estimates or shape models
Adapts to actuator failure and sensor noise
Abstract
We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, insuring that it does not learn an asteroid's shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. Moreover, our experiments show that the optimized policy adapts to actuator failure and sensor noise.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
