Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via   LIDAR Altimetry and Reinforcement Meta-Learning

Brian Gaudet; Richard Linares; Roberto Furfaro

arXiv:1911.08553·eess.SY·September 16, 2020

Six Degree-of-Freedom Body-Fixed Hovering over Unmapped Asteroids via LIDAR Altimetry and Reinforcement Meta-Learning

Brian Gaudet, Richard Linares, Roberto Furfaro

PDF

TL;DR

This paper presents a reinforcement meta-learning approach for six-degree-of-freedom hovering over unknown asteroids using LIDAR, enabling robust, shape-model-free control that generalizes to real asteroid shapes and conditions.

Contribution

The paper introduces a novel reinforcement meta-learning policy that enables asteroid hovering without prior shape or navigation data, adaptable to actuator failure and sensor noise.

Findings

01

Policy generalizes to real asteroid shapes (Bennu, Itokawa)

02

Operates without position/velocity estimates or shape models

03

Adapts to actuator failure and sensor noise

Abstract

We optimize a six degrees of freedom hovering policy using reinforcement meta-learning. The policy maps flash LIDAR measurements directly to on/off spacecraft body-frame thrust commands, allowing hovering at a fixed position and attitude in the asteroid body-fixed reference frame. Importantly, the policy does not require position and velocity estimates, and can operate in environments with unknown dynamics, and without an asteroid shape model or navigation aids. Indeed, during optimization the agent is confronted with a new randomly generated asteroid for each episode, insuring that it does not learn an asteroid's shape, texture, or environmental dynamics. This allows the deployed policy to generalize well to novel asteroid characteristics, which we demonstrate in our experiments. Moreover, our experiments show that the optimized policy adapts to actuator failure and sensor noise.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.