# Seeker based Adaptive Guidance via Reinforcement Meta-Learning Applied   to Asteroid Close Proximity Operations

**Authors:** Brian Gaudet, Richard Linares, Roberto Furfaro

arXiv: 1907.06098 · 2020-09-16

## TL;DR

This paper presents an adaptive guidance system for asteroid proximity operations that uses reinforcement meta-learning to enable real-time adaptation to unknown environmental dynamics and internal disturbances, reducing reliance on prior environmental characterization.

## Contribution

The work introduces a reinforcement meta-learning based guidance policy with a recurrent network that adapts in real-time to environmental forces and spacecraft disturbances during asteroid maneuvers.

## Key findings

- Successfully simulated landing maneuvers with environmental variability.
- Demonstrated robustness to actuator failure and sensor bias.
- Validated real-time adaptation to changes in spacecraft's mass properties.

## Abstract

Current practice for asteroid close proximity maneuvers requires extremely accurate characterization of the environmental dynamics and precise spacecraft positioning prior to the maneuver. This creates a delay of several months between the spacecraft's arrival and the ability to safely complete close proximity maneuvers. In this work we develop an adaptive integrated guidance, navigation, and control system that can complete these maneuvers in environments with unknown dynamics, with initial conditions spanning a large deployment region, and without a shape model of the asteroid. The system is implemented as a policy optimized using reinforcement meta-learning. The spacecraft is equipped with an optical seeker that locks to either a terrain feature, back-scattered light from a targeting laser, or an active beacon, and the policy maps observations consisting of seeker angles and LIDAR range readings directly to engine thrust commands. The policy implements a recurrent network layer that allows the deployed policy to adapt real time to both environmental forces acting on the agent and internal disturbances such as actuator failure and center of mass variation. We validate the guidance system through simulated landing maneuvers in a six degrees-of-freedom simulator. The simulator randomizes the asteroid's characteristics such as solar radiation pressure, density, spin rate, and nutation angle, requiring the guidance and control system to adapt to the environment. We also demonstrate robustness to actuator failure, sensor bias, and changes in the spacecraft's center of mass and inertia tensor. Finally, we suggest a concept of operations for asteroid close proximity maneuvers that is compatible with the guidance system.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.06098/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/1907.06098/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/1907.06098/full.md

---
Source: https://tomesphere.com/paper/1907.06098