Reinforcement Learning for Robust Missile Autopilot Design

Bernardo Cortez

arXiv:2011.12956·cs.LG·September 21, 2021

Reinforcement Learning for Robust Missile Autopilot Design

Bernardo Cortez

PDF

Open Access

TL;DR

This paper pioneers the use of Reinforcement Learning for missile autopilot control, demonstrating that it can achieve both optimal performance and robustness to uncertainties through novel training methodologies.

Contribution

It introduces a new RL framework for flight control, including the SER training methodology, and enhances experience replay with BPER and reformulated HER techniques.

Findings

01

RL can control missile flight effectively.

02

Training in non-nominal environments improves robustness.

03

The proposed methods maintain nominal performance while increasing robustness.

Abstract

Designing missiles' autopilot controllers has been a complex task, given the extensive flight envelope and the nonlinear flight dynamics. A solution that can excel both in nominal performance and in robustness to uncertainties is still to be found. While Control Theory often debouches into parameters' scheduling procedures, Reinforcement Learning has presented interesting results in ever more complex tasks, going from videogames to robotic tasks with continuous action domains. However, it still lacks clearer insights on how to find adequate reward functions and exploration strategies. To the best of our knowledge, this work is pioneer in proposing Reinforcement Learning as a framework for flight control. In fact, it aims at training a model-free agent that can control the longitudinal flight of a missile, achieving optimal performance and robustness to uncertainties. To that end, under…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Guidance and Control Systems · Adversarial Robustness in Machine Learning

MethodsExperience Replay · Prioritized Experience Replay