How to Train your Quadrotor: A Framework for Consistently Smooth and   Responsive Flight Control via Reinforcement Learning

Siddharth Mysore; Bassel Mabsout; Kate Saenko; Renato Mancuso

arXiv:2012.06656·cs.RO·October 14, 2021

How to Train your Quadrotor: A Framework for Consistently Smooth and Responsive Flight Control via Reinforcement Learning

Siddharth Mysore, Bassel Mabsout, Kate Saenko, Renato Mancuso

PDF

TL;DR

This paper introduces RE+AL, a systematic framework for training reinforcement learning agents for quadrotor control that are smooth, stable, and transferable from simulation to real hardware, outperforming traditional controllers.

Contribution

The paper presents RE+AL, an improved training framework that enhances the Neuroflight system, addressing smoothness and transferability issues in RL-based quadrotor control.

Findings

01

RE+AL significantly improves control smoothness in RL agents.

02

Agents trained with RE+AL transfer reliably to real quadrotors.

03

RE+AL-trained agents outperform tuned PID controllers in tracking and efficiency.

Abstract

We focus on the problem of reliably training Reinforcement Learning (RL) models (agents) for stable low-level control in embedded systems and test our methods on a high-performance, custom-built quadrotor platform. A common but often under-studied problem in developing RL agents for continuous control is that the control policies developed are not always smooth. This lack of smoothness can be a major problem when learning controllers %intended for deployment on real hardware as it can result in control instability and hardware failure. Issues of noisy control are further accentuated when training RL agents in simulation due to simulators ultimately being imperfect representations of reality - what is known as the reality gap. To combat issues of instability in RL agents, we propose a systematic framework, `REinforcement-based transferable Agents through Learning' (RE+AL), for designing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.