Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using   Proximal Policy Optimization

Eivind B{\o}hn; Erlend M. Coates; Signe Moe; Tor Arne Johansen

arXiv:1911.05478·cs.RO·November 14, 2019

Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization

Eivind B{\o}hn, Erlend M. Coates, Signe Moe, Tor Arne Johansen

PDF

2 Repos

TL;DR

This paper develops a deep reinforcement learning controller using PPO for fixed-wing UAV attitude control, extending flight capabilities and demonstrating robustness to disturbances compared to traditional PID controllers.

Contribution

It introduces a DRL-based attitude control method for UAVs that outperforms PID controllers in convergence and generalizes well to disturbances.

Findings

01

RL controller stabilizes UAV from various initial conditions.

02

RL outperforms PID in convergence cases.

03

RL generalizes to wind and turbulence disturbances.

Abstract

Contemporary autopilot systems for unmanned aerial vehicles (UAVs) are far more limited in their flight envelope as compared to experienced human pilots, thereby restricting the conditions UAVs can operate in and the types of missions they can accomplish autonomously. This paper proposes a deep reinforcement learning (DRL) controller to handle the nonlinear attitude control problem, enabling extended flight envelopes for fixed-wing UAVs. A proof-of-concept controller using the proximal policy optimization (PPO) algorithm is developed, and is shown to be capable of stabilizing a fixed-wing UAV from a large set of initial conditions to reference roll, pitch and airspeed values. The training process is outlined and key factors for its progression rate are considered, with the most important factor found to be limiting the number of variables in the observation vector, and including values…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.