TL;DR
This paper explores the use of advanced reinforcement learning algorithms to improve UAV attitude control within the inner loop, comparing their performance to traditional PID controllers in a high-fidelity simulation environment.
Contribution
It introduces an open-source simulation environment for training UAV attitude control with RL and evaluates multiple state-of-the-art RL algorithms against PID control.
Findings
RL algorithms show promise for high-precision attitude control
PPO outperforms other RL methods in simulation
RL-based controllers can match or exceed PID performance in certain scenarios
Abstract
Autopilot systems are typically composed of an "inner loop" providing stability and control, while an "outer loop" is responsible for mission-level objectives, e.g. way-point navigation. Autopilot systems for UAVs are predominately implemented using Proportional, Integral Derivative (PID) control systems, which have demonstrated exceptional performance in stable environments. However more sophisticated control is required to operate in unpredictable, and harsh environments. Intelligent flight control systems is an active area of research addressing limitations of PID control most recently through the use of reinforcement learning (RL) which has had success in other applications such as robotics. However previous work has focused primarily on using RL at the mission-level controller. In this work, we investigate the performance and accuracy of the inner control loop providing attitude…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
