Constrained Reinforcement Learning using Distributional Representation for Trustworthy Quadrotor UAV Tracking Control
Yanran Wang, David Boyle

TL;DR
This paper introduces a novel distributional reinforcement learning-based disturbance estimator combined with stochastic model predictive control to enhance quadrotor UAV tracking accuracy and reliability in complex environments.
Contribution
It proposes ConsDRED, an interpretable disturbance estimator with theoretical guarantees, integrated into a SMPC framework for improved quadrotor control.
Findings
Achieves at least 70% reduction in tracking errors.
Demonstrates convergent training in simulation and real-world.
Less sensitive to hyperparameters than existing methods.
Abstract
Simultaneously accurate and reliable tracking control for quadrotors in complex dynamic environments is challenging. As aerodynamics derived from drag forces and moment variations are chaotic and difficult to precisely identify, most current quadrotor tracking systems treat them as simple `disturbances' in conventional control approaches. We propose a novel, interpretable trajectory tracker integrating a Distributional Reinforcement Learning disturbance estimator for unknown aerodynamic effects with a Stochastic Model Predictive Controller (SMPC). The proposed estimator `Constrained Distributional Reinforced disturbance estimator' (ConsDRED) accurately identifies uncertainties between true and estimated values of aerodynamic effects. Simplified Affine Disturbance Feedback is used for control parameterization to guarantee convexity, which we then integrate with a SMPC. We theoretically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Adaptive Control of Nonlinear Systems
