Constrained Reinforcement Learning using Distributional Representation   for Trustworthy Quadrotor UAV Tracking Control

Yanran Wang; David Boyle

arXiv:2302.11694·cs.RO·July 16, 2024·1 cites

Constrained Reinforcement Learning using Distributional Representation for Trustworthy Quadrotor UAV Tracking Control

Yanran Wang, David Boyle

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel distributional reinforcement learning-based disturbance estimator combined with stochastic model predictive control to enhance quadrotor UAV tracking accuracy and reliability in complex environments.

Contribution

It proposes ConsDRED, an interpretable disturbance estimator with theoretical guarantees, integrated into a SMPC framework for improved quadrotor control.

Findings

01

Achieves at least 70% reduction in tracking errors.

02

Demonstrates convergent training in simulation and real-world.

03

Less sensitive to hyperparameters than existing methods.

Abstract

Simultaneously accurate and reliable tracking control for quadrotors in complex dynamic environments is challenging. As aerodynamics derived from drag forces and moment variations are chaotic and difficult to precisely identify, most current quadrotor tracking systems treat them as simple `disturbances' in conventional control approaches. We propose a novel, interpretable trajectory tracker integrating a Distributional Reinforcement Learning disturbance estimator for unknown aerodynamic effects with a Stochastic Model Predictive Controller (SMPC). The proposed estimator `Constrained Distributional Reinforced disturbance estimator' (ConsDRED) accurately identifies uncertainties between true and estimated values of aerodynamic effects. Simplified Affine Disturbance Feedback is used for control parameterization to guarantee convexity, which we then integrate with a SMPC. We theoretically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alex-yanranwang/consdred-smpc
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdaptive Dynamic Programming Control · Model Reduction and Neural Networks · Adaptive Control of Nonlinear Systems