Designing a Robust Low-Level Agnostic Controller for a Quadrotor with   Actor-Critic Reinforcement Learning

Guilherme Siqueira Eduardo; Wouter Caarls

arXiv:2210.02964·cs.RO·October 7, 2022

Designing a Robust Low-Level Agnostic Controller for a Quadrotor with Actor-Critic Reinforcement Learning

Guilherme Siqueira Eduardo, Wouter Caarls

PDF

Open Access

TL;DR

This paper presents a domain-randomized Soft Actor-Critic based low-level controller for quadrotors, demonstrating improved robustness and adaptability in payload pickup/drop tasks with disturbances, outperforming traditional PID controllers.

Contribution

Introduces a domain randomization training method for a low-level RL controller that enhances robustness and generalization across varying quadrotor dynamics.

Findings

01

RL controller outperforms PID in payload tasks

02

Controller maintains performance across diverse quadrotor parameters

03

Domain randomization improves robustness to disturbances

Abstract

Purpose: Real-life applications using quadrotors introduce a number of disturbances and time-varying properties that pose a challenge to flight controllers. We observed that, when a quadrotor is tasked with picking up and dropping a payload, traditional PID and RL-based controllers found in literature struggle to maintain flight after the vehicle changes its dynamics due to interaction with this external object. Methods: In this work, we introduce domain randomization during the training phase of a low-level waypoint guidance controller based on Soft Actor-Critic. The resulting controller is evaluated on the proposed payload pick up and drop task with added disturbances that emulate real-life operation of the vehicle. Results & Conclusion: We show that, by introducing a certain degree of uncertainty in quadrotor dynamics during training, we can obtain a controller that is capable to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGuidance and Control Systems · Robotic Path Planning Algorithms · Adaptive Control of Nonlinear Systems