Improving Wind Resistance Performance of Cascaded PID Controlled   Quadcopters using Residual Reinforcement Learning

Yu Ishihara; Yuichi Hazama; Kousuke Suzuki; Jerry Jun Yokono; Kohtaro; Sabe; Kenta Kawamoto

arXiv:2308.01648·cs.RO·August 4, 2023

Improving Wind Resistance Performance of Cascaded PID Controlled Quadcopters using Residual Reinforcement Learning

Yu Ishihara, Yuichi Hazama, Kousuke Suzuki, Jerry Jun Yokono, Kohtaro, Sabe, Kenta Kawamoto

PDF

Open Access

TL;DR

This paper introduces a residual reinforcement learning method to enhance wind resistance in quadcopters with cascaded PID controllers, trained solely in simulation, achieving significant position deviation reduction and robustness in outdoor wind conditions.

Contribution

It presents a novel residual RL approach that improves wind disturbance compensation for quadcopters without hardware data collection or fine-tuning.

Findings

01

Reduces position deviation by ~50% in wind conditions

02

Maintains performance despite mass and lift coefficient changes

03

Effective in outdoor wind speeds over 13 m/s

Abstract

Wind resistance control is an essential feature for quadcopters to maintain their position to avoid deviation from target position and prevent collisions with obstacles. Conventionally, cascaded PID controller is used for the control of quadcopters for its simplicity and ease of tuning its parameters. However, it is weak against wind disturbances and the quadcopter can easily deviate from target position. In this work, we propose a residual reinforcement learning based approach to build a wind resistance controller of a quadcopter. By learning only the residual that compensates the disturbance, we can continue using the cascaded PID controller as the base controller of the quadcopter but improve its performance against wind disturbances. To avoid unexpected crashes and destructions of quadcopters, our method does not require real hardware for data collection and training. The controller…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVehicle Dynamics and Control Systems · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection