Improving Wind Resistance Performance of Cascaded PID Controlled Quadcopters using Residual Reinforcement Learning
Yu Ishihara, Yuichi Hazama, Kousuke Suzuki, Jerry Jun Yokono, Kohtaro, Sabe, Kenta Kawamoto

TL;DR
This paper introduces a residual reinforcement learning method to enhance wind resistance in quadcopters with cascaded PID controllers, trained solely in simulation, achieving significant position deviation reduction and robustness in outdoor wind conditions.
Contribution
It presents a novel residual RL approach that improves wind disturbance compensation for quadcopters without hardware data collection or fine-tuning.
Findings
Reduces position deviation by ~50% in wind conditions
Maintains performance despite mass and lift coefficient changes
Effective in outdoor wind speeds over 13 m/s
Abstract
Wind resistance control is an essential feature for quadcopters to maintain their position to avoid deviation from target position and prevent collisions with obstacles. Conventionally, cascaded PID controller is used for the control of quadcopters for its simplicity and ease of tuning its parameters. However, it is weak against wind disturbances and the quadcopter can easily deviate from target position. In this work, we propose a residual reinforcement learning based approach to build a wind resistance controller of a quadcopter. By learning only the residual that compensates the disturbance, we can continue using the cascaded PID controller as the base controller of the quadcopter but improve its performance against wind disturbances. To avoid unexpected crashes and destructions of quadcopters, our method does not require real hardware for data collection and training. The controller…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVehicle Dynamics and Control Systems · Reinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Balanced Selection
