Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control

Taeho Lee; Donghwan Lee

arXiv:2502.21057·cs.RO·December 4, 2025

Robust Deterministic Policy Gradient for Disturbance Attenuation and Its Application to Quadrotor Control

Taeho Lee, Donghwan Lee

PDF

TL;DR

This paper introduces a robust reinforcement learning algorithm, RDPG, that formulates disturbance attenuation as a two-player game, leading to more resilient quadrotor control under severe disturbances.

Contribution

It develops a novel RDPG algorithm combining deterministic policy gradients with deep RL, and applies it to improve disturbance robustness in quadrotor control.

Findings

01

RDPG outperforms existing methods in disturbance attenuation.

02

RDDPG enhances stability and sample efficiency.

03

Experiments show superior robustness and tracking accuracy.

Abstract

This paper presents a robust reinforcement learning algorithm called robust deterministic policy gradient (RDPG), which reformulates the H-infinity control problem as a two-player zero-sum dynamic game between a user and an adversary. The method combines deterministic policy gradients with deep reinforcement learning to train a robust policy that attenuates disturbances efficiently. A practical variant, robust deep deterministic policy gradient (RDDPG), integrates twin-delayed updates for stability and sample efficiency. Experiments on an unmanned aerial vehicle demonstrate superior robustness and tracking accuracy under severe disturbance conditions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.