Robust Reinforcement Learning in Continuous Control Tasks with Uncertainty Set Regularization
Yuan Zhang, Jianhong Wang, Joschka Boedecker

TL;DR
This paper introduces USR, a new regularizer for reinforcement learning that enhances robustness in continuous control tasks by modeling uncertainty sets and using adversarial generation, improving performance under environmental perturbations.
Contribution
The paper proposes USR, a novel uncertainty set regularizer for RL, and an adversarial method to generate uncertainty sets, applicable to any RL framework for improved robustness.
Findings
USR improves robustness in perturbed environments.
The adversarial uncertainty set generation enhances policy resilience.
Experimental results on RWRL benchmark show superior performance.
Abstract
Reinforcement learning (RL) is recognized as lacking generalization and robustness under environmental perturbations, which excessively restricts its application for real-world robotics. Prior work claimed that adding regularization to the value function is equivalent to learning a robust policy with uncertain transitions. Although the regularization-robustness transformation is appealing for its simplicity and efficiency, it is still lacking in continuous control tasks. In this paper, we propose a new regularizer named ncertainty et egularizer (USR), by formulating the uncertainty set on the parameter space of the transition function. In particular, USR is flexible enough to be plugged into any existing RL framework. To deal with unknown uncertainty sets, we further propose a novel adversarial approach to generate them based on the value function. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Reinforcement Learning in Robotics
