Wasserstein Robust Reinforcement Learning

Mohammed Amin Abdullah; Hang Ren; Haitham Bou Ammar; Vladimir; Milenkovic; Rui Luo; Mingtian Zhang; Jun Wang

arXiv:1907.13196·cs.LG·September 18, 2019·37 cites

Wasserstein Robust Reinforcement Learning

Mohammed Amin Abdullah, Hang Ren, Haitham Bou Ammar, Vladimir, Milenkovic, Rui Luo, Mingtian Zhang, Jun Wang

PDF

Open Access

TL;DR

This paper introduces $ ext{W} ext{R}^{2} ext{L}$, a Wasserstein-based robust reinforcement learning algorithm that improves performance and robustness in complex control tasks through a novel min-max formulation and scalable solver.

Contribution

It formulates robust reinforcement learning as a Wasserstein-constrained min-max game and proposes a new zero-order optimization method for efficient solving.

Findings

01

Significant performance gains over standard algorithms in MuJuCo environments.

02

Robustness to environment variations demonstrated in high-dimensional tasks.

03

Scalable solver applicable to complex reinforcement learning problems.

Abstract

Reinforcement learning algorithms, though successful, tend to over-fit to training environments hampering their application to the real-world. This paper proposes $W R^{2} L$ -- a robust reinforcement learning algorithm with significant robust performance on low and high-dimensional control tasks. Our method formalises robust reinforcement learning as a novel min-max game with a Wasserstein constraint for a correct and convergent solver. Apart from the formulation, we also propose an efficient and scalable solver following a novel zero-order optimisation method that we believe can be useful to numerical optimisation in general. We empirically demonstrate significant gains compared to standard and robust state-of-the-art algorithms on high-dimensional MuJuCo environments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications