Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary   Model

Siemen Herremans; Ali Anwar; Siegfried Mercelis

arXiv:2406.09976·cs.LG·July 2, 2024

Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model

Siemen Herremans, Ali Anwar, Siegfried Mercelis

PDF

Open Access 1 Repo

TL;DR

This paper introduces a robust model-based reinforcement learning approach using an adversarial auxiliary model to improve policy robustness in high-dimensional control tasks without requiring parametric simulators.

Contribution

It proposes a novel adversarial auxiliary transition model within a robust MDP framework, enhancing policy robustness without additional environment constraints.

Findings

01

Improved robustness of policies in distorted MDPs.

02

Effective integration of adversarial auxiliary model into RL algorithm.

03

Enhanced performance on high-dimensional MuJoCo tasks.

Abstract

Reinforcement learning has demonstrated impressive performance in various challenging problems such as robotics, board games, and classical arcade games. However, its real-world applications can be hindered by the absence of robustness and safety in the learned policies. More specifically, an RL agent that trains in a certain Markov decision process (MDP) often struggles to perform well in nearly identical MDPs. To address this issue, we employ the framework of Robust MDPs (RMDPs) in a model-based setting and introduce a novel learned transition model. Our method specifically incorporates an auxiliary pessimistic model, updated adversarially, to estimate the worst-case MDP within a Kullback-Leibler uncertainty set. In comparison to several existing works, our work does not impose any additional conditions on the training environment, such as the need for a parametric simulator. To test…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rmbpo-eval/rmbpo-eval
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning