Domain Adversarial Reinforcement Learning

Bonnie Li; Vincent Fran\c{c}ois-Lavet; Thang Doan; Joelle Pineau

arXiv:2102.07097·cs.LG·February 16, 2021·6 cites

Domain Adversarial Reinforcement Learning

Bonnie Li, Vincent Fran\c{c}ois-Lavet, Thang Doan, Joelle Pineau

PDF

Open Access

TL;DR

This paper introduces a domain adversarial reinforcement learning method that enhances the generalization ability of agents to unseen visual domains by enforcing invariant representations through adversarial training.

Contribution

It proposes a novel domain adversarial training approach for reinforcement learning that improves zero-shot generalization to unseen visual environments.

Findings

01

Significant improvement in generalization to unseen domains.

02

Effective invariance of learned representations across visual variations.

03

Applicable to various visual domain shifts in RL environments.

Abstract

We consider the problem of generalization in reinforcement learning where visual aspects of the observations might differ, e.g. when there are different backgrounds or change in contrast, brightness, etc. We assume that our agent has access to only a few of the MDPs from the MDP distribution during training. The performance of the agent is then reported on new unknown test domains drawn from the distribution (e.g. unseen backgrounds). For this "zero-shot RL" task, we enforce invariance of the learned representations to visual domains via a domain adversarial optimization process. We empirically show that this approach allows achieving a significant generalization improvement to new unseen domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and ELM · Adaptive Dynamic Programming Control