Domain Adaptation In Reinforcement Learning Via Latent Unified State   Representation

Jinwei Xing; Takashi Nagata; Kexin Chen; Xinyun Zou; Emre Neftci,; Jeffrey L. Krichmar

arXiv:2102.05714·cs.LG·September 6, 2021·5 cites

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Jinwei Xing, Takashi Nagata, Kexin Chen, Xinyun Zou, Emre Neftci,, Jeffrey L. Krichmar

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a two-stage reinforcement learning method that learns a unified latent state representation to enable zero-shot domain adaptation across different visual environments, improving generalization in complex tasks.

Contribution

The paper proposes a novel two-stage RL approach that learns a cross-domain consistent latent state representation, enabling effective zero-shot transfer without additional training.

Findings

01

Achieves state-of-the-art domain adaptation in CarRacing variants.

02

Outperforms prior latent-representation and image translation methods.

03

Demonstrates effectiveness in complex autonomous driving simulations.

Abstract

Despite the recent success of deep reinforcement learning (RL), domain adaptation remains an open problem. Although the generalization ability of RL agents is critical for the real-world applicability of Deep RL, zero-shot policy transfer is still a challenging problem since even minor visual changes could make the trained agent completely fail in the new task. To address this issue, we propose a two-stage RL agent that first learns a latent unified state representation (LUSR) which is consistent across multiple domains in the first stage, and then do RL training in one source domain based on LUSR in the second stage. The cross-domain consistency of LUSR allows the policy acquired from the source domain to generalize to other target domains without extra training. We first demonstrate our approach in variants of CarRacing games with customized manipulations, and then verify it in CARLA,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KarlXing/LUSR
pytorchOfficial

Videos

Domain Adaptation in Reinforcement Learning via Latent Unified State Representation· underline

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Reinforcement Learning in Robotics · Multimodal Machine Learning Applications

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator