Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Jiafei Lyu; Chenjia Bai; Jingwen Yang; Zongqing Lu; Xiu Li

arXiv:2405.15369·cs.LG·May 27, 2024·1 cites

Cross-Domain Policy Adaptation by Capturing Representation Mismatch

Jiafei Lyu, Chenjia Bai, Jingwen Yang, Zongqing Lu, Xiu Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel approach to transfer reinforcement learning policies across domains with different dynamics by using representation deviation as a reward penalty, leading to improved adaptation performance.

Contribution

The paper proposes a decoupled representation learning method that measures and penalizes representation mismatch to enhance policy transfer across domains with dynamics discrepancies.

Findings

01

Effective in environments with kinematic and morphology mismatch

02

Outperforms existing methods in transfer tasks

03

Representation deviation correlates with policy performance gap

Abstract

It is vital to learn effective policies that can be transferred to different domains with dynamics discrepancies in reinforcement learning (RL). In this paper, we consider dynamics adaptation settings where there exists dynamics mismatch between the source domain and the target domain, and one can get access to sufficient source domain data, while can only have limited interactions with the target domain. Existing methods address this problem by learning domain classifiers, performing data filtering from a value discrepancy perspective, etc. Instead, we tackle this challenge from a decoupled representation learning perspective. We perform representation learning only in the target domain and measure the representation deviations on the transitions from the source domain, which we show can be a signal of dynamics mismatch. We also show that representation deviation upper bounds…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dmksjfl/par
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Stream Mining Techniques