Contrastive Representation for Data Filtering in Cross-Domain Offline Reinforcement Learning

Xiaoyu Wen; Chenjia Bai; Kang Xu; Xudong Yu; Yang Zhang; Xuelong Li; Zhen Wang

arXiv:2405.06192·cs.LG·May 13, 2024

TL;DR

This paper introduces a contrastive representation method for data filtering in cross-domain offline reinforcement learning, effectively measuring domain gaps and improving performance with limited target data.

Contribution

It proposes a novel contrastive learning approach to measure domain differences and a data filtering algorithm that enhances cross-domain RL performance.

Findings

01. Achieves 89.2% performance with only 10% target data

02. Outperforms state-of-the-art methods on various tasks

03. Effectively handles significant domain differences

Abstract

Cross-domain offline reinforcement learning leverages source domain data with diverse transition dynamics to alleviate the data requirement for the target domain. However, simply merging the data of two domains leads to performance degradation due to the dynamics mismatch. Existing methods address this problem by measuring the dynamics gap via domain classifiers while relying on the assumptions of the transferability of paired domains. In this paper, we propose a novel representation-based approach to measure the domain gap, where the representation is learned through a contrastive objective by sampling transitions from different domains. We show that such an objective recovers the mutual-information gap of transition functions in two domains without suffering from the unbounded issue of the dynamics gap in handling significantly different domains. Based on the representations, we…
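
To make the idea in the abstract concrete, the following is a minimal sketch of an InfoNCE-style contrastive loss over transitions, followed by a score-based filter on source data. It is not the authors' implementation (see the repository listed under Code & Models for that). Everything here is an assumption for illustration: the dot-product score, the fixed embedding vectors standing in for learned encoders, the function names `score`, `info_nce_loss`, and `filter_source`, and the `keep_fraction` parameter. The abstract is truncated before it describes the filtering step, so the top-fraction rule below is only one plausible reading of "data filtering".

```python
import math


def score(sa_embed, next_embed):
    """Hypothetical score h(s, a, s'): dot product of two embedding vectors."""
    return sum(x * y for x, y in zip(sa_embed, next_embed))


def info_nce_loss(sa_embed, pos_next, neg_nexts):
    """InfoNCE loss for one (s, a) anchor.

    pos_next  : embedding of a next state assumed to come from the target domain
    neg_nexts : embeddings of next states assumed to come from the source domain
    """
    pos = score(sa_embed, pos_next)
    logits = [pos] + [score(sa_embed, n) for n in neg_nexts]
    # Numerically stable log-sum-exp over the positive and all negatives.
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_sum_exp - pos


def filter_source(source_scores, keep_fraction):
    """Return indices of the highest-scoring source transitions.

    keep_fraction is an assumed hyperparameter: the share of source data kept.
    """
    k = max(1, int(len(source_scores) * keep_fraction))
    ranked = sorted(range(len(source_scores)),
                    key=lambda i: source_scores[i], reverse=True)
    return sorted(ranked[:k])


# Toy usage with made-up 2-D embeddings.
anchor = [1.0, 0.0]
loss = info_nce_loss(anchor, pos_next=[2.0, 0.0], neg_nexts=[[-2.0, 0.0]])
kept = filter_source([0.1, 0.9, 0.5, 0.3], keep_fraction=0.5)
```

In this sketch a source transition with a high score looks, to the contrastive model, like something the target dynamics could have produced, so it is kept for training; low-scoring transitions are dropped. Because the loss is a softmax cross-entropy over a finite set of candidates, each score enters only through a normalized probability, which is one intuition for why such a measure stays well behaved when the two domains differ greatly.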

Code & Models

Repositories

battlewen/igdf
PyTorch · Official

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control