Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement   Learning

Jinxin Liu; Ziqi Zhang; Zhenyu Wei; Zifeng Zhuang; Yachen Kang; Sibo; Gai; Donglin Wang

arXiv:2306.12755·cs.LG·June 23, 2023·2 cites

Beyond OOD State Actions: Supported Cross-Domain Offline Reinforcement Learning

Jinxin Liu, Ziqi Zhang, Zhenyu Wei, Zifeng Zhuang, Yachen Kang, Sibo, Gai, Donglin Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces BOSA, a method for cross-domain offline RL that addresses OOD transition dynamics, significantly improving data efficiency by leveraging source domain data from different environments.

Contribution

The paper proposes BOSA, a novel approach to handle OOD transition dynamics in cross-domain offline RL, enhancing data efficiency and compatibility with existing techniques.

Findings

01

BOSA achieves 74.4% of SOTA performance using only 10% of target data.

02

BOSA can be integrated with model-based offline RL and data augmentation methods.

03

Cross-domain data can significantly improve offline RL efficiency.

Abstract

Offline reinforcement learning (RL) aims to learn a policy using only pre-collected and fixed data. Although avoiding the time-consuming online interactions in RL, it poses challenges for out-of-distribution (OOD) state actions and often suffers from data inefficiency for training. Despite many efforts being devoted to addressing OOD state actions, the latter (data inefficiency) receives little attention in offline RL. To address this, this paper proposes the cross-domain offline RL, which assumes offline data incorporate additional source-domain data from varying transition dynamics (environments), and expects it to contribute to the offline data efficiency. To do so, we identify a new challenge of OOD transition dynamics, beyond the common OOD state actions issue, when utilizing cross-domain offline data. Then, we propose our method BOSA, which employs two support-constrained…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

thuml/SPOT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics