ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Jiafei Lyu; Kang Xu; Jiacheng Xu; Mengbei Yan; Jingwen Yang; Zongzhang; Zhang; Chenjia Bai; Zongqing Lu; Xiu Li

arXiv:2410.20750·cs.LG·October 29, 2024

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning

Jiafei Lyu, Kang Xu, Jiacheng Xu, Mengbei Yan, Jingwen Yang, Zongzhang, Zhang, Chenjia Bai, Zongqing Lu, Xiu Li

PDF

Open Access 1 Repo 1 Video

TL;DR

ODRL is a comprehensive benchmark designed to evaluate off-dynamics reinforcement learning algorithms across diverse settings, facilitating the assessment of their adaptation capabilities in varied domain shifts.

Contribution

This paper introduces ODRL, the first standardized benchmark for off-dynamics RL, including diverse tasks, settings, and a unified framework for evaluation.

Findings

01

No existing method outperforms others across all dynamics shifts.

02

ODRL enables systematic evaluation of off-dynamics RL algorithms.

03

Benchmark results highlight the need for more adaptable algorithms.

Abstract

We consider off-dynamics reinforcement learning (RL) where one needs to transfer policies across different domains with dynamics mismatch. Despite the focus on developing dynamics-aware algorithms, this field is hindered due to the lack of a standard benchmark. To bridge this gap, we introduce ODRL, the first benchmark tailored for evaluating off-dynamics RL methods. ODRL contains four experimental settings where the source and target domains can be either online or offline, and provides diverse tasks and a broad spectrum of dynamics shifts, making it a reliable platform to comprehensively evaluate the agent's adaptation ability to the target domain. Furthermore, ODRL includes recent off-dynamics RL algorithms in a unified framework and introduces some extra baselines for different settings, all implemented in a single-file manner. To unpack the true adaptation capability of existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

offdynamicsrl/off-dynamics-rl
pytorchOfficial

Videos

ODRL: A Benchmark for Off-Dynamics Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsFocus