ODRL: A Benchmark for Off-Dynamics Reinforcement Learning
Jiafei Lyu, Kang Xu, Jiacheng Xu, Mengbei Yan, Jingwen Yang, Zongzhang, Zhang, Chenjia Bai, Zongqing Lu, Xiu Li

TL;DR
ODRL is a comprehensive benchmark designed to evaluate off-dynamics reinforcement learning algorithms across diverse settings, facilitating the assessment of their adaptation capabilities in varied domain shifts.
Contribution
This paper introduces ODRL, the first standardized benchmark for off-dynamics RL, including diverse tasks, settings, and a unified framework for evaluation.
Findings
No existing method outperforms others across all dynamics shifts.
ODRL enables systematic evaluation of off-dynamics RL algorithms.
Benchmark results highlight the need for more adaptable algorithms.
Abstract
We consider off-dynamics reinforcement learning (RL) where one needs to transfer policies across different domains with dynamics mismatch. Despite the focus on developing dynamics-aware algorithms, this field is hindered due to the lack of a standard benchmark. To bridge this gap, we introduce ODRL, the first benchmark tailored for evaluating off-dynamics RL methods. ODRL contains four experimental settings where the source and target domains can be either online or offline, and provides diverse tasks and a broad spectrum of dynamics shifts, making it a reliable platform to comprehensively evaluate the agent's adaptation ability to the target domain. Furthermore, ODRL includes recent off-dynamics RL algorithms in a unified framework and introduces some extra baselines for different settings, all implemented in a single-file manner. To unpack the true adaptation capability of existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsFocus
