Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach

Minting Pan; Yitao Zheng; Jiajian Li; Yunbo Wang; Xiaokang Yang

arXiv:2505.06482·cs.LG·May 20, 2025

Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach

Minting Pan, Yitao Zheng, Jiajian Li, Yunbo Wang, Xiaokang Yang

PDF

Open Access 1 Video

TL;DR

VeoRL is a novel model-based offline reinforcement learning method that constructs an interactive world model from online video data, significantly improving policy performance in various visual control tasks.

Contribution

It introduces a new approach to offline RL by leveraging unlabeled videos to build world models, transferring knowledge to enhance policy learning.

Findings

01

Achieves over 100% performance improvement in some tasks

02

Effective in robotic manipulation, autonomous driving, and video games

03

Utilizes diverse online videos for world model construction

Abstract

Offline reinforcement learning (RL) enables policy optimization using static datasets, avoiding the risks and costs of extensive real-world exploration. However, it struggles with suboptimal offline behaviors and inaccurate value estimation due to the lack of environmental interaction. We present Video-Enhanced Offline RL (VeoRL), a model-based method that constructs an interactive world model from diverse, unlabeled video data readily available online. Leveraging model-based behavior guidance, our approach transfers commonsense knowledge of control policy and physical dynamics from natural videos to the RL agent within the target domain. VeoRL achieves substantial performance gains (over 100% in some cases) across visual control tasks in robotic manipulation, autonomous driving, and open-world video games.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Video-Enhanced Offline Reinforcement Learning: A Model-Based Approach· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Adversarial Robustness in Machine Learning