Robotic Offline RL from Internet Videos via Value-Function Pre-Training
Chethan Bhateja, Derek Guo, Dibya Ghosh, Anikait Singh, Manan Tomar,, Quan Vuong, Yevgen Chebotar, Sergey Levine, Aviral Kumar

TL;DR
This paper introduces V-PTR, a system that leverages large-scale human video datasets to pre-train value functions for robotic offline reinforcement learning, enabling better generalization and robustness in manipulation tasks.
Contribution
The paper presents a novel approach to use video data for pre-training value functions in robotic offline RL, bridging the gap between observation-only videos and RL requirements.
Findings
V-PTR improves policy performance on manipulation tasks.
Value functions learned from videos enhance downstream RL.
The system generalizes well across diverse tasks.
Abstract
Pre-training on Internet data has proven to be a key ingredient for broad generalization in many modern ML systems. What would it take to enable such capabilities in robotic reinforcement learning (RL)? Offline RL methods, which learn from datasets of robot experience, offer one way to leverage prior data into the robotic learning pipeline. However, these methods have a "type mismatch" with video data (such as Ego4D), the largest prior datasets available for robotics, since video offers observation-only experience without the action or reward annotations needed for RL methods. In this paper, we develop a system for leveraging large-scale human video datasets in robotic offline RL, based entirely on learning value functions via temporal-difference learning. We show that value learning on video datasets learns representations that are more conducive to downstream robotic offline RL than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
