Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials
Aviral Kumar, Anikait Singh, Frederik Ebert, Mitsuhiko Nakamoto,, Yanlai Yang, Chelsea Finn, Sergey Levine

TL;DR
This paper introduces PTR, a framework that combines offline reinforcement learning with minimal task-specific data to enable robots to learn new tasks efficiently, even in new environments, without extensive retraining.
Contribution
PTR extends offline RL with key design choices, allowing effective transfer and rapid fine-tuning on new robotic tasks using limited demonstrations.
Findings
PTR successfully learns new tasks with as few as 10 demonstrations.
PTR outperforms prior methods in real-world robotic experiments.
Autonomous fine-tuning improves robot performance without additional demonstrations.
Abstract
Progress in deep learning highlights the tremendous potential of utilizing diverse robotic datasets for attaining effective generalization and makes it enticing to consider leveraging broad datasets for attaining robust generalization in robotic learning as well. However, in practice, we often want to learn a new skill in a new environment that is unlikely to be contained in the prior data. Therefore we ask: how can we leverage existing diverse offline datasets in combination with small amounts of task-specific data to solve new tasks, while still enjoying the generalization benefits of training on large amounts of data? In this paper, we demonstrate that end-to-end offline RL can be an effective approach for doing this, without the need for any representation learning or vision-based pre-training. We present pre-training for robots (PTR), a framework based on offline RL that attempts…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
MethodsQ-Learning
