Pre-Training for Robots: Offline RL Enables Learning New Tasks from a Handful of Trials

Aviral Kumar; Anikait Singh; Frederik Ebert; Mitsuhiko Nakamoto; Yanlai Yang; Chelsea Finn; Sergey Levine

arXiv:2210.05178·cs.RO·September 26, 2023·1 citation

Open Access·1 Repo

TL;DR

This paper introduces PTR, a framework that combines offline reinforcement learning with minimal task-specific data to enable robots to learn new tasks efficiently, even in new environments, without extensive retraining.

Contribution

PTR extends offline RL with key design choices, allowing effective transfer and rapid fine-tuning on new robotic tasks using limited demonstrations.

Findings

01 PTR successfully learns new tasks with as few as 10 demonstrations.

02 PTR outperforms prior methods in real-world robotic experiments.

03 Autonomous fine-tuning improves robot performance without additional demonstrations.

Abstract

Progress in deep learning highlights the tremendous potential of utilizing diverse robotic datasets for attaining effective generalization and makes it enticing to consider leveraging broad datasets for attaining robust generalization in robotic learning as well. However, in practice, we often want to learn a new skill in a new environment that is unlikely to be contained in the prior data. Therefore we ask: how can we leverage existing diverse offline datasets in combination with small amounts of task-specific data to solve new tasks, while still enjoying the generalization benefits of training on large amounts of data? In this paper, we demonstrate that end-to-end offline RL can be an effective approach for doing this, without the need for any representation learning or vision-based pre-training. We present pre-training for robots (PTR), a framework based on offline RL that attempts…
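The abstract's central idea, combining a large prior dataset with a small amount of task-specific data, can be illustrated with a minimal batch-sampling sketch. This is not the authors' implementation: the function name `sample_mixed_batch`, the `target_fraction` parameter, the value 0.25, and the toy data are all assumptions made for illustration, and the excerpt above does not state how PTR actually mixes the two data sources.

```python
import random


def sample_mixed_batch(prior_data, target_data, batch_size, target_fraction, rng):
    """Draw one training batch that mixes prior and target-task transitions.

    Hypothetical helper: `target_fraction` is an assumed knob, not a value
    taken from the paper.
    """
    if not 0.0 <= target_fraction <= 1.0:
        raise ValueError("target_fraction must lie in [0, 1]")
    n_target = round(batch_size * target_fraction)
    n_prior = batch_size - n_target
    # Sample with replacement so a tiny target dataset can still fill its share.
    batch = rng.choices(target_data, k=n_target) + rng.choices(prior_data, k=n_prior)
    rng.shuffle(batch)
    return batch


# Toy stand-ins for transitions: a large prior dataset and 10 target-task samples.
prior_data = [("prior", i) for i in range(1000)]
target_data = [("target", i) for i in range(10)]

rng = random.Random(0)
batch = sample_mixed_batch(prior_data, target_data, batch_size=32,
                           target_fraction=0.25, rng=rng)
n_target_in_batch = sum(1 for source, _ in batch if source == "target")
```

Sampling with replacement keeps the small target dataset represented in every batch, which is one simple way to prevent it from being drowned out by the much larger prior dataset.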

Peer Reviews

No public reviews on file for this paper yet.

Code & Models

Repositories

asap7772/ptr (JAX, official)

Videos

No videos yet.

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI

Methods: Q-Learning