Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for   Autonomous Real-World Reinforcement Learning

Jingyun Yang; Max Sobol Mark; Brandon Vu; Archit Sharma; Jeannette; Bohg; Chelsea Finn

arXiv:2310.15145·cs.RO·October 24, 2023·1 cites

Robot Fine-Tuning Made Easy: Pre-Training Rewards and Policies for Autonomous Real-World Reinforcement Learning

Jingyun Yang, Max Sobol Mark, Brandon Vu, Archit Sharma, Jeannette, Bohg, Chelsea Finn

PDF

Open Access

TL;DR

This paper presents RoboFuME, a system that enables robots to efficiently learn new manipulation tasks with minimal human effort by leveraging pre-trained models, offline data, and autonomous online fine-tuning in real-world settings.

Contribution

The paper introduces RoboFuME, a reset-free, autonomous fine-tuning system that combines offline reinforcement learning and vision-language models to facilitate real-world robot learning.

Findings

01

RoboFuME can learn new tasks within 3 hours of autonomous experience.

02

It effectively incorporates diverse pre-existing datasets from different sources.

03

Outperforms prior methods in simulation experiments.

Abstract

The pre-train and fine-tune paradigm in machine learning has had dramatic success in a wide range of domains because the use of existing data or pre-trained models on the internet enables quick and easy learning of new tasks. We aim to enable this paradigm in robotic reinforcement learning, allowing a robot to learn a new task with little human effort by leveraging data and models from the Internet. However, reinforcement learning often requires significant human effort in the form of manual reward specification or environment resets, even if the policy is pre-trained. We introduce RoboFuME, a reset-free fine-tuning system that pre-trains a multi-task manipulation policy from diverse datasets of prior experiences and self-improves online to learn a target task with minimal human intervention. Our insights are to utilize calibrated offline reinforcement learning techniques to ensure…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsSparse Evolutionary Training