FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale   Reinforcement Learning Fine-Tuning

Jiaheng Hu; Rose Hendrix; Ali Farhadi; Aniruddha Kembhavi; Roberto; Martin-Martin; Peter Stone; Kuo-Hao Zeng; Kiana Ehsani

arXiv:2409.16578·cs.RO·October 2, 2024

FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning

Jiaheng Hu, Rose Hendrix, Ali Farhadi, Aniruddha Kembhavi, Roberto, Martin-Martin, Peter Stone, Kuo-Hao Zeng, Kiana Ehsani

PDF

Open Access

TL;DR

FLaRe is a reinforcement learning fine-tuning framework that significantly improves generalization and adaptation of robot policies across unseen environments and new tasks, leveraging large-scale pretraining and stabilization techniques.

Contribution

The paper introduces FLaRe, a novel RL fine-tuning approach that enhances pre-trained robot policies for better generalization and rapid adaptation to new tasks and embodiments.

Findings

01

Achieves 79.5% success rate on unseen environments

02

Outperforms prior methods by +23.6% in simulation

03

Enables rapid adaptation within less than a day of fine-tuning

Abstract

In recent years, the Robotics field has initiated several efforts toward building generalist robot policies through large-scale multi-task Behavior Cloning. However, direct deployments of these policies have led to unsatisfactory performance, where the policy struggles with unseen states and tasks. How can we break through the performance plateau of these models and elevate their capabilities to new heights? In this paper, we propose FLaRe, a large-scale Reinforcement Learning fine-tuning framework that integrates robust pre-trained representations, large-scale training, and gradient stabilization techniques. Our method aligns pre-trained policies towards task completion, achieving state-of-the-art (SoTA) performance both on previously demonstrated and on entirely novel tasks and embodiments. Specifically, on a set of long-horizon mobile manipulation tasks, FLaRe achieves an average…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics

MethodsSparse Evolutionary Training