FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning
Jiaheng Hu, Rose Hendrix, Ali Farhadi, Aniruddha Kembhavi, Roberto, Martin-Martin, Peter Stone, Kuo-Hao Zeng, Kiana Ehsani

TL;DR
FLaRe is a reinforcement learning fine-tuning framework that significantly improves generalization and adaptation of robot policies across unseen environments and new tasks, leveraging large-scale pretraining and stabilization techniques.
Contribution
The paper introduces FLaRe, a novel RL fine-tuning approach that enhances pre-trained robot policies for better generalization and rapid adaptation to new tasks and embodiments.
Findings
Achieves 79.5% success rate on unseen environments
Outperforms prior methods by +23.6% in simulation
Enables rapid adaptation within less than a day of fine-tuning
Abstract
In recent years, the Robotics field has initiated several efforts toward building generalist robot policies through large-scale multi-task Behavior Cloning. However, direct deployments of these policies have led to unsatisfactory performance, where the policy struggles with unseen states and tasks. How can we break through the performance plateau of these models and elevate their capabilities to new heights? In this paper, we propose FLaRe, a large-scale Reinforcement Learning fine-tuning framework that integrates robust pre-trained representations, large-scale training, and gradient stabilization techniques. Our method aligns pre-trained policies towards task completion, achieving state-of-the-art (SoTA) performance both on previously demonstrated and on entirely novel tasks and embodiments. Specifically, on a set of long-horizon mobile manipulation tasks, FLaRe achieves an average…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics
MethodsSparse Evolutionary Training
