Steering Your Generalists: Improving Robotic Foundation Models via Value   Guidance

Mitsuhiko Nakamoto; Oier Mees; Aviral Kumar; Sergey Levine

arXiv:2410.13816·cs.RO·February 26, 2025

Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance

Mitsuhiko Nakamoto, Oier Mees, Aviral Kumar, Sergey Levine

PDF

Open Access 1 Repo

TL;DR

This paper introduces Value-Guided Policy Steering (V-GPS), a method that improves robotic policies by re-ranking actions with a learned value function, enhancing performance across various policies and robotic platforms without retraining.

Contribution

The paper presents V-GPS, a universal, non-intrusive approach to enhance existing robotic policies through offline RL-based action re-ranking, applicable across different architectures and datasets.

Findings

01

V-GPS improves performance of five different policies.

02

Consistent gains across 12 robotic tasks.

03

Applicable without fine-tuning or policy access.

Abstract

Large, general-purpose robotic policies trained on diverse demonstration datasets have been shown to be remarkably effective both for controlling a variety of robots in a range of different scenes, and for acquiring broad repertoires of manipulation skills. However, the data that such policies are trained on is generally of mixed quality -- not only are human-collected demonstrations unlikely to perform the task perfectly, but the larger the dataset is, the harder it is to curate only the highest quality examples. It also remains unclear how optimal data from one embodiment is for training on another embodiment. In this paper, we present a general and broadly applicable approach that enhances the performance of such generalist robot policies at deployment time by re-ranking their actions according to a value function learned via offline RL. This approach, which we call Value-Guided…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nakamotoo/v-gps
jax

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies · Simulation Techniques and Applications · Model-Driven Software Engineering Techniques