Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance
Mitsuhiko Nakamoto, Oier Mees, Aviral Kumar, Sergey Levine

TL;DR
This paper introduces Value-Guided Policy Steering (V-GPS), a method that improves robotic policies by re-ranking actions with a learned value function, enhancing performance across various policies and robotic platforms without retraining.
Contribution
The paper presents V-GPS, a universal, non-intrusive approach to enhance existing robotic policies through offline RL-based action re-ranking, applicable across different architectures and datasets.
Findings
V-GPS improves performance of five different policies.
Consistent gains across 12 robotic tasks.
Applicable without fine-tuning or policy access.
Abstract
Large, general-purpose robotic policies trained on diverse demonstration datasets have been shown to be remarkably effective both for controlling a variety of robots in a range of different scenes, and for acquiring broad repertoires of manipulation skills. However, the data that such policies are trained on is generally of mixed quality -- not only are human-collected demonstrations unlikely to perform the task perfectly, but the larger the dataset is, the harder it is to curate only the highest quality examples. It also remains unclear how optimal data from one embodiment is for training on another embodiment. In this paper, we present a general and broadly applicable approach that enhances the performance of such generalist robot policies at deployment time by re-ranking their actions according to a value function learned via offline RL. This approach, which we call Value-Guided…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Simulation Techniques and Applications · Model-Driven Software Engineering Techniques
