FASTER: Value-Guided Sampling for Fast RL

Perry Dong; Alexander Swerdlow; Dorsa Sadigh; Chelsea Finn

arXiv:2604.19730·cs.LG·April 22, 2026

TL;DR

FASTER is a lightweight, value-guided sampling method for reinforcement learning that improves policy performance and reduces computational costs by filtering action candidates during the denoising process.

Contribution

FASTER models the denoising of multiple action candidates in a diffusion policy as a Markov Decision Process (MDP) and learns to filter candidates before denoising is complete, improving efficiency without sacrificing performance.

Findings

01. FASTER improves performance on long-horizon manipulation tasks.
02. It reduces the computational cost of both training and inference.
03. It achieves state-of-the-art results among the compared methods.

Abstract

Some of the most performant reinforcement learning algorithms today can be prohibitively expensive as they use test-time scaling methods such as sampling multiple action candidates and selecting the best one. In this work, we propose FASTER, a method for getting the benefits of sampling-based test-time scaling of diffusion-based policies without the computational cost by tracing the performance gain of action samples back to earlier in the denoising process. Our key insight is that we can model the denoising of multiple action candidates and selecting the best one as a Markov Decision Process (MDP) where the goal is to progressively filter action candidates before denoising is complete. With this MDP, we can learn a policy and value function in the denoising space that predicts the downstream value of action candidates in the denoising process and filters them while maximizing returns.…
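The filtering idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the denoiser, the value estimate, and the fixed keep-a-fraction rule below are all hypothetical stand-ins (in the paper, the filtering policy and value function are learned), and the names `toy_denoise_step`, `toy_value`, and `filtered_sampling` are invented for illustration. The sketch only shows why dropping low-value candidates partway through denoising costs fewer denoiser calls than denoising every candidate to completion and picking the best at the end.

```python
import numpy as np


def toy_denoise_step(actions, step, num_steps, target):
    """Hypothetical denoiser: moves every candidate part of the way to `target`.

    The step size is chosen so that candidates reach `target` on the last step.
    """
    alpha = 1.0 / (num_steps - step)
    return actions + alpha * (target - actions)


def toy_value(actions, target):
    """Hypothetical value estimate: higher for candidates closer to `target`."""
    return -np.linalg.norm(actions - target, axis=-1)


def filtered_sampling(num_candidates, num_steps, keep_fraction, action_dim, seed=0):
    """Denoise several candidates, discarding low-value ones after each step.

    Returns the surviving best action and the number of per-candidate
    denoiser calls that were spent.
    """
    rng = np.random.default_rng(seed)
    target = np.ones(action_dim)  # assumed "ideal" action for the toy problem
    actions = rng.normal(size=(num_candidates, action_dim))  # pure-noise start
    denoise_calls = 0

    for step in range(num_steps):
        actions = toy_denoise_step(actions, step, num_steps, target)
        denoise_calls += len(actions)

        if len(actions) > 1:
            # Fixed top-k rule (an assumption; the paper learns this decision).
            keep = max(1, int(np.ceil(len(actions) * keep_fraction)))
            scores = toy_value(actions, target)
            top = np.argsort(scores)[-keep:]  # indices of the highest scores
            actions = actions[top]

    best = actions[np.argmax(toy_value(actions, target))]
    return best, denoise_calls
```

With 8 candidates, 4 denoising steps, and `keep_fraction=0.5`, this sketch spends 8 + 4 + 2 + 1 = 15 denoiser calls, whereas denoising all 8 candidates for all 4 steps before selecting one would spend 32.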

Code & Models

Repositories

alexanderswerdlow/faster (GitHub)
