Flow-Direct: Feedback-Efficient and Reusable Guidance for Flow Models via Non-Parametric Guidance Field
Kim Yong Tan, Yueming Lyu, Ivor Tsang, Yew-Soon Ong

TL;DR
Flow-Direct introduces a persistent, non-parametric guidance field for flow models, significantly improving feedback efficiency and reusability in optimizing application-specific objectives using external reward functions.
Contribution
It provides a theoretically grounded, practical framework that leverages all collected samples to refine guidance, enabling reusable and multi-objective sample generation.
Findings
Guidance field derived from log-density ratio guides distribution transformation.
Empirical guidance field improves with more reward-evaluated samples.
Framework enables reusable guidance for generating samples satisfying multiple objectives.
Abstract
Training-free guidance enables pre-trained diffusion and flow models to optimize application-specific objectives using feedback from external black-box reward functions. However, existing methods are feedback-inefficient because reward feedback is used only transiently to inform a localized gradient approximation or a discrete search decision, and is subsequently discarded. To address this limitation, we propose Flow-Direct, a framework that guides the generation process via a persistent guidance field. Theoretically, this guidance field is analytically derived from the log-density ratio between the base and reward-weighted target distributions; it transports the pre-trained distribution to the target distribution. In practice, the field is implemented as a non-parametric estimator constructed from all accumulated reward-evaluated samples. As more samples are collected during…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
