Rethinking Test Time Scaling for Flow-Matching Generative Models
Qingtao Yu, Changlin Song, Minghao Sun, Zhengyang Yu, Vinay Kumar Verma, Soumya Roy, Sumit Negi, Hongdong Li, Dylan Campbell

TL;DR
This paper introduces DOG-Trim, a new method for improving test-time scaling in flow-matching generative models by enhancing diversity and reward accuracy, leading to doubled performance gains over existing methods.
Contribution
The paper proposes Repel and NARF strategies to address diversity and reward bias issues, enabling more effective global search in flow-matching models during test time.
Findings
Achieves approximately twice the performance improvement compared to scaling-free baselines.
Demonstrates effectiveness of diversity and reward strategies in test-time scaling.
Outperforms existing methods under the same computational budget.
Abstract
The performance of text-to-image diffusion models may be improved at test-time by scaling computation to search for a generated image that maximizes a given reward function. While existing trajectory level exploration methods improve the effectiveness of test-time scaling for standard diffusion models, they are largely incompatible with modern flow matching models, which use deterministic sampling. This imposes significant computational overhead on local trajectory search, making the trade-offs less favorable compared to global search. However, global search strategies like trajectory pruning face two critical challenges: the sharp, low-diversity distributions characteristic of scaled flow models that restrict the candidate search space, and the bias of reward models in the early denoising process. To overcome these limitations, we propose Repel, a token-level mechanism that encourages…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Medical Image Segmentation Techniques
