FastFlow: Accelerating The Generative Flow Matching Models with Bandit Inference

Divya Jyoti Bajpai; Dhruv Bhardwaj; Soumya Roy; Tejas Duseja; Harsh Agarwal; Aashay Sandansing; Manjesh Kumar Hanawal

arXiv:2602.11105·cs.CV·February 12, 2026

FastFlow: Accelerating The Generative Flow Matching Models with Bandit Inference

Divya Jyoti Bajpai, Dhruv Bhardwaj, Soumya Roy, Tejas Duseja, Harsh Agarwal, Aashay Sandansing, Manjesh Kumar Hanawal

PDF

Open Access 3 Reviews

TL;DR

FastFlow is an adaptive inference framework that accelerates flow matching models by intelligently skipping denoising steps using bandit algorithms, achieving over 2.6x speedup without sacrificing quality.

Contribution

It introduces a plug-and-play method that approximates minor denoising steps and uses bandit learning to determine optimal skipping, generalizing across tasks.

Findings

01

Achieves over 2.6x speedup in generation tasks.

02

Maintains high-quality outputs comparable to full models.

03

Seamlessly integrates with existing flow matching pipelines.

Abstract

Flow-matching models deliver state-of-the-art fidelity in image and video generation, but the inherent sequential denoising process renders them slower. Existing acceleration methods like distillation, trajectory truncation, and consistency approaches are static, require retraining, and often fail to generalize across tasks. We propose FastFlow, a plug-and-play adaptive inference framework that accelerates generation in flow matching models. FastFlow identifies denoising steps that produce only minor adjustments to the denoising path and approximates them without using the full neural network models used for velocity predictions. The approximation utilizes finite-difference velocity estimates from prior predictions to efficiently extrapolate future states, enabling faster advancements along the denoising path at zero compute cost. This enables skipping computation at intermediary steps.…

Peer Reviews

Decision·ICLR 2026 Poster

Reviewer 01Rating 4Confidence 4

Strengths

I think the most notable point is that the method is training-free and easy to integrate into existing flow matching pipelines. The speedup figures are plausible given the cost model of flow samplers where every velocity evaluation dominates wall time. I also like recasting the step-size selection as a bandit objective, whichh directly encodes the speed-accuracy tradeoff

Weaknesses

- I think the novelty is thinner than the paper suggests. The velocity extrapolator collapses to a two step Adams Bashforth style predictor in uniform time-step. The work should at least acknowledge this equivalence and position itself relative to, for example, PNDM [1], and other linear multistep sampling strategies already common in diffusion code bases. Empirically, a direct comparison to a simple two step predictor that still evaluates the model at checkpoints would be informative. Moreover

Reviewer 02Rating 6Confidence 3

Strengths

- the motivation to accelerate flow-matching models is reasonable. - the proposed method is plug-and-play and introduce negalectable extra costs

Weaknesses

- missing comparisons on ImageNet 256

Reviewer 03Rating 6Confidence 3

Strengths

Unlike static acceleration schemes, FastFlow adapts to each sample’s complexity. The multi-armed bandit dynamically decides per timestep how many steps to skip, meaning simpler cases automatically run faster while complex cases get more compute. This adaptive inference is novel and ensures no one-size-fits-all schedule, leading to greater robustness across diverse inputs. The proposed framework is model-agnostic and plug-and-play, so it can be applied to existing pretrained flow-matching models

Weaknesses

It requires some exploration to learn the optimal skipping policy. You acknowledge that the speedup may not fully materialize in the very first steps or first few samples due to this exploration phase. In practice, you mitigate this by seeding the bandit with one full generation, but if a user only generates a handful of samples, the adaptive policy might not have time to reach peak efficiency. In scenarios with very few inference runs, the benefit of FastFlow could be less pronounced. The effe

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Image Enhancement Techniques