TL;DR
The paper introduces F2D2, a method that drastically reduces the neural function evaluations needed for both sampling and likelihood evaluation in flow-based models, enabling faster and more efficient generative modeling.
Contribution
F2D2 jointly distills sampling and likelihood evaluation in flow models, achieving two orders of magnitude speedup with minimal additional components.
Findings
F2D2 achieves accurate likelihood with few NFEs.
F2D2 maintains high sample quality with fewer evaluations.
A lightweight self-guidance method outperforms large models with fewer steps.
Abstract
Log-likelihood evaluation enables important capabilities in generative models, including model comparison, certain fine-tuning objectives, and many downstream applications. Yet paradoxically, some of today's best generative models -- diffusion and flow-based models -- still require hundreds to thousands of neural function evaluations (NFEs) to compute a single likelihood. While recent distillation methods have successfully accelerated sampling to just a few steps, they achieve this at the cost of likelihood tractability: existing approaches either abandon likelihood computation entirely or still require expensive integration over full trajectories. We present fast flow joint distillation (F2D2), a framework that simultaneously reduces the number of NFEs required for both sampling and likelihood evaluation by two orders of magnitude. Our key insight is that in continuous normalizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
