TL;DR
MixFlow introduces a novel training strategy that combines mixed source distributions to reduce path curvature in rectified flows, leading to faster sampling and improved image generation quality.
Contribution
The paper proposes MixFlow, a new method that trains on mixtures of distributions to better align source and data, enhancing efficiency and quality in rectified flow models.
Findings
Improves generation quality by 12% in FID over standard rectified flow.
Reduces required sampling steps for high-quality generation.
Accelerates training convergence significantly.
Abstract
Diffusion models and their variations, such as rectified flows, generate diverse and high-quality images, but they are still hindered by slow iterative sampling caused by the highly curved generative paths they learn. An important cause of high curvature, as shown by previous work, is independence between the source distribution (standard Gaussian) and the data distribution. In this work, we tackle this limitation by two complementary contributions. First, we attempt to break away from the standard Gaussian assumption by introducing , a general formulation that conditions the source distribution on an arbitrary signal that aligns it better with the data distribution. Then, we present MixFlow, a simple but effective training strategy that reduces the generative path curvatures and considerably improves sampling efficiency. MixFlow trains a flow model on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
