Improving Progressive Generation with Decomposable Flow Matching

Moayed Haji-Ali; Willi Menapace; Ivan Skorokhodov; Arpit Sahni; Sergey Tulyakov; Vicente Ordonez; Aliaksandr Siarohin

arXiv:2506.19839·cs.CV·June 25, 2025

Improving Progressive Generation with Decomposable Flow Matching

Moayed Haji-Ali, Willi Menapace, Ivan Skorokhodov, Arpit Sahni, Sergey Tulyakov, Vicente Ordonez, Aliaksandr Siarohin

PDF

Open Access

TL;DR

This paper introduces Decomposable Flow Matching (DFM), a simple multi-scale framework for progressive visual media generation that improves quality and convergence speed without complex architectures.

Contribution

DFM applies Flow Matching independently at each scale, enabling effective progressive generation with minimal modifications and architectural simplicity.

Findings

01

DFM achieves 35.2% improvement in FDD scores on ImageNet-1k 512px.

02

DFM outperforms prior multistage frameworks in visual quality.

03

DFM accelerates convergence when finetuning large models like FLUX.

Abstract

Generating high-dimensional visual modalities is a computationally intensive task. A common solution is progressive generation, where the outputs are synthesized in a coarse-to-fine spectral autoregressive manner. While diffusion models benefit from the coarse-to-fine nature of denoising, explicit multi-stage architectures are rarely adopted. These architectures have increased the complexity of the overall approach, introducing the need for a custom diffusion formulation, decomposition-dependent stage transitions, add-hoc samplers, or a model cascade. Our contribution, Decomposable Flow Matching (DFM), is a simple and effective framework for the progressive generation of visual media. DFM applies Flow Matching independently at each level of a user-defined multi-scale representation (such as Laplacian pyramid). As shown by our experiments, our approach improves visual quality for both…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques