Flow Matching in Latent Space
Quan Dao, Hao Phung, Binh Nguyen, Anh Tran

TL;DR
This paper introduces a latent space flow matching framework for generative modeling that improves efficiency, scalability, and conditional generation capabilities, demonstrating strong results on multiple datasets.
Contribution
It pioneers the application of flow matching in latent spaces of autoencoders, enhancing computational efficiency and enabling conditional generation tasks.
Findings
Effective high-resolution image synthesis with reduced computational cost
Successful integration of various conditions into flow matching
Theoretical bounds on Wasserstein-2 distance between distributions
Abstract
Flow matching is a recent framework to train generative models that exhibits impressive empirical performance while being relatively easier to train compared with diffusion-based models. Despite its advantageous properties, prior methods still face the challenges of expensive computing and a large number of function evaluations of off-the-shelf solvers in the pixel space. Furthermore, although latent-based generative methods have shown great success in recent years, this particular model type remains underexplored in this area. In this work, we propose to apply flow matching in the latent spaces of pretrained autoencoders, which offers improved computational efficiency and scalability for high-resolution image synthesis. This enables flow-matching training on constrained computational resources while maintaining their quality and flexibility. Additionally, our work stands as a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications
