Flow Matching in Latent Space

Quan Dao; Hao Phung; Binh Nguyen; Anh Tran

arXiv:2307.08698·cs.CV·July 18, 2023·6 cites

Flow Matching in Latent Space

Quan Dao, Hao Phung, Binh Nguyen, Anh Tran

PDF

Open Access 1 Repo

TL;DR

This paper introduces a latent space flow matching framework for generative modeling that improves efficiency, scalability, and conditional generation capabilities, demonstrating strong results on multiple datasets.

Contribution

It pioneers the application of flow matching in latent spaces of autoencoders, enhancing computational efficiency and enabling conditional generation tasks.

Findings

01

Effective high-resolution image synthesis with reduced computational cost

02

Successful integration of various conditions into flow matching

03

Theoretical bounds on Wasserstein-2 distance between distributions

Abstract

Flow matching is a recent framework to train generative models that exhibits impressive empirical performance while being relatively easier to train compared with diffusion-based models. Despite its advantageous properties, prior methods still face the challenges of expensive computing and a large number of function evaluations of off-the-shelf solvers in the pixel space. Furthermore, although latent-based generative methods have shown great success in recent years, this particular model type remains underexplored in this area. In this work, we propose to apply flow matching in the latent spaces of pretrained autoencoders, which offers improved computational efficiency and scalability for high-resolution image synthesis. This enables flow-matching training on constrained computational resources while maintaining their quality and flexibility. Additionally, our work stands as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vinairesearch/lfm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neural Network Applications