Generator Matching: Generative modeling with arbitrary Markov processes

Peter Holderrieth; Marton Havasi; Jason Yim; Neta Shaul; Itai Gat,; Tommi Jaakkola; Brian Karrer; Ricky T. Q. Chen; Yaron Lipman

arXiv:2410.20587·cs.LG·February 28, 2025

Generator Matching: Generative modeling with arbitrary Markov processes

Peter Holderrieth, Marton Havasi, Jason Yim, Neta Shaul, Itai Gat,, Tommi Jaakkola, Brian Karrer, Ricky T. Q. Chen, Yaron Lipman

PDF

Open Access 3 Reviews

TL;DR

Generator Matching is a versatile framework for generative modeling using arbitrary Markov processes, unifying existing methods and enabling new process designs for improved multimodal data generation.

Contribution

It introduces a unified framework that leverages generator functions for arbitrary Markov processes, expanding the design space and enabling multimodal generative models.

Findings

01

Unifies diffusion, flow matching, and discrete diffusion models.

02

Enables construction of multimodal and superposed Markov generative models.

03

Empirically improves image and multimodal generation performance.

Abstract

We introduce Generator Matching, a modality-agnostic framework for generative modeling using arbitrary Markov processes. Generators characterize the infinitesimal evolution of a Markov process, which we leverage for generative modeling in a similar vein to flow matching: we construct conditional generators which generate single data points, then learn to approximate the marginal generator which generates the full data distribution. We show that Generator Matching unifies various generative modeling methods, including diffusion models, flow matching and discrete diffusion models. Furthermore, it expands the design space to new and unexplored Markov processes such as jump processes. Finally, Generator Matching enables the construction of superpositions of Markov generative models and enables the construction of multimodal models in a rigorous manner. We empirically validate our method on…

Peer Reviews

Decision·ICLR 2025 Oral

Reviewer 01Rating 8Confidence 3

Strengths

**Significance and originality.** The main result of the paper lies in developing a generalization of the existing generative models into one theoretical framework. For example, the universal characterization of generators (provided by Theorem 1) appears for the first time in machine learning literature. Moreover, the result is stated for time-inhomogeneous Markov processes while the vast majority of mathematical literature focuses on studying properties of time-homogeneous ones. The paper also

Weaknesses

The main weakness of the paper is that the authors ignore previous results that have been done for jump processes [1, 2]. For example, [1] introduces a theoretical framework for constructing Diffusion Models in discrete-state spaces for an arbitrary Markov process, which can be either discrete time or continuous-time in nature. [1] uses generators and adjoint generators together with Chapman–Kolmogorov equations. Moreover, [1] constructs a generative model using the pure-death process, which, in

Reviewer 02Rating 8Confidence 4

Strengths

As discussed above, The GM framework extends the FM approach, building on the well-established relationships between marginal distributions/scores/flows and their data-conditional counterparts. From my persepective, this framework presents several notable innovations: 1. The introduction of jump, a type of stochastic process in the continuous domain with non-continuous trajectories, which largely unexplored within the generative AI community. Particularly compelling is the result showing that t

Weaknesses

1. The writing style in some part of the paper, especially section 6, feels overly general and abstract. While the GM framework is indeed comprehensive, it primarily applies to only 4 specific examples: flow, diffusion, jump in $\mathbb{R}^d$, and CTMC in finite state spaces. Although the theoretical concepts like target-affine (TA) loss and Bregman divergences are elegant, they lack practical utility in these contexts. This abstraction, without sufficient illustrative examples, limits the reade

Reviewer 03Rating 8Confidence 2

Strengths

1. This paper has the ambitious goal of presenting a unifying framework for many (currently popular) generative modeling paradigms. Even unifying continuous and discrete models, and jump processes. This can be very relevant for the community, and the theory seems adequate and well supported. 2. Experiments are performed on diverse applications: proteins and images

Weaknesses

1. It is unclear to me, and difficult to gauge, in how far the GM framework has benefits over *simple, naive* combinations of different modeling paradigms. I.e. how will the GM framework help future researchers in practice to build better models, other than only providing a theoretical unificiation?

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsElectric Power System Optimization

MethodsDiffusion