Latent Abstractions in Generative Diffusion Models

Giulio Franzese; Mattia Martini; Giulio Corallo; Paolo Papotti; Pietro; Michiardi

arXiv:2410.03368·cs.LG·October 7, 2024

Latent Abstractions in Generative Diffusion Models

Giulio Franzese, Mattia Martini, Giulio Corallo, Paolo Papotti, Pietro, Michiardi

PDF

Open Access 3 Reviews

TL;DR

This paper introduces a new theoretical framework for understanding how diffusion models generate high-dimensional data by leveraging low-dimensional latent abstractions, supported by empirical validation.

Contribution

It extends existing theories with a novel formulation of joint dynamics and an information-theoretic measure, providing a new perspective on SDE-based generative models.

Findings

01

Diffusion models can be viewed as non-linear filters influenced by latent abstractions.

02

Latent abstractions emerge at different stages of the generative process.

03

The theory is empirically validated with experimental results.

Abstract

In this work we study how diffusion-based generative models produce high-dimensional data, such as an image, by implicitly relying on a manifestation of a low-dimensional set of latent abstractions, that guide the generative process. We present a novel theoretical framework that extends NLF, and that offers a unique perspective on SDE-based generative models. The development of our theory relies on a novel formulation of the joint (state and measurement) dynamics, and an information-theoretic measure of the influence of the system state on the measurement process. According to our theory, diffusion models can be cast as a system of SDE, describing a non-linear filter in which the evolution of unobservable latent abstractions steers the dynamics of an observable measurement process (corresponding to the generative pathways). In addition, we present an empirical study to validate our…

Peer Reviews

Decision·Submitted to ICLR 2025

Reviewer 01Rating 3Confidence 2

Strengths

They rigorously define the connection between nonlinear filtering and diffusion.

Weaknesses

The paper asks the reader to spend hours understanding their notations and results before describing the motivation in section 4. The paper is very challenging to read as is and I strongly recommend the authors clearly and succinctly describe their contributions before the technical details, which should also mostly be moved to the appendix. While a connection between two mathematical frameworks is challenging to derive, it is only interesting if it leads to a deeper understanding of either te

Reviewer 02Rating 6Confidence 2

Strengths

* Well-motivated introduction, situating the work within existing literature and highlighting its contributions. * The structure is logically organized, and the mathematical formulations are rigorous, enhancing the soundness of the theory. * The proposed information-theoretic measure effectively captures changes in latent abstractions during generation, with experiments showing jumps in the measure that align with theoretical expectations.

Weaknesses

I have no significant weaknesses to report. Please see questions for minor points of clarification.

Reviewer 03Rating 6Confidence 3

Strengths

The NLF viewpoint on diffusion model is to my knowledge new, and potentially interesting. The use of mutual information measurement is also helpful in quantifying the effect of noising/denoising on the latent variables (here labels). The experiments seem carefully carried out.

Weaknesses

The methodology is sound provided that the datasets can be understood as being dictated by a low-dimensional set of latent variables, and that it is possible to understand the generative process as inferring the latent variables in a Baysian way. The experiments are also conducted solely on such a synthetic dataset, with each sample having a predetermined set of labels. Either the including of a theoretical discussion on this underlying basic assumption, or devising experiments on more diverse d

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques

MethodsSparse Evolutionary Training · Diffusion