How Compositional Generalization and Creativity Improve as Diffusion Models are Trained
Alessandro Favero, Antonio Sclocchi, Francesco Cagnetta, Pascal Frossard, Matthieu Wyart

TL;DR
This paper explores how diffusion models learn hierarchical compositional rules, showing that their ability to generate coherent data improves with more training data and time, supported by theoretical and empirical analysis.
Contribution
It introduces a theoretical framework linking diffusion models' learning process to hierarchical clustering of features, revealing how sample complexity scales with feature context size.
Findings
Diffusion models learn composition rules with sample complexity similar to word2vec.
Higher-level features require more data to identify, leading to hierarchical learning.
Generated data coherence increases with training time and dataset size.
Abstract
Natural data is often organized as a hierarchical composition of features. How many samples do generative models need in order to learn the composition rules, so as to produce a combinatorially large number of novel data? What signal in the data is exploited to learn those rules? We investigate these questions in the context of diffusion models both theoretically and empirically. Theoretically, we consider a simple probabilistic context-free grammar - a tree-like graphical model used to represent the hierarchical and compositional structure of data such as language and images. We demonstrate that diffusion models learn the grammar's composition rules with the sample complexity required for clustering features with statistically similar context, a process similar to the word2vec algorithm. However, this clustering emerges hierarchically: higher-level features associated with longer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCreativity in Education and Neuroscience
MethodsDiffusion
