Test-Time Compositional Generalization in Diffusion Models via Concept Discovery
Zekun Wang, Anant Gupta, Tianyi Zhu, Christopher J. MacLellan

TL;DR
This paper explores how pretrained diffusion models can discover and compose concepts at test time for out-of-distribution queries without predefined concept libraries, using gradient-based methods and a product-of-experts approach.
Contribution
It introduces a novel method for test-time concept discovery and composition in diffusion models, enabling out-of-distribution generation without prior concept annotations.
Findings
The proposed method outperforms baselines on ColorMNIST and CelebA composition benchmarks.
The time-indexed score geometry contains reusable density-mode concepts.
The approach enables test-time compositional generation without a predefined concept library.
Abstract
Compositional generalization requires models to produce novel configurations from familiar parts. In diffusion models, prior compositional generation methods typically assume that the relevant concepts or conditioning signals are already available. We instead ask whether a pretrained diffusion model can discover query-specific concepts from the time-indexed scores it learns for the noisy marginals and compose them at test time. Given a single out-of-distribution query, our method performs gradient ascent on at multiple noising timesteps to recover local density modes, maps these modes into clean-space Gaussians, greedily selects relevant prototypes with a submodular likelihood objective, and combines them into a product-of-experts (PoE) teacher model with an analytic score. This teacher model can be sampled directly through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
