Heat Death of Generative Models in Closed-Loop Learning
Matteo Marchi, Stefano Soatto, Pratik Chaudhari, Paulo Tabuada

TL;DR
This paper investigates the stability of generative models in closed-loop learning, revealing that without enough external data, models tend to degenerate into trivial or uniform distributions, especially at non-zero temperatures.
Contribution
The paper provides a theoretical analysis of the dynamics of generative models in closed-loop settings using dynamical systems, highlighting conditions leading to collapse or uniformity.
Findings
Models degenerate without sufficient external data
Non-zero temperature causes distribution collapse or uniformity
Dynamical systems analysis explains stability conditions
Abstract
Improvement and adoption of generative machine learning models is rapidly accelerating, as exemplified by the popularity of LLMs (Large Language Models) for text, and diffusion models for image generation. As generative models become widespread, data they generate is incorporated into shared content through the public web. This opens the question of what happens when data generated by a model is fed back to the model in subsequent training campaigns. This is a question about the stability of the training process, whether the distribution of publicly accessible content, which we refer to as "knowledge", remains stable or collapses. Small scale empirical experiments reported in the literature show that this closed-loop training process is prone to degenerating. Models may start producing gibberish data, or sample from only a small subset of the desired data distribution (a phenomenon…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Reinforcement Learning in Robotics
MethodsSparse Evolutionary Training · Diffusion
