Universality of the $\pi^2/6$ Pathway in Avoiding Model Collapse
Apratim Dey, David Donoho

TL;DR
This paper investigates the universal behavior of the $rac{}{6}$ risk bound in avoiding model collapse across various statistical models, explaining why augment workflows prevent collapse while discard workflows do not.
Contribution
It demonstrates the universality of the $rac{}{6}$ risk bound in augment workflows across many models, extending previous specific results to a broad class of statistical models.
Findings
The $rac{}{6}$ risk bound applies broadly to many models in augment workflows.
Discard workflows tend to lead to model collapse, while augment workflows prevent it.
A flexible framework is provided to compare different workflows via Gaussian process simulations.
Abstract
Researchers in empirical machine learning recently spotlighted their fears of so-called Model Collapse. They imagined a discard workflow, where an initial generative model is trained with real data, after which the real data are discarded, and subsequently, the model generates synthetic data on which a new model is trained. They came to the conclusion that models degenerate as model-fitting generations proceed. However, other researchers considered an augment workflow, where the original real data continue to be used in each generation of training, augmented by synthetic data from models fit in all earlier generations. Empirical results on canonical datasets and learning procedures confirmed the occurrence of model collapse under the discard workflow and avoidance of model collapse under the augment workflow. Under the augment workflow, theoretical evidence also confirmed avoidance in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsParticle physics theoretical and experimental studies · Quantum chaos and dynamical systems · Computational Physics and Python Applications
MethodsLinear Regression
