When Does Model Collapse Occur in Structured Interactive Learning?
Yuchen Wu, Kangjie Zhou, Weijie Su

TL;DR
This paper analyzes when model collapse occurs in structured interactive learning environments, revealing that the interaction graph topology critically influences model performance degradation.
Contribution
It formalizes model interactions using directed graphs, derives conditions for model collapse, and provides theoretical and empirical insights into the phenomenon.
Findings
Model collapse depends on the topology of the interaction graph.
Explicit necessary and sufficient conditions for collapse are established.
Finite-sample and asymptotic guarantees are provided for different estimators.
Abstract
The proliferation of generative artificial intelligence has given rise to an interactive learning environment, where model parameters are continuously updated using not only data generated by natural processes, but also synthetic outputs produced by other models. This paradigm introduces two major challenges: (1) training data are no longer drawn exclusively from the target population, undermining a core assumption of classical statistical learning, and (2) model training processes become inherently correlated, as models interact with one another through repeated exposure to each other's synthetic outputs in a potentially complex manner. Establishing reliable statistical inference in such structured interactive learning environments therefore remains an important open problem. In particular, there is growing concern about model collapse, a phenomenon in which the performance of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
