A Limit Theory of Foundation Models: A Mathematical Approach to Understanding Emergent Intelligence and Scaling Laws
Jun Shu, Junxiong Jia, Deyu Meng, Zongben Xu

TL;DR
This paper develops a mathematical limit theory framework to understand emergent intelligence in foundation models, linking it to model architecture, training scale, and data size.
Contribution
It introduces a formal limit theory for emergent intelligence, connecting it to the properties of a limit architecture and providing conditions for emergence.
Findings
Emergent intelligence depends on training steps, data size, and architecture.
The critical Lipschitz condition Lip(T)=1 supports existing theories.
Emergent intelligence can be realized in finite architectures despite infinite-dimensional theory.
Abstract
Emergent intelligence have played a major role in the modern AI development. While existing studies primarily rely on empirical observations to characterize this phenomenon, a rigorous theoretical framework remains underexplored. This study attempts to develop a mathematical approach to formalize emergent intelligence from the perspective of limit theory. Specifically, we introduce a performance function E(N, P, K), dependent on data size N, model size P and training steps K, to quantify intelligence behavior. We posit that intelligence emerges as a transition from finite to effectively infinite knowledge, and thus recast emergent intelligence as existence of the limit , with emergent abilities corresponding to the limiting behavior. This limit theory helps reveal that emergent intelligence originates from the existence of a parameter-limit…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
