A Limit Theory of Foundation Models: A Mathematical Approach to Understanding Emergent Intelligence and Scaling Laws

Jun Shu; Junxiong Jia; Deyu Meng; Zongben Xu

arXiv:2604.24037 · cs.LG · May 13, 2026

TL;DR

This paper develops a limit-theoretic mathematical framework for understanding emergent intelligence in foundation models, linking it to data size, model size (architecture), and the number of training steps.

Contribution

The paper introduces a formal limit theory for emergent intelligence, ties emergence to the properties of a limit architecture, and gives conditions under which emergence occurs.

Findings

01 Emergent intelligence depends on training steps, data size, and architecture.

02 The critical Lipschitz condition Lip(T)=1 supports existing theories.

03 Emergent intelligence can be realized in finite architectures despite infinite-dimensional theory.

Abstract

Emergent intelligence has played a major role in modern AI development. While existing studies primarily rely on empirical observations to characterize this phenomenon, a rigorous theoretical framework remains underexplored. This study attempts to develop a mathematical approach that formalizes emergent intelligence from the perspective of limit theory. Specifically, we introduce a performance function $E(N, P, K)$, dependent on data size $N$, model size $P$, and training steps $K$, to quantify intelligent behavior. We posit that intelligence emerges as a transition from finite to effectively infinite knowledge, and thus recast emergent intelligence as the existence of the limit $\lim_{N, P, K \to \infty} E(N, P, K)$, with emergent abilities corresponding to the limiting behavior. This limit theory helps reveal that emergent intelligence originates from the existence of a parameter-limit…
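To make the idea of a joint limit of a performance function concrete, here is a minimal numerical sketch. The functional form below is purely an assumption for illustration (a saturating power law with hypothetical coefficients and exponents); it is not the performance function defined in the paper, and the names `performance`, `E_STAR`, and `gap_along_diagonal` are invented here. The sketch only shows what it means for E(N, P, K) to approach a limiting value as N, P, and K grow together.

```python
# Toy illustration only: this saturating power-law form is an ASSUMPTION,
# not the performance function E(N, P, K) defined in the paper.
E_STAR = 1.0  # hypothetical limiting performance


def performance(n, p, k, a=0.1, b=0.1, c=0.1, alpha=0.5, beta=0.5, gamma=0.5):
    """Toy E(N, P, K) that approaches E_STAR as N, P, and K all grow."""
    return E_STAR - a * n ** -alpha - b * p ** -beta - c * k ** -gamma


def gap_along_diagonal(exponents):
    """Return |E(M, M, M) - E_STAR| for M = 10**e, for each e in exponents."""
    return [
        abs(performance(10.0 ** e, 10.0 ** e, 10.0 ** e) - E_STAR)
        for e in exponents
    ]


exponents = list(range(1, 9))
gaps = gap_along_diagonal(exponents)
for e, g in zip(exponents, gaps):
    print(f"N = P = K = 1e{e}: |E - E*| = {g:.6f}")
```

In this toy setting the gap shrinks monotonically along the diagonal N = P = K, which is consistent with the limit existing. Checking one path is only a necessary condition: a joint limit requires the same limiting value along every way of sending N, P, and K to infinity.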
