A non-ergodic framework for understanding emergent capabilities in Large Language Models
Javier Mar\'in

TL;DR
This paper introduces a non-ergodic theoretical framework based on the adjacent possible to explain how large language models develop emergent capabilities through phase transitions influenced by architectural and training constraints.
Contribution
It provides the first mathematical model linking constraints to capability emergence in language models, supported by experimental validation across multiple models.
Findings
Capabilities emerge through discrete phase transitions.
Interactions of constraints shape the emergence process.
Experimental evidence supports the non-ergodic, phase transition model.
Abstract
Large language models have emergent capabilities that come unexpectedly at scale, but we need a theoretical framework to explain why and how they emerge. We prove that language models are actually non-ergodic systems while providing a mathematical framework based on Stuart Kauffman's theory of the adjacent possible (TAP) to explain capability emergence. Our resource-constrained TAP equation demonstrates how architectural, training, and contextual constraints interact to shape model capabilities through phase transitions in semantic space. We prove through experiments with three different language models that capacities emerge through discrete transitions guided by constraint interactions and path-dependent exploration. This framework provides a theoretical basis for understanding emergence in language models and guides the development of architectures that can guide capability emergence.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Expert finding and Q&A systems · Language and cultural evolution
