A non-ergodic framework for understanding emergent capabilities in Large   Language Models

Javier Mar\'in

arXiv:2501.01638·cs.CL·March 3, 2025

A non-ergodic framework for understanding emergent capabilities in Large Language Models

Javier Mar\'in

PDF

Open Access

TL;DR

This paper introduces a non-ergodic theoretical framework based on the adjacent possible to explain how large language models develop emergent capabilities through phase transitions influenced by architectural and training constraints.

Contribution

It provides the first mathematical model linking constraints to capability emergence in language models, supported by experimental validation across multiple models.

Findings

01

Capabilities emerge through discrete phase transitions.

02

Interactions of constraints shape the emergence process.

03

Experimental evidence supports the non-ergodic, phase transition model.

Abstract

Large language models have emergent capabilities that come unexpectedly at scale, but we need a theoretical framework to explain why and how they emerge. We prove that language models are actually non-ergodic systems while providing a mathematical framework based on Stuart Kauffman's theory of the adjacent possible (TAP) to explain capability emergence. Our resource-constrained TAP equation demonstrates how architectural, training, and contextual constraints interact to shape model capabilities through phase transitions in semantic space. We prove through experiments with three different language models that capacities emerge through discrete transitions guided by constraint interactions and path-dependent exploration. This framework provides a theoretical basis for understanding emergence in language models and guides the development of architectures that can guide capability emergence.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Expert finding and Q&A systems · Language and cultural evolution