# Scaling laws for function diversity and specialization across socioeconomic and biological complex systems

**Authors:** Vicky Chuqiao Yang, James Holehouse, Hyejin Youn, José Ignacio Arroyo, Sidney Redner, Geoffrey B. West, Christopher P. Kempes

PMC · DOI: 10.1073/pnas.2509729123 · Proceedings of the National Academy of Sciences of the United States of America · 2026-02-12

## TL;DR

This paper presents a unified theory showing how diversity and specialization scale in complex systems like microbes, cities, and federal agencies, revealing universal patterns.

## Contribution

A general mathematical model and empirical framework that unify function diversity and specialization across diverse complex systems.

## Key findings

- Function diversity scales sublinearly with system size in most systems, following Heaps’ law.
- Cities show logarithmic scaling of function diversity, unlike other systems.
- A generalized Yule-Simon model with two parameters explains diversification and specialization across systems.

## Abstract

Diversification and specialization are central to complex adaptive systems, yet overarching principles across domains remain elusive. We introduce a general theory that unifies diversity and specialization across disparate systems, including microbes, federal agencies, companies, universities, and cities, characterized by two key parameters. We show from extensive data that function diversity scales with system size as a sublinear power law-resembling Heaps’ law-in all but cities, where it is logarithmic. Our theory explains both behaviors and suggests that function creation depends on system goals and structure: federal agencies tend to ensure functional coverage; cities slow new function growth as old ones expand, and cells occupy an intermediate position. Once functions are introduced, their growth follows a remarkably universal pattern across all systems.

Function diversity, the range of tasks individuals perform, and specialization, the distribution of function abundances, are fundamental to complex adaptive systems. In the absence of overarching principles, these properties have appeared domain-specific. Here, we introduce an empirical framework and a mathematical model for the diversification and specialization of functions across disparate systems, including bacteria, federal agencies, universities, corporations, and cities. We find that the number of functions grows sublinearly with system size, with exponents from 0.35 to 0.57, consistent with Heaps’ law. In contrast, cities exhibit logarithmic scaling. To explain these empirical findings, we generalize the Yule-Simon model by introducing two key parameters: a diversification parameter that characterizes how existing functions inhibit the creation of new ones and a specialization parameter that describes how a function’s attractiveness depends on its abundance. Our model enables cross-system comparisons, from microorganisms to metropolitan areas. The analysis suggests that what drives the creation of new functions depends on the system’s goals and structure: federal agencies tend to ensure comprehensive coverage of necessary functions; cities tend to slow the creation of new occupations as existing ones expand; and cells occupy an intermediate position. Once functions are introduced, their growth follows a remarkably universal pattern across all systems.

## Full-text entities

- **Genes:** UBXN11 (UBX domain protein 11) [NCBI Gene 91544] {aka COA-1, PP2243, SOC, SOCI, UBXD5}
- **Chemicals:** PNAS (MESH:D020135)
- **Species:** Bacteria Latreille et al. 1825 (Bacteria stick insect, genus) [taxon 629395], Homo sapiens (human, species) [taxon 9606], Bartonella henselae (species) [taxon 38323]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12912993/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12912993/full.md

## References

82 references — full list in the complete paper: https://tomesphere.com/paper/PMC12912993/full.md

---
Source: https://tomesphere.com/paper/PMC12912993