Zipf and Heaps laws from dependency structures in component systems
Andrea Mazzolini, Jacopo Grilli, Eleonora De Lazzari, Matteo Osella,, Marco Cosentino Lagomarsino, Marco Gherardi

TL;DR
This paper introduces a simple, analytically tractable model based on dependency structures that explains the emergence of Zipf and Heaps laws in complex systems with multiple component copies.
Contribution
It extends previous binary models to include component multiplicities and provides a mean-field analytical approach to derive key regularities.
Findings
Reproduces Zipf's law with a core set of components
Shows Heaps law with three regimes: linear, sub-linear, and saturation
Analytical results match numerical simulations
Abstract
Complex natural and technological systems can be considered, on a coarse-grained level, as assemblies of elementary components: for example, genomes as sets of genes, or texts as sets of words. On one hand, the joint occurrence of components emerges from architectural and specific constraints in such systems. On the other hand, general regularities may unify different systems, such as the broadly studied Zipf and Heaps laws, respectively concerning the distribution of component frequencies and their number as a function of system size. Dependency structures (i.e., directed networks encoding the dependency relations between the components in a system) were proposed recently as a possible organizing principles underlying some of the regularities observed. However, the consequences of this assumption were explored only in binary component systems, where solely the presence or absence of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
