Empirical Tests of Zipf's law Mechanism In Open Source Linux Distribution
T. Maillart, D. Sornette, S. Spaeth, G. Von Krogh

TL;DR
This paper empirically tests the mechanisms behind Zipf's law in Linux distributions, confirming proportional growth, specific variance scaling, and tail behavior of package links over decades.
Contribution
It provides three empirical tests validating key assumptions of stochastic growth models explaining Zipf's law in open source Linux software.
Findings
Growth of package links follows Gibrat's law.
Variance of growth scales with the square root of time.
Tail of new package link distribution converges to Zipf's law exponent 1.
Abstract
The evolution of open source software projects in Linux distributions offers a remarkable example of a growing complex self-organizing adaptive system, exhibiting Zipf's law over four full decades. We present three tests of the usually assumed ingredients of stochastic growth models that have been previously conjectured to be at the origin of Zipf's law: (i) the growth observed between successive releases of the number of in-directed links of packages obeys Gibrat's law of proportional growth; (ii) the average growth increment of the number of in-directed links of packages over a time interval is proportional to , while its standard deviation is proportional to ; (iii) the distribution of the number of in-directed links of new packages appearing in evolving versions of Debian Linux distributions has a tail thinner than Zipf's law, with an exponent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
