The common patterns of abundance: the log series and Zipf's law
Steven A. Frank

TL;DR
This paper presents a unified theoretical framework explaining why Zipf's law and the log series are common abundance patterns across various domains, based on invariances and conserved averages.
Contribution
It introduces a general theory that unifies Zipf's law and the log series as endpoints of a broader probability distribution framework.
Findings
Zipf's law arises in language, city sizes, and corporate sizes.
The log series pattern appears in biological species abundances.
The theory encompasses intermediate patterns between Zipf's law and the log series.
Abstract
In a language corpus, the probability that a word occurs times is often proportional to . Assigning rank, , to words according to their abundance, vs typically has a slope of minus one. That simple Zipf's law pattern also arises in the population sizes of cities, the sizes of corporations, and other patterns of abundance. By contrast, for the abundances of different biological species, the probability of a population of size is typically proportional to , declining exponentially for larger , the log series pattern. This article shows that the differing patterns of Zipf's law and the log series arise as the opposing endpoints of a more general theory. The general theory follows from the generic form of all probability patterns as a consequence of conserved average values and the associated invariances of scale. To understand the common patterns…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
