Artificial Increasing Returns to Scale and the Problem of Sampling from Lognormals
Andres Gomez-Lievano, Vladislav Vysotsky, Jose Lobo

TL;DR
This paper demonstrates that apparent increasing returns to scale in urban data can be artificially generated by sampling variability in lognormal productivity distributions, and provides methods to distinguish real from spurious effects.
Contribution
It analytically explains how sampling from lognormal distributions can produce artificial scaling effects and offers a statistical test to identify genuine urban scaling phenomena.
Findings
Artificial scaling exponents emerge when sample sizes are small.
The model predicts when observed scaling is likely an artifact.
Empirical validation with Colombian wage data confirms the phenomenon.
Abstract
We show how increasing returns to scale in urban scaling can artificially emerge, systematically and predictably, without any sorting or positive externalities. We employ a model where individual productivities are independent and identically distributed lognormal random variables across all cities. We use extreme value theory to demonstrate analytically the paradoxical emergence of increasing returns to scale when the variance of log-productivity is larger than twice the log-size of the population size of the smallest city in a cross-sectional regression. Our contributions are to derive an analytical prediction for the artificial scaling exponent arising from this mechanism and to develop a simple statistical test to try to tell whether a given estimate is real or an artifact. Our analytical results are validated analyzing simulations and real microdata of wages across municipalities…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
