Analyses of Baby Name Popularity Distribution in U.S. for the Last 131   Years

Wentian Li

arXiv:1205.1078·nlin.AO·January 23, 2013·Complex.

Analyses of Baby Name Popularity Distribution in U.S. for the Last 131 Years

Wentian Li

PDF

TL;DR

This study analyzes 131 years of U.S. baby name data, revealing that name popularity distribution fits a combined Beta and power-law model rather than traditional Zipf or Beta models alone.

Contribution

It introduces a novel empirical model combining Beta and power-law functions to accurately describe baby name popularity distribution.

Findings

01

Name popularity follows a piecewise distribution with Beta and power-law components.

02

Neither pure Zipf's law nor Beta distribution alone fit the data well.

03

The combined model provides a better fit for the entire dataset.

Abstract

We examine the complete dataset of baby name popularity collected by U.S. Social Security Administration for the last 131 years (1880-2010). The ranked baby name popularity can be fitted empirically by a piecewise function consisting of Beta function for the high-ranking names and power-law function for low-ranking names, but not power-law (Zipf's law) or Beta function by itself.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.