Structural Foundations for Leading Digit Laws: Beyond Probabilistic Mixtures
Vladimir Berman

TL;DR
This paper introduces a deterministic, structural approach to understanding leading digit distributions, moving beyond probabilistic models to explain diverse digit patterns through arithmetic and algorithmic properties.
Contribution
It develops a shift-invariant functional equation framework that captures various digit distributions, including deviations from scale-invariance, providing a unified foundation for digital phenomena.
Findings
Explains diverse digit distributions via explicit affine-plus-periodic formulas.
Analyzes digit patterns in prime numbers and recurrence sequences.
Highlights fractal and block-structured features in datasets.
Abstract
This article presents a modern deterministic framework for the study of leading significant digit distributions in numerical data. Rather than relying on traditional probabilistic or mixture-based explanations, we demonstrate that the observed frequencies of leading digits are determined by the underlying arithmetic, algorithmic, and structural properties of the data-generating process. Our approach centers on a shift-invariant functional equation, whose general solution is given by explicit affine-plus-periodic formulas. This structural formulation explains the diversity of digit distributions encountered in both empirical and mathematical datasets, including cases with pronounced deviations from logarithmic or scale-invariant profiles. We systematically analyze digit distributions in finite and infinite datasets, address deterministic sequences such as prime numbers and recurrence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
