On the discrepancy of powers of random variables

Nicolas Chenavier; Dominique Schneider

arXiv:1705.04626·math.PR·May 15, 2017

On the discrepancy of powers of random variables

Nicolas Chenavier, Dominique Schneider

PDF

TL;DR

This paper investigates how the distribution of the mantissas of powered independent random variables converges to Benford's law, providing bounds and conditions for almost sure convergence.

Contribution

It offers an upper bound on the deviation from Benford's law for powers of random variables and establishes almost sure convergence under polynomial growth of exponents.

Findings

01

Deviation converges to zero almost surely for polynomial growth of exponents.

02

Provides explicit upper bounds for the deviation from Benford's law.

03

Demonstrates convergence behavior of mantissa distributions of powered variables.

Abstract

Let $(d_{n})$ be a sequence of positive numbers and let $(X_{n})$ be a sequence of positive independent random variables. We provide an upper bound for the deviation between the distribution of the mantissaes of $(X_{n}^{d_{n}})$ and the Benford's law. If $d_{n}$ goes to infinity at a rate at most polynomial, this deviation converges a.s. to 0 as $N$ goes to infinity.

Figures1

Click any figure to enlarge with its caption.

Tables1

Table 1. Table 1: a simulation of the frequencies of the first significant digits of X 1 d , … , X N d superscript subscript 𝑋 1 𝑑 … superscript subscript 𝑋 𝑁 𝑑 X_{1}^{d},\ldots,X_{N}^{d} , where X n subscript 𝑋 𝑛 X_{n} has a uniform distribution on [ 1 , n ] 1 𝑛 [1,n] for each n ≥ 1 𝑛 1 n\geq 1 , with N = 1000 𝑁 1000 N=1000 and d = 2 𝑑 2 d=2 ( Scilab © superscript Scilab © {\text{Scilab}}^{\copyright} ).

First digit	$(X_{n}^{d})$	Benford’s law
1	0.308	0.306
2	0.204	0.184
3	0.096	0.116
4	0.116	0.106
5	0.084	0.082
6	0.068	0.055
7	0.060	0.050
8	0.028	0.053
9	0.036	0.048

Equations111

N \to \infty lim \frac{1}{N} n = 1 \sum N 1_{F (x_{n}) = k} = lo g_{10} (1 + \frac{1}{k}), k = 1, \dots, 9,

N \to \infty lim \frac{1}{N} n = 1 \sum N 1_{F (x_{n}) = k} = lo g_{10} (1 + \frac{1}{k}), k = 1, \dots, 9,

μ_{10} ([1, a)) = lo g_{10} a, (1 \leq a < 10),

μ_{10} ([1, a)) = lo g_{10} a, (1 \leq a < 10),

N \to \infty lim \frac{1}{N} n = 1 \sum N 1_{M_{10} (x_{n}) \in [1, a)} = μ_{10} ([1, a)) .

N \to \infty lim \frac{1}{N} n = 1 \sum N 1_{M_{10} (x_{n}) \in [1, a)} = μ_{10} ([1, a)) .

N \to \infty lim \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} x_{n}} = 0.

N \to \infty lim \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} x_{n}} = 0.

D_{N} (u) = 0 \leq a < b < 1 sup \frac{1}{N} n = 1 \sum N 1_{[a, b)} ({u_{n}}) - (b - a) .

D_{N} (u) = 0 \leq a < b < 1 sup \frac{1}{N} n = 1 \sum N 1_{[a, b)} ({u_{n}}) - (b - a) .

D \sim_{N} (x) = 1 \leq s < t < 10 sup \frac{1}{N} n = 1 \sum N 1_{[s, t)} (M_{10} (x_{n})) - μ_{10} ([s, t)) .

D \sim_{N} (x) = 1 \leq s < t < 10 sup \frac{1}{N} n = 1 \sum N 1_{[s, t)} (M_{10} (x_{n})) - μ_{10} ([s, t)) .

E [e^{2 iπ h l o g X_{n}}] \leq c_{1} h^{- γ} + c_{2} h^{δ} r_{n} .

E [e^{2 iπ h l o g X_{n}}] \leq c_{1} h^{- γ} + c_{2} h^{δ} r_{n} .

D \sim_{N} (X^{(d)} (ω)) \leq C_{0} (ω) \cdot (lo g N)^{2} \cdot N^{- \frac{1}{2}} + c_{0} (\frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + (lo g N)^{\frac{1}{δ + 1}} \cdot N^{- \frac{m i n { β - δ θ , 1 }}{δ + 1}}),

D \sim_{N} (X^{(d)} (ω)) \leq C_{0} (ω) \cdot (lo g N)^{2} \cdot N^{- \frac{1}{2}} + c_{0} (\frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + (lo g N)^{\frac{1}{δ + 1}} \cdot N^{- \frac{m i n { β - δ θ , 1 }}{δ + 1}}),

1 \leq s < t < 10 sup \frac{1}{N} n = 1 \sum N 1_{[s, t)} (M_{10} (X_{n}^{d_{n}} (ω))) - μ_{10} ([s, t)) \leq C (ω) \cdot \frac{1}{N} n = 1 \sum N d_{n}^{- γ} .

1 \leq s < t < 10 sup \frac{1}{N} n = 1 \sum N 1_{[s, t)} (M_{10} (X_{n}^{d_{n}} (ω))) - μ_{10} ([s, t)) \leq C (ω) \cdot \frac{1}{N} n = 1 \sum N d_{n}^{- γ} .

D \sim_{N} (X^{d} (ω)) \leq C_{0} (ω) \cdot (lo g N)^{2} \cdot N^{- \frac{1}{2}} + c_{0} (d^{- γ} + (lo g N)^{\frac{1}{δ + 1}} \cdot N^{- \frac{m i n { β , 1 }}{δ + 1}}),

D \sim_{N} (X^{d} (ω)) \leq C_{0} (ω) \cdot (lo g N)^{2} \cdot N^{- \frac{1}{2}} + c_{0} (d^{- γ} + (lo g N)^{\frac{1}{δ + 1}} \cdot N^{- \frac{m i n { β , 1 }}{δ + 1}}),

D \sim_{N} (x) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} x_{n}} .

D \sim_{N} (x) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} x_{n}} .

E N > K \geq 1 sup T \geq 1 sup exp ϵ \cdot \frac{max _{∣ t ∣ \leq T} \sum _{n = K + 1}^{N} a _{n} ( e ^{2 iπ t Y_{n}} - E [ e ^{2 iπ t Y_{n}} ] ) ^{2}}{lo g ( 1 + T ) lo g ( 1 + N ^{η} ) \sum _{n = K + 1}^{N} ∣ a _{n} ∣ ^{2}} \leq C .

E N > K \geq 1 sup T \geq 1 sup exp ϵ \cdot \frac{max _{∣ t ∣ \leq T} \sum _{n = K + 1}^{N} a _{n} ( e ^{2 iπ t Y_{n}} - E [ e ^{2 iπ t Y_{n}} ] ) ^{2}}{lo g ( 1 + T ) lo g ( 1 + N ^{η} ) \sum _{n = K + 1}^{N} ∣ a _{n} ∣ ^{2}} \leq C .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N (e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}]) .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] + h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N (e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}]) .

E N > 1 sup T \geq 1 sup ∣ t ∣ \leq T max \frac{\sum _{n = 2}^{N} ( e ^{2 iπ t l o g_{10} X_{n}^{d_{n}}} - E [ e ^{2 iπ t l o g_{10} X_{n}^{d_{n}}} ] ) ^{2}}{lo g ( 1 + T ) lo g ( 1 + N ^{η} ) ( N - 1 )} \leq C .

E N > 1 sup T \geq 1 sup ∣ t ∣ \leq T max \frac{\sum _{n = 2}^{N} ( e ^{2 iπ t l o g_{10} X_{n}^{d_{n}}} - E [ e ^{2 iπ t l o g_{10} X_{n}^{d_{n}}} ] ) ^{2}}{lo g ( 1 + T ) lo g ( 1 + N ^{η} ) ( N - 1 )} \leq C .

\frac{1}{N} n = 1 \sum N (e^{2 iπ t l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ t l o g_{10} X_{n}^{d_{n}}}]) \leq c (ω) \cdot lo g (1 + T) \cdot \frac{lo g ( 1 + N ^{η} )}{N} .

\frac{1}{N} n = 1 \sum N (e^{2 iπ t l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ t l o g_{10} X_{n}^{d_{n}}}]) \leq c (ω) \cdot lo g (1 + T) \cdot \frac{lo g ( 1 + N ^{η} )}{N} .

h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N (e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}]) \leq c (ω) h = 1 \sum H \frac{1}{h} lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} \leq c^{'} (ω) lo g H lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} .

h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N (e^{2 iπ h l o g_{10} X_{n}^{d_{n}}} - E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}]) \leq c (ω) h = 1 \sum H \frac{1}{h} lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} \leq c^{'} (ω) lo g H lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} .

\frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq \frac{1}{N} n = 1 \sum N_{0} E [e^{2 iπ h d_{n} l o g_{10} X_{n}}] + \frac{1}{N} n = N_{0} + 1 \sum N E [e^{2 iπ h d_{n} l o g_{10} X_{n}}] .

\frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq \frac{1}{N} n = 1 \sum N_{0} E [e^{2 iπ h d_{n} l o g_{10} X_{n}}] + \frac{1}{N} n = N_{0} + 1 \sum N E [e^{2 iπ h d_{n} l o g_{10} X_{n}}] .

\frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq \frac{N _{0}}{N} + c_{1} \cdot \frac{1}{N} n = 1 \sum N (\frac{h d _{n}}{lo g ( 10 )})^{- γ} + c_{2} \cdot \frac{1}{N} n = 1 \sum N (\frac{h d _{n}}{lo g ( 10 )})^{δ} r_{n} .

\frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq \frac{N _{0}}{N} + c_{1} \cdot \frac{1}{N} n = 1 \sum N (\frac{h d _{n}}{lo g ( 10 )})^{- γ} + c_{2} \cdot \frac{1}{N} n = 1 \sum N (\frac{h d _{n}}{lo g ( 10 )})^{δ} r_{n} .

h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq c \cdot (\frac{lo g H}{N} + \frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + \frac{1}{N} n = 1 \sum N (d_{n})^{δ} r_{n} \cdot H^{δ}) .

h = 1 \sum H \frac{1}{h} \frac{1}{N} n = 1 \sum N E [e^{2 iπ h l o g_{10} X_{n}^{d_{n}}}] \leq c \cdot (\frac{lo g H}{N} + \frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + \frac{1}{N} n = 1 \sum N (d_{n})^{δ} r_{n} \cdot H^{δ}) .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + c^{''} (ω) \cdot lo g H \cdot lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} + c \cdot (\frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + lo g N \cdot N^{- m i n {β - δ θ, 1}} \cdot H^{δ}) .

D \sim_{N} (X^{(d)}) \leq \frac{1}{H + 1} + c^{''} (ω) \cdot lo g H \cdot lo g (1 + H) \cdot \frac{lo g ( 1 + N ^{η} )}{N} + c \cdot (\frac{1}{N} n = 1 \sum N (d_{n})^{- γ} + lo g N \cdot N^{- m i n {β - δ θ, 1}} \cdot H^{δ}) .

H = ⌊ (lo g N)^{- \frac{1}{δ + 1}} \cdot N^{\frac{m i n { β - δ θ , 1 }}{δ + 1}} ⌋ + 1.

H = ⌊ (lo g N)^{- \frac{1}{δ + 1}} \cdot N^{\frac{m i n { β - δ θ , 1 }}{δ + 1}} ⌋ + 1.

E [e^{2 iπ h l o g X_{n}}] = \frac{1}{n} k = 1 \sum n e^{2 iπ h l o g k} \leq \frac{1}{n} + \frac{1}{n} k = ⌊ n ⌋ + 1 \sum n e^{2 iπ h l o g k},

E [e^{2 iπ h l o g X_{n}}] = \frac{1}{n} k = 1 \sum n e^{2 iπ h l o g k} \leq \frac{1}{n} + \frac{1}{n} k = ⌊ n ⌋ + 1 \sum n e^{2 iπ h l o g k},

E [e^{2 iπ h l o g X_{n}}] \leq \frac{8}{h} + \frac{1 + 4 h}{n} + \frac{6}{n} + \frac{3 h}{n n} .

E [e^{2 iπ h l o g X_{n}}] \leq \frac{8}{h} + \frac{1 + 4 h}{n} + \frac{6}{n} + \frac{3 h}{n n} .

d \to \infty lim N \to \infty lim sup D \sim_{N} (X^{d}) = 0.

d \to \infty lim N \to \infty lim sup D \sim_{N} (X^{d}) = 0.

N \to \infty lim sup D \sim_{N} (X^{d}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} N \to \infty lim sup \frac{1}{N} n = 1 \sum N E [e^{2 iπ h d l o g_{10} X_{n}}] .

N \to \infty lim sup D \sim_{N} (X^{d}) \leq \frac{1}{H + 1} + h = 1 \sum H \frac{1}{h} N \to \infty lim sup \frac{1}{N} n = 1 \sum N E [e^{2 iπ h d l o g_{10} X_{n}}] .

d \to \infty lim N \to \infty lim sup \frac{1}{N} n = 1 \sum N E [e^{2 iπ h d l o g_{10} X_{n}}] = 0.

d \to \infty lim N \to \infty lim sup \frac{1}{N} n = 1 \sum N E [e^{2 iπ h d l o g_{10} X_{n}}] = 0.

E [e^{2 iπ h l o g X_{n}}] \leq c_{1} h^{- 1} + c_{2} h n^{- β}

E [e^{2 iπ h l o g X_{n}}] \leq c_{1} h^{- 1} + c_{2} h n^{- β}

k = 1 \sum N e^{2 iπ h l o g k} P (X_{n} = k) = P (X_{n} = N + 1) j = 1 \sum N e^{2 iπ h l o g j} - k = 1 \sum N j = 1 \sum k e^{2 iπ h l o g j} (P (X_{n} = k + 1) - P (X_{n} = k)) .

k = 1 \sum N e^{2 iπ h l o g k} P (X_{n} = k) = P (X_{n} = N + 1) j = 1 \sum N e^{2 iπ h l o g j} - k = 1 \sum N j = 1 \sum k e^{2 iπ h l o g j} (P (X_{n} = k + 1) - P (X_{n} = k)) .

k = 1 \sum N j = 1 \sum k e^{2 iπ h l o g j} (P (X_{n} = k + 1) - P (X_{n} = k)) \leq \frac{c _{1}}{h} + h c_{2} n^{- β},

k = 1 \sum N j = 1 \sum k e^{2 iπ h l o g j} (P (X_{n} = k + 1) - P (X_{n} = k)) \leq \frac{c _{1}}{h} + h c_{2} n^{- β},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

On the discrepancy of powers of random variables

Nicolas Chenavier111Université Littoral Côte d’Opale, EA 2797, LMPA, 50 rue Ferdinand Buisson, F-62228 Calais, France. E-mail: [email protected], corresponding author, Dominique Schneider 222Université Littoral Côte d’Opale, EA 2797, LMPA, 50 rue Ferdinand Buisson, F-62228 Calais, France. E-mail: [email protected]

Abstract

Let $(d_{n})$ be a sequence of positive numbers and let $(X_{n})$ be a sequence of positive independent random variables. We provide an upper bound for the deviation between the distribution of the mantissaes of $(X_{n}^{d_{n}})$ and the Benford’s law. If $d_{n}$ goes to infinity at a rate at most polynomial, this deviation converges a.s. to 0 as $N$ goes to infinity.

Keywords: Benford’s law; discrepancy; mantissa.

AMS 2010 Subject Classifications: 60B10 . 11K38

1 Introduction

A sequence of positive numbers $(x_{n})$ is said to satisfy the first digit phenomenon if

[TABLE]

where $F(x_{n})$ is the first digit of $x_{n}$ , and where $\mathbb{1}_{A}\,$ denotes the indicator function of any subset $A$ . Such a phenomenon was observed by Benford and Newcomb on real life numbers [1, 13]. It is extensively used in various domains, such as fraud detection [14], computer design [8] and image processing [17]. As an extension of the first digit phenomenon, the notion of Benford sequence is introduced as follows. Let $\mu_{10}$ be the measure on the interval $[1,10)$ defined by

[TABLE]

where $\log_{10}a$ denotes the logarithm in base $10$ of $a$ . Let ${\mathcal{M}}_{10}(x)$ be the mantissa in base $10$ of a positive number $x$ , i.e. ${\mathcal{M}}_{10}(x)$ is the unique number in $[1,10)\,$ such that there exists an integer $k$ satisfying $x={\mathcal{M}}_{10}(x)10^{k}$ . A set of numbers $(x_{n})$ is referred to as a Benford sequence if for any $1\leq a<10$ , we have

[TABLE]

In particular, each Benford sequence satisfies the first digit phenomenon since $F(x)=k$ if and only if $\mathcal{M}_{10}(x)\in[k,k+1)$ , with $x>0$ , $k=1,\ldots,9$ . For instance, the sequences $(2^{n})$ , $(n!)$ and $(n^{n})$ are Benford. For various examples of sequences of positive numbers whose mantissae are (or approach to be) distributed with respect to $\mu_{10}$ , see e.g. [5, 6]. More recently, several authors have provided examples of sequences of random variables whose mantissa distribution converges to $\mu_{10}$ [3, 10, 16] or whose the sequence of mantissae is almost surely distributed with respect to $\mu_{10}$ . For a wide panorama on Benford sequences, see the reference books [2, 12].

It is well known that a sequence $(x_{n})$ of positive numbers is Benford in base $10$ if and only if the sequence of its fractional parts $(\{\log_{10}x_{n}\})$ is uniformly distributed in $[0,1)$ . According to the Weyl’s criterion (see e.g. [9], p7), the sequence $(x_{n})$ is Benford if and only if, for any $h\in\mathbf{Z}^{*}$ , we have

[TABLE]

To define a deviation between a sequence and the Benford’s law, the notion of discrepancy is introduced as follows. Let $u=(u_{n})$ be a sequence of real numbers. The discrepancy modulo 1 of order $N$ of $u$ , associated with the natural density, is defined as

[TABLE]

For more details on the discrepancy, see e.g. [9], p100–131. For a sequence $x=(x_{n})$ , if we set $x_{n}=10^{u_{n}}$ , we write $\overset{\sim}{D}_{N}(x)={D}_{N}(u)$ . The quantity $\overset{\sim}{D}_{N}(x)$ deals with the deviation between $\mu_{10}$ and the distribution of the first $N$ terms of $(\mathcal{M}_{10}(x_{n}))$ since $\{\log_{10}x_{n}\}=\log_{10}(\mathcal{M}_{10}(x_{n}))$ . Hence

[TABLE]

In particular, $x=(x_{n})$ is Benford if and only if $\overset{\sim}{D}_{N}(x)$ converges to 0 as $N$ goes to infinity. Through misuse of language, we also say that $\overset{\sim}{D}_{N}(x)$ is the discrepancy of $x=(x_{n})$ .

In this paper, we consider the following problem. Let $(X_{n})$ be a sequence of positive independent random variables. We say that $(X_{n})$ is a.s. Benford if $\omega-\operatorname{\mathbb{P}}a.s.$ the sequence $(X_{n}(\omega))$ is Benford. As observed in [7], several deterministic sequences at a power $d$ tend to be Benford when the power $d$ is large enough. The aim of our paper is to provide general conditions on the distribution of the random sequence $X=(X_{n})$ to ensure that $X^{(d)}=(X_{n}^{d_{n}})$ is a.s. Benford for any sequence of positive numbers $(d_{n})$ such that $d_{n}$ converges to infinity at a rate at most polynomial.

First, we give some notation. In what follows, the function $\log$ denotes the natural logarithm. For any functions $f$ , $g$ , we write $g(x)\underset{x\rightarrow\infty}{\sim}f(x)$ if and only if $\frac{g(x)}{f(x)}\underset{x\rightarrow\infty}{\longrightarrow}1$ . Moreover, we write $g(x)=O(f(x))$ if and only if there exists a positive number $M$ and a real number $x_{0}$ such that $|g(x)|\leq M|f(x)|$ for any $x\geq x_{0}$ .

We are now prepared to state our first theorem, which provides an upper bound for the discrepancy.

Theorem 1.

Let $(d_{n})$ be a (deterministic) sequence of positive numbers such that $d_{n}=O\left(n^{\theta}\right)$ for some $\theta\geq 0$ . Let $X=(X_{n})$ be a sequence of positive independent random variables satisfying the following two conditions:

(i)

there exists $\alpha>0$ such that $\sum_{n=1}^{\infty}\operatorname{\mathbb{P}}\left(|\log X_{n}|>n^{\alpha}\right)<\infty;$ 2. (ii)

there exists a sequence of nonnegative numbers $(r_{n})$ , with $r_{n}=O(n^{-\beta})$ for some $\beta>0$ , and their exist four constants $c_{1},c_{2},\gamma,\delta>0$ , such that for $n$ large enough and for each $h\in\mathbf{N}^{*}$ , we have

[TABLE]

Then there exist an integrable random variable $C_{0}$ and a constant $c_{0}$ such that, for any $N\geq 1$ , we have $\omega-\operatorname{\mathbb{P}}a.s.$

[TABLE]

where $X^{(d)}(\omega)=(X_{n}^{d_{n}}(\omega))$ .

The above theorem is obvious if the upper bound does not converge to 0. However, if $\delta\theta<\beta$ , it provides a non-trivial estimate for the discrepancy when $d_{n}$ goes to infinity at a rate at most polynomial. As a consequence, we obtain the following result.

Corollary 2.

Let $(d_{n})$ be such that $d_{n}=O\left(n^{\theta}\right)$ for some $\theta>0$ and $d_{n}\underset{n\rightarrow\infty}{\longrightarrow}\infty$ . Assume that $X=(X_{n})$ satisfies the assumptions (i) and (ii) for some $\alpha,\beta,\gamma,\delta>0$ , with $\delta\theta<\beta$ . Then $\overset{\sim}{D}_{N}(X^{d}(\omega))$ converges $\omega-\operatorname{\mathbb{P}}$ a.s. to 0, at a rate of convergence provided in Theorem 1. In particular, the sequence $(X_{n}^{d}(\omega))$ is a.s. Benford.

In particular, if $X=(X_{n})$ and $(d_{n})$ satisfy the assumptions of Corollary 2, with the more restrictive condition $d_{n}=O\left(n^{\sigma}\right)$ for each $\sigma>0$ , then the discrepancy of $X^{(d)}(\omega)$ can be bounded as follows:

[TABLE]

It is rather surprising that $X^{(d)}(\omega)$ is a.s. Benford for a sequence $d=(d_{n})$ which converges arbitrarily slowly to infinity. On the opposite, it appears that for several classes of (deterministic) sequences $(x_{n})$ , the sequence $(x_{n}^{d_{n}})$ is Benford, when $(d_{n})$ converges to infinity at a rate at less polynomial (see e.g. Theorem 2 in [11]). As a second consequence of Theorem 1, the following corollary deals with the case where the sequence $(d_{n})$ is constant.

Corollary 3.

Let $d_{n}=d$ for each $n\geq 1$ and let $X=(X_{n})$ be such that the assumptions (i) and (ii) hold for some $\alpha,\beta,\gamma,\delta>0$ . Then there exist an integrable random variable $C_{0}(\omega)$ and a constant $c_{0}$ such that, for any $N\geq 1$ , we have $\omega-\operatorname{\mathbb{P}}a.s.$

[TABLE]

where $X^{d}(\omega)=(X_{n}^{d}(\omega))$ .

In particular, as $d$ goes to infinity, the sequence $X^{d}=(X_{n}^{d})$ tends to be a.s. Benford in the sense that its discrepancy converges to 0 as $d,N\rightarrow\infty$ . In a different context, such a convergence was already observed in Theorem 1 in [7], in which it is stated that two (deterministic) sequences at a large power tend to be Benford.

The assumption (i) of Theorem 1 is few restrictive. Indeed, thanks to the Markov’s inequality, such a condition is satisfied when $\operatorname{\mathbb{E}}\left[\,X_{n}\,\right]$ and $\operatorname{\mathbb{E}}\left[\,X_{n}^{-1}\,\right]$ are negligible compared to $n^{-1-\epsilon}e^{n^{\alpha}}$ for some $\alpha,\epsilon>0$ . The assumption (ii) of Theorem 1 is in a way classical and is discussed in Remark 1.

Our paper is organized as follows. In Section 2, we prove Theorem 1. This result is illustrated through several examples of standard distributions in Section 3. These examples deal with discrete and continuous random variables respectively. In the rest of the paper, we denote by $c$ a generic constant which is independent of $\omega$ , $N$ and $(d_{n})$ , but which may depend on other quantities.

2 Proof of Theorem 1

To prove Theorem 1, we apply two well-known inequalities. The first one deals with the discrepancy and is referred to as the Erdös-Turán inequality (see e.g. [RT]).

Theorem 4.

(Erdös-Turán inequality) Let $x=(x_{n})$ be a sequence of real numbers and let $N\geq 1$ . Then, for every integer $H\geq 1$ , we have

[TABLE]

The second inequality which we apply gives a deviation beween a sum of unit random complex numbers and the expectation of this sum. Such a result is due to Cohen and Cuny (Theorem 4.10 in [4]) and is re-written in our context.

Theorem 5.

*(Cohen & Cuny, 2006)

Let $(Y_{n})$ be a sequence of independent random variables, with values in $\mathbf{R}$ . Assume that there exists $\eta>0$ , such that $\sum_{n=1}^{\infty}\operatorname{\mathbb{P}}\left(|Y_{n}|>n^{\eta}\right)<\infty$ . Let $(a_{n})$ be a sequence of complex numbers. Then there exist universal constants $\epsilon>0$ and $C>0$ , such that*

[TABLE]

In the rest of the paper, with a slight abuse of notation, we omit the dependence in $\omega$ , e.g. we write $\overset{\sim}{D}_{N}(X^{(d)})$ instead of $\overset{\sim}{D}_{N}(X^{(d)}(\omega))$ . We are now prepared to prove our first theorem. Proof of Theorem 1. According to the Erdös-Turán inequality, we have for any $H\geq 1$ ,

[TABLE]

Hence,

[TABLE]

First, we provide an upper bound for the term on the bottom. To do it, we take $a_{n}=1$ , $Y_{n}=\log_{10}X_{n}^{d_{n}}$ and $K=1$ . Since $d_{n}=O(n^{\theta})$ , we obtain for $n$ large enough that $\operatorname{\mathbb{P}}\left(|Y_{n}|>n^{\eta}\right)\leq\operatorname{\mathbb{P}}\left(|\log X_{n}|>n^{\alpha}\right)$ with $\eta>\alpha+\theta$ . Hence, according to the assumption (i), we have $\sum_{n=1}^{\infty}\operatorname{\mathbb{P}}\left(|Y_{n}|>n^{\eta}\right)<\infty$ . It follows from Theorem 5 that

[TABLE]

In particular, there exists an integrable random variable $c(\omega)$ such that, for any $N\geq 2$ , $T\geq 1$ , $|t|\leq T$ we have $\omega-\operatorname{\mathbb{P}}a.s.$

[TABLE]

Notice that we have considered a sum over $n=1,\ldots,N$ and not over $n=2,\ldots,N$ in the above equation because $\left|e^{2i\pi t\log_{10}X_{1}}-\operatorname{\mathbb{E}}\left[\,e^{2i\pi t\log_{10}X_{1}}\,\right]\right|\leq 2$ . By taking $T=H$ and $t=h$ , we obtain for any $N\geq 1,H\geq 1$ that

[TABLE]

Secondly, we provide an upper bound for the second term in the right-hand side in (2). To do it, let $N_{0}$ be such that the inequality (1) holds for each $N\geq N_{0}$ . Then

[TABLE]

Bounding $\left|\operatorname{\mathbb{E}}\left[\,e^{2i\pi hd_{n}\log_{10}X_{n}}\,\right]\right|$ by 1 in the first sum and applying the inequality (1) in the second sum for the right-hand side, we get

[TABLE]

Besides, $\sum_{h=1}^{H}\frac{1}{h}\leq c\log H$ , $\sum_{h=1}^{H}\frac{1}{h^{1+\gamma}}\leq c$ and $\sum_{h=1}^{H}\frac{1}{h^{1-\delta}}\leq cH^{\delta}$ . This implies that

[TABLE]

Since $d_{n}=O\left(n^{\theta}\right)$ and $r_{n}=O\left(n^{-\beta}\right)$ , we have $\frac{1}{N}\sum_{n=1}^{N}(d_{n})^{\delta}r_{n}\leq c\cdot\log N\cdot N^{-1}$ if $\beta-\delta\theta=1$ and $\frac{1}{N}\sum_{n=1}^{N}(d_{n})^{\delta}r_{n}\leq c\cdot N^{-\min\{\beta-\delta\theta,1\}}$ otherwise. This together with (2) and (3) implies that

[TABLE]

Optimizing the right-hand side over $H\geq 1$ , we conclude the proof of Theorem 1 by taking

[TABLE]

$\square$

Remark 1.

The assumption given in Equation (1) has been chosen in such a way that it holds when $X_{n}$ follows the (discrete) uniform distribution on $\{1,\ldots,n\}$ . Indeed, in this case, we have

[TABLE]

According to the Van der Corput’s theorem (see e.g. [9], p17), this shows that

[TABLE]

In particular, this satisfies Equation (1) with $\gamma=\frac{1}{2}$ , $\delta=1$ and $r_{n}=\frac{1}{\sqrt{n}}$ . However, our assumption (ii) and our assumption on the independence of the random variables $X_{n}$ remain restrictive. We hope, in a future paper, to extent Theorem 1 with more general conditions.

Remark 2.

The main tool to derive the rate of the discrepancy is contained in Theorem 5. Besides, as a consequence of Corollary 3, we deduce that $\omega-\operatorname{\mathbb{P}}a.s.$

[TABLE]

In particular, when $d$ is large, the sequence $X^{d}=(X_{n}^{d})$ tends to be a Benford sequence. However, Theorem 5 is not necessary to derive Equation (4) because the latter can be proved directly by standard arguments. Indeed, it follows from the law of large numbers (for independent non-stationary random variables) and the Erdös-Turán inequality that for all fixed $H\geq 1$ ,

[TABLE]

Besides, according to (1), we know that

[TABLE]

Hence, by taking $H\rightarrow\infty$ , this proves that $\lim_{d\rightarrow\infty}\limsup_{N\rightarrow\infty}\overset{\sim}{D}_{N}(X^{d})=0$ . However, the main contribution of our paper is to provide an explicit rate of convergence for the discrepancy of $X^{d}$ as $d$ goes to infinity.

3 Examples

In this section, we give several examples of sequences of random variables satisfying the assumptions (i) and (ii) of Theorem 1. Our examples deal with discrete and continuous random variables respectively.

3.1 Discrete random variables

The following proposition provides sufficient conditions for discrete random variables to ensure that the assumption (ii) of Theorem 1 is satisfied for $\gamma=\delta=1$ .

Proposition 6.

Let $(X_{n})$ be a sequence of random variables with finite expectation and such that $X_{n}\geq 1$ a.s.. Assume that there exists a sequence of modes $(m_{n})$ such that the sequences $(\operatorname{\mathbb{P}}\left(X_{n}=k\right))_{k\leq m_{n}}$ and $(\operatorname{\mathbb{P}}\left(X_{n}=k\right))_{k>m_{n}}$ are non-decreasing and non-increasing respectively. Moreover, assume that for some $\beta>0$ one of the two following cases is satisfied:

•

Case 1:* $m_{n}\cdot n^{-\beta}\underset{n\rightarrow\infty}{\longrightarrow}\infty$ and $\sup_{n\geq 1}m_{n}\operatorname{\mathbb{P}}\left(X_{n}=m_{n}\right)<\infty$ ;*

•

Case 2:* $\sup_{n\geq 1}m_{n}<\infty$ , $\operatorname{\mathbb{P}}\left(X_{n}=m_{n}\right)=O\left(n^{-\beta}\right)$ and $\operatorname{\mathbb{E}}\left[\,\frac{1}{X_{n}}\,\right]=O\left(n^{-\beta}\right)$ .*

Then for $n$ large enough and for each $h\geq 1$ , we have:

[TABLE]

where $c_{1},c_{2}$ are two constants.

Proof of Proposition 6. First, we provide a generic upper bound for $\operatorname{\mathbb{E}}\left[\,e^{2i\pi h\log X_{n}}\,\right]$ which is independent of the two above cases. Then we deduce a specific upper bound for this expectation which depends this time on the case which is considered.

To do it, we write $\operatorname{\mathbb{E}}\left[\,e^{2i\pi h\log X_{n}}\,\right]=\lim_{N\rightarrow\infty}\sum_{k=1}^{N}e^{2i\pi h\log k}\operatorname{\mathbb{P}}\left(X_{n}=k\right)$ . Let $N\geq 1$ be fixed. It follows from the Abel transformation that

[TABLE]

Since $\left|\operatorname{\mathbb{P}}\left(X_{n}=N+1\right)\sum_{j=1}^{N}e^{2i\pi h\log j}\right|\leq N\operatorname{\mathbb{P}}\left(X_{n}=N+1\right)$ converges to 0 as $N$ goes to infinity (because $\operatorname{\mathbb{E}}\left[\,X_{n}\,\right]<\infty$ ), it is enough prove that

[TABLE]

for some constants $c_{1},c_{2}$ . To do it, we apply the following lemma.

Lemma 7.

For each $h\geq 1$ , $k\geq 1$ , we have

[TABLE]

Proof of Lemma 7. First, we notice that

[TABLE]

where $R_{k}(f)\mathrel{\mathop{\mathchar 58\relax}}=\sum_{j=0}^{k-1}\int_{\frac{j}{k}}^{\frac{j+1}{k}}f\left(\frac{j+1}{k}\right)\mathrm{d}t$ is the Riemann sum of the function $f\mathrel{\mathop{\mathchar 58\relax}}t\mapsto t^{2i\pi h}$ on $[0,1]$ with $n$ regular steps of length $n^{-1}$ . Hence

[TABLE]

where the second inequality comes from the fact that $\int_{0}^{1}f(t)\mathrm{d}t=\frac{1}{2i\pi h+1}$ . Besides,

[TABLE]

where the last line is a consequence of the mean value inequality. Integrating the right-hand side over $t$ , we get

[TABLE]

This concludes the proof of Lemma 7. $\square$ According to Lemma 7, we have

[TABLE]

Since the sequences $(\operatorname{\mathbb{P}}\left(X_{n}=k\right))_{k\leq m_{n}}$ and $(\operatorname{\mathbb{P}}\left(X_{n}=k\right))_{k\geq m_{n}}$ are non-decreasing and non-increasing respectively, we get

[TABLE]

With standard computations, we get:

[TABLE]

Using the fact that $\log\left(1+\frac{1}{k}\right)\operatorname{\mathbb{P}}\left(X_{n}=k+1\right)\leq\frac{1}{k}\operatorname{\mathbb{P}}\left(X_{n}=k\right)$ for each $k\geq m_{n}$ , we deduce that

[TABLE]

where

[TABLE]

and

[TABLE]

The inequality (6) is independent of the two cases considered in the assumptions of Proposition 6. Now, we deal with the terms $c_{1}$ and $s_{n}$ by discussing these two cases.

•

Case 1: if $m_{n}\cdot n^{-\beta}\underset{n\rightarrow\infty}{\longrightarrow}\infty$ for some $\beta>0$ and $\sup_{n\geq 1}m_{n}\operatorname{\mathbb{P}}\left(X_{n}=m_{n}\right)<\infty$ , we obtain that $c_{1}<\infty$ . Moreover, $s_{n}=O\left(n^{-\beta}\right)$ since $\log m_{n}=O(m_{n})$ and

[TABLE]

•

Case 2: if $\sup_{n\geq 1}m_{n}<\infty$ , $\operatorname{\mathbb{P}}\left(X_{n}=m_{n}\right)=O\left(n^{-\beta}\right)$ and $\operatorname{\mathbb{E}}\left[\,\frac{1}{X_{n}}\,\right]=O\left(n^{-\beta}\right)$ for some $\beta>0$ , we also obtain that $c_{1}<\infty$ and $s_{n}=O\left(n^{-\beta}\right)$ .

This concludes the proof of Proposition 6. $\square$ We give below three examples of sequences of random variables $X=(X_{n})$ by checking the assumption (i) of Theorem 1 and one of the two cases of Proposition 6. According to Theorem 1 and Proposition 6, the discrepancy for each example can be bounded as follows:

[TABLE]

In particular, if $(d_{n})\rightarrow\infty$ with $d_{n}=O\left(n^{\theta}\right)$ and $\theta>\beta$ , the sequence $X^{(d)}=(X_{n}^{d_{n}})$ is a.s. Benford.

Example 1.

Assume that $X_{n}$ has a geometric distribution with parameter $p_{n}=O\left(n^{-\beta}\right)$ . Here $m_{n}=1$ , so that $\operatorname{\mathbb{P}}\left(X_{n}=1\right)=p_{n}=O\left(n^{-\beta}\right)$ . We also obtain the same order for $\operatorname{\mathbb{E}}\left[\,\frac{1}{X_{n}}\,\right]=-\frac{p_{n}}{1-p_{n}}\cdot\log(1-p_{n})$ . In particular, the third conditions of Case 2 are satisfied. Besides, if $p_{n}e^{n^{\alpha}}n^{-\alpha^{\prime}}\underset{n\rightarrow\infty}{\longrightarrow}\infty$ for some $\alpha>0,\alpha^{\prime}>1$ , the assumption (i) holds since

[TABLE]

according to the Markov’s inequality.

Example 2.

Let $X_{n}$ be a random variable with distribution $\operatorname{\mathbb{P}}\left(X_{n}=k\right)=\frac{\alpha_{n}}{(n+k)^{1+\epsilon}}$ , where $\alpha_{n}$ is the normalizing constant and $\epsilon>0$ . In particular, we have

[TABLE]

since

[TABLE]

Here $m_{n}=1$ and the third conditions of Case 2 are satisfied. Indeed, the first one is trivial and for the second one we have $\operatorname{\mathbb{P}}\left(X_{n}=1\right)=O\left(n^{-(1+\epsilon)}\right)$ . For the third condition, let $\beta<1$ . According to (7), we have $\frac{1}{k}\cdot\frac{\alpha_{n}\cdot n^{\beta}}{(n+k)^{1+\epsilon}}\leq\frac{\epsilon}{k(k+1)^{1-\beta}}$ . It follows from the dominated convergence theorem that

[TABLE]

This checks the third condition of Case 2 for each $\beta<1$ . Besides, the assumption (i) holds since for each $n\geq 1$ and for each $\alpha>0$ , we have

[TABLE]

Example 3.

Assume that $X_{n}$ has a (discrete) uniform distribution in $\{a_{n},\ldots,b_{n}\}$ , with $a_{n}<b_{n}$ , $b_{n}\cdot n^{-\beta}\rightarrow\infty$ for some $\beta>0$ , and $\limsup\frac{a_{n}}{b_{n}}<1$ . Here we take $m_{n}=b_{n}$ . The two conditions of Case 1 are satisfied. Indeed, the first one holds because $b_{n}\cdot n^{-\beta}\rightarrow\infty$ . The second one comes from the fact that $\limsup\frac{a_{n}}{b_{n}}<1$ and $m_{n}\operatorname{\mathbb{P}}\left(X_{n}=m_{n}\right)=\frac{b_{n}}{b_{n}-a_{n}+1}.$ Besides, a sufficient and few restrictive assumption on $b_{n}$ to ensure that the assumption (i) holds is: $b_{n}=O(e^{n^{\alpha}})$ for some $\alpha>0$ . Notice that if $\frac{a_{n}}{b_{n}}$ converges to 1, the random variables $X_{n}$ are asymptotically deterministic. It is not surprising that the property (b) cannot hold in this context since there exist deterministic sequences such that, at any power $d$ , the sequences are not Benford.

3.2 Continuous random variables

Let $X=(X_{n})$ be a sequence of random variables. We first state three properties which imply the assumption (ii) of Theorem 1 when they are simultaneously satisfied.

(a)

For any $n\geq 1$ , the density $f_{n}$ of $X_{n}$ exists and is a piecewise absolutely continuous function. In what follows, we denote by $k_{n}$ the number of sub-domains of $f_{n}$ and by $I_{n,j}\mathrel{\mathop{\mathchar 58\relax}}=[a_{n,j},b_{n,j}]$ the $j$ -th sub-domain, with $a_{n,j}\leq b_{n,j}\leq a_{n,j+1}$ for each $1\leq j\leq k_{n}-1$ . The $k_{n}$ -th interval is of the form $I_{n,k_{n}}=[a_{n,k_{n}},+\infty)$ . In particular, $f_{n}$ is a.e. differentiable on $\bigcup_{j=1}^{k_{n}}I_{n,j}$ and $f_{n}=0$ on the complement. 2. (b)

$\limsup_{N\rightarrow\infty}\sum_{j=1}^{k_{N}}\sup_{x\in I_{N,j}}|xf_{N}(x)|<\infty$ . 3. (c)

$\limsup_{N\rightarrow\infty}\sum_{j=1}^{k_{N}}\int_{I_{N,j}}|xf^{\prime}_{N}(x)|\mathrm{d}x<\infty$ .

Under the above assumptions, the following proposition ensures that the assumption (ii) of Theorem 1 holds, with $\gamma=1$ and $a_{n}=0$ for each $n\geq 1$ .

Proposition 8.

If the properties hold (a), (b) and (c) hold simultaneously, then for $n$ large enough and for each $h\in\mathbf{N}^{*}$ , we have $\left|\operatorname{\mathbb{E}}\left[\,e^{2i\pi h\log X_{n}}\,\right]\right|\leq c_{1}h^{-1}$ .

Proof of Proposition 8. It is enough to prove the following inequality:

[TABLE]

To do it, we assume without loss of generality that $k_{n}=1$ for each $n$ , with $I_{n,j}=\mathrel{\mathop{\mathchar 58\relax}}I_{n}=[a_{n},b_{n}]$ . In particular, the density $f_{n}$ is absolutely continuous on $[a_{n},b_{n}]$ and equals 0 on the complement. This gives for any $N\geq 1,h\geq 1$

[TABLE]

In particular, we have $\limsup_{N\rightarrow\infty}\sup_{h\in\mathbf{N}^{*}}h\left|\operatorname{\mathbb{E}}\left[\,e^{2i\pi h\log X_{N}}\,\right]\right|<\infty$ provided that the three above properties hold. $\square$

Notice that if $g_{n}$ denotes the density of $X_{n}^{-1}$ , we can easily show that $g_{n}$ satisfies the above assumptions if and only if the ones are satisfied by the density of $X_{n}$ . This suggests that our assumptions are not very restrictive. We give below three examples of distributions of random variables which satisfy the assumption (i) of Theorem 1 and the three conditions (a), (b) and (c) of Proposition 8. According to Theorem 1 and Proposition 8, the discrepancy for each example can be bounded as follows:

[TABLE]

To obtain the rate of the discrepancy, we have taken $\delta=1$ and $\beta\rightarrow\infty$ . In particular, if $(d_{n})\rightarrow\infty$ with $d_{n}=O\left(n^{\theta}\right)$ for some $\theta>0$ , the sequence $X^{(d)}=(X_{n}^{d_{n}})$ is a.s. Benford.

Example 4.

If $X_{n}$ has an exponential distribution with parameter $\lambda_{n}>0$ , the properties (a), (b) and (c) hold simultaneously, with $k_{n}=1$ . Indeed, the first one is trivially satisfied and for the second and the third ones, we get:

[TABLE]

Besides, for each $\alpha>0$ , we have

[TABLE]

Hence the assumption (i) is satisfied if there exists $\alpha^{\prime}$ such that $\lambda_{n}e^{n^{\alpha^{\prime}}}\underset{n\rightarrow\infty}{\longrightarrow}\infty$ and $\lambda_{n}e^{-n^{\alpha^{\prime}}}\underset{n\rightarrow\infty}{\longrightarrow}0$ .

Example 5.

Assume that $X_{n}$ has a standard Fréchet distribution with parameter $\alpha_{n}>0$ , i.e. $\operatorname{\mathbb{P}}\left(X_{n}\leq x\right)=e^{-x^{-\alpha_{n}}}$ if $x\geq 0$ and $\operatorname{\mathbb{P}}\left(X_{n}\leq x\right)=0$ otherwise. The property (a) holds. Moreover, if $\inf_{n\geq 1}\alpha_{n}>0$ and $\sup_{n\geq 1}\alpha_{n}<\infty$ , we can easily prove that the properties (b) and (c) are satisfied. Besides, the assumption (i) is also satisfied since for each $\alpha>0$ , we have

[TABLE]

where the right-hand side is the term of a convergent series.

Example 6.

If $X_{n}$ has a (continuous) uniform distribution on $[a_{n},b_{n}]$ , with $a_{n}<b_{n}$ , the properties (a) and (c) hold. Moreover, the property (b) is satisfied when $\limsup\frac{a_{n}}{b_{n}}<1$ . Besides, a sufficient and few restrictive assumption on $a_{n},b_{n}$ to ensure that the assumption (i) holds is: $e^{-n^{\alpha}}=O(a_{n})$ and $b_{n}=O(e^{n^{\alpha}})$ for some $\alpha>0$ . Unsurprisingly, the assumptions on $b_{n}$ are very similar to those considered for a (discrete) uniform distribution.

3.3 A numerical illustration

In this section, we give a numerical illustration of a sequence of independent random variables $(X_{n})$ such that $(X_{n}^{d})$ is almost a Benford sequence. For each $n$ , the distribution of $X_{n}$ is assumed to be the (continuous) uniform distribution on $[1,n]$ . This sequence satisfies the assumptions of Theorem 1 (see Example 6). In Table 1, we provide the frequencies of the first significant digit of $X_{1}^{d},\ldots,X_{N}^{d}$ , with $N=1000$ and $d=2$ . It appears that the distribution of frequencies of $(X_{n}^{d})$ is close to the Benford’s law.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] F. Benford. The law of anomalous numbers. Proceedings of the American Philosophical Society , (78): 551–572, 1938.
2[2] A. Berger, T.P. Hill. An introduction to Benford’s law. Princeton University Press, Princeton, NJ , 2015.
3[3] N. Chenavier, B. Massé, and D. Schneider. Products of random variables and the first digit phenomenon, available in https://arxiv.org/abs/1512.06049 , 2015
4[4] G. Cohen, C. Cuny. On random almost periodic series and random ergodic theory. Ergodic Theory Dynam. Systems , (26): 683–709, 2006
5[5] D. I. A. Cohen, T. M. Katz. Prime numbers and the first digit phenomenon. J. Number Theory , (18): 261–268, 1984
6[6] P. Diaconis. The distribution of leading digits and uniform distribution mod mod {\rm mod} 1 1 1 . Ann. Probability , (5): 72–81, 1977
7[7] D. Eliahou, B. Massé, D. Schneider. On the mantissa distribution of powers of natural and prime numbers. Acta Math. Hungar. , (139): 49–63, 2013
8[8] R. W. Hamming. On the distribution of numbers. Bell System Tech. J. , (49): 1609–1625, 1970

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

On the discrepancy of powers of random variables

Abstract

1 Introduction

Theorem 1**.**

Corollary 2**.**

Corollary 3**.**

2 Proof of Theorem 1

Theorem 4**.**

Theorem 5**.**

Remark 1**.**

Remark 2**.**

3 Examples

3.1 Discrete random variables

Proposition 6**.**

Lemma 7**.**

Example 1**.**

Example 2**.**

Example 3**.**

3.2 Continuous random variables

Proposition 8**.**

Example 4**.**

Example 5**.**

Example 6**.**

3.3 A numerical illustration

Theorem 1.

Corollary 2.

Corollary 3.

Theorem 4.

Theorem 5.

Remark 1.

Remark 2.

Proposition 6.

Lemma 7.

Example 1.

Example 2.

Example 3.

Proposition 8.

Example 4.

Example 5.

Example 6.