Error bounds for the normal approximation to the length of a Ewens   partition

Koji Tsukuda

arXiv:1904.01729·math.ST·March 29, 2022

Error bounds for the normal approximation to the length of a Ewens partition

Koji Tsukuda

PDF

TL;DR

This paper derives error bounds for the normal approximation of the Ewens partition length, revealing how the approximation accuracy varies across different asymptotic regimes as parameters grow large.

Contribution

It provides the first explicit error bounds for the normal approximation of Ewens partition lengths under various asymptotic conditions.

Findings

01

Error bounds depend on the asymptotic regime.

02

Normal approximation improves as n and θ grow large.

03

Decay rate of error varies with the relationship between n and θ.

Abstract

Let $K (= K_{n, θ})$ be a positive integer-valued random variable whose distribution is given by $P (K = x) = \overset{s}{ˉ} (n, x) θ^{x} / (θ)_{n}$ $(x = 1, \dots, n)$ , where $θ$ is a positive number, $n$ is a positive integer, $(θ)_{n} = θ (θ + 1) \dots (θ + n - 1)$ and $\overset{s}{ˉ} (n, x)$ is the coefficient of $θ^{x}$ in $(θ)_{n}$ for $x = 1, \dots, n$ . This formula describes the distribution of the length of a Ewens partition, which is a standard model of random partitions. As $n$ tends to infinity, $K$ asymptotically follows a normal distribution. Moreover, as $n$ and $θ$ simultaneously tend to infinity, if $n^{2} / θ \to \infty$ , $K$ also asymptotically follows a normal distribution. In this paper, error bounds for the normal approximation are provided. The result shows that the decay rate of the error changes due to asymptotic regimes.

Equations172

P (K = x) = \frac{s ˉ ( n , x ) θ ^{x}}{( θ ) _{n}} (x = 1, \dots, n),

P (K = x) = \frac{s ˉ ( n , x ) θ ^{x}}{( θ ) _{n}} (x = 1, \dots, n),

E [K] = θ i = 1 \sum n \frac{1}{θ + i - 1}, var (K) = θ i = 1 \sum n \frac{i - 1}{( θ + i - 1 ) ^{2}},

E [K] = θ i = 1 \sum n \frac{1}{θ + i - 1}, var (K) = θ i = 1 \sum n \frac{i - 1}{( θ + i - 1 ) ^{2}},

E [K] \sim var (K) \sim θ lo g n

E [K] \sim var (K) \sim θ lo g n

Z_{n, θ} = \frac{K - θ lo g n}{θ lo g n}

Z_{n, θ} = \frac{K - θ lo g n}{θ lo g n}

∥ f ∥_{\infty} = x \in R sup ∣ f (x) ∣

∥ f ∥_{\infty} = x \in R sup ∣ f (x) ∣

X_{n, θ} = \frac{K - μ _{0}}{σ _{0}} and Y_{n, θ} = \frac{K - μ _{T}}{σ _{T}},

X_{n, θ} = \frac{K - μ _{0}}{σ _{0}} and Y_{n, θ} = \frac{K - μ _{T}}{σ _{T}},

μ_{0} = E [K], σ_{0}^{2} = var (K),

μ_{0} = E [K], σ_{0}^{2} = var (K),

μ_{T} = θ lo g (1 + \frac{n}{θ}), and σ_{T}^{2} = θ (lo g (1 + \frac{n}{θ}) + \frac{θ}{n + θ} - 1) .

μ_{T} = θ lo g (1 + \frac{n}{θ}), and σ_{T}^{2} = θ (lo g (1 + \frac{n}{θ}) + \frac{θ}{n + θ} - 1) .

lo g (1 + x) - 2 + \frac{3}{x + 1} - \frac{1}{( x + 1 ) ^{2}} = 0.

lo g (1 + x) - 2 + \frac{3}{x + 1} - \frac{1}{( x + 1 ) ^{2}} = 0.

θ (lo g (1 + \frac{n}{θ}) - 1 + \frac{θ}{n + θ}) + \frac{n}{2 ( θ + n )} - 1 > 0

θ (lo g (1 + \frac{n}{θ}) - 1 + \frac{θ}{n + θ}) + \frac{n}{2 ( θ + n )} - 1 > 0

∥ F_{n, θ} - Φ ∥_{\infty} \leq C γ_{1}

∥ F_{n, θ} - Φ ∥_{\infty} \leq C γ_{1}

γ_{1} = \frac{θ { lo g ( 1 + \frac{n}{θ} ) - \frac{5}{3} + \frac{3 θ}{n + θ} - \frac{2 θ ^{2}}{( n + θ ) ^{2}} + \frac{2 θ ^{3}}{3 ( n + θ ) ^{3}} } + 4 + \frac{n}{n + θ}}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{2 ( θ + n )} - 1 } ^{3/2}} .

γ_{1} = \frac{θ { lo g ( 1 + \frac{n}{θ} ) - \frac{5}{3} + \frac{3 θ}{n + θ} - \frac{2 θ ^{2}}{( n + θ ) ^{2}} + \frac{2 θ ^{3}}{3 ( n + θ ) ^{3}} } + 4 + \frac{n}{n + θ}}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{2 ( θ + n )} - 1 } ^{3/2}} .

∥ F_{n, θ} - Φ ∥_{\infty}

∥ F_{n, θ} - Φ ∥_{\infty}

∥ G_{n, θ} - Φ ∥_{\infty} = ⎩ ⎨ ⎧ O (1/ θ lo g (n / θ)) O (1/ θ) O (1/ n^{2} / θ) (Case A), (Case B), (Case C1) .

∥ G_{n, θ} - Φ ∥_{\infty} = ⎩ ⎨ ⎧ O (1/ θ lo g (n / θ)) O (1/ θ) O (1/ n^{2} / θ) (Case A), (Case B), (Case C1) .

θ {lo g (1 + \frac{n}{θ}) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}}} - 3 + \frac{n}{2 ( n + θ )} > 0.

θ {lo g (1 + \frac{n}{θ}) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}}} - 3 + \frac{n}{2 ( n + θ )} > 0.

∥ F_{n, θ} - Φ ∥_{\infty} \geq \frac{γ _{2}}{D} - γ_{3}

∥ F_{n, θ} - Φ ∥_{\infty} \geq \frac{γ _{2}}{D} - γ_{3}

γ_{2} = \frac{θ { lo g ( 1 + \frac{n}{θ} ) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}} } - 3 + \frac{n}{2 ( n + θ )}}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{θ + n} } ^{3/2}},

γ_{2} = \frac{θ { lo g ( 1 + \frac{n}{θ} ) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}} } - 3 + \frac{n}{2 ( n + θ )}}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{θ + n} } ^{3/2}},

γ_{3} = \frac{θ { \frac{1}{3} - \frac{θ}{n + θ} + \frac{θ ^{2}}{( n + θ ) ^{2}} - \frac{θ ^{3}}{3 ( n + θ ) ^{3}} } + 2}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{2 ( θ + n )} - 1 } ^{2}} .

γ_{3} = \frac{θ { \frac{1}{3} - \frac{θ}{n + θ} + \frac{θ ^{2}}{( n + θ ) ^{2}} - \frac{θ ^{3}}{3 ( n + θ ) ^{3}} } + 2}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{2 ( θ + n )} - 1 } ^{2}} .

θ {lo g (1 + \frac{n}{θ}) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}}} + 2 + \frac{n}{n + θ} < 0.

θ {lo g (1 + \frac{n}{θ}) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}}} + 2 + \frac{n}{n + θ} < 0.

∥ F_{n, θ} - Φ ∥_{\infty} \geq \frac{γ _{4}}{D} - γ_{3}

∥ F_{n, θ} - Φ ∥_{\infty} \geq \frac{γ _{4}}{D} - γ_{3}

γ_{4} = \frac{- [ θ { lo g ( 1 + \frac{n}{θ} ) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}} } + 2 + \frac{n}{n + θ} ]}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{θ + n} } ^{3/2}} .

γ_{4} = \frac{- [ θ { lo g ( 1 + \frac{n}{θ} ) - 2 + \frac{3 θ}{n + θ} - \frac{θ ^{2}}{( n + θ ) ^{2}} } + 2 + \frac{n}{n + θ} ]}{{ θ ( lo g ( 1 + \frac{n}{θ} ) - 1 + \frac{θ}{n + θ} ) + \frac{n}{θ + n} } ^{3/2}} .

∥ F_{n, θ} - Φ ∥_{\infty} ≍ ⎩ ⎨ ⎧ 1/ θ lo g (n / θ) 1/ θ 1/ n^{2} / θ (Case A), (Case B^{⋆}), (Case C1) .

∥ F_{n, θ} - Φ ∥_{\infty} ≍ ⎩ ⎨ ⎧ 1/ θ lo g (n / θ) 1/ θ 1/ n^{2} / θ (Case A), (Case B^{⋆}), (Case C1) .

P (ξ_{i} = 1) = p_{i} = \frac{θ}{θ + i - 1}, P (ξ_{i} = 0) = 1 - p_{i} (i = 1, 2, \dots) .

P (ξ_{i} = 1) = p_{i} = \frac{θ}{θ + i - 1}, P (ξ_{i} = 0) = 1 - p_{i} (i = 1, 2, \dots) .

P (K = x) = P (i = 1 \sum n ξ_{i} = x) (x = 1, \dots, n),

P (K = x) = P (i = 1 \sum n ξ_{i} = x) (x = 1, \dots, n),

i = 1 \sum n E [∣ ξ_{i} - p_{i} ∣^{2}] = θ i = 1 \sum n \frac{1}{θ + i - 1} - θ^{2} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{2}}

i = 1 \sum n E [∣ ξ_{i} - p_{i} ∣^{2}] = θ i = 1 \sum n \frac{1}{θ + i - 1} - θ^{2} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{2}}

i = 1 \sum n E [∣ ξ_{i} - p_{i} ∣^{3}]

i = 1 \sum n E [∣ ξ_{i} - p_{i} ∣^{3}]

i = 1 \sum n E [(ξ_{i} - p_{i})^{3}] = θ i = 1 \sum n \frac{1}{θ + i - 1} - 3 θ^{2} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{2}} + 2 θ^{3} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{3}}

i = 1 \sum n E [(ξ_{i} - p_{i})^{3}] = θ i = 1 \sum n \frac{1}{θ + i - 1} - 3 θ^{2} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{2}} + 2 θ^{3} i = 1 \sum n \frac{1}{( θ + i - 1 ) ^{3}}

i = 1 \sum n (E [∣ ξ_{i} - p_{i} ∣^{2}])^{2}

i = 1 \sum n (E [∣ ξ_{i} - p_{i} ∣^{2}])^{2}

i = 1 \sum n E [(ξ_{i} - p_{i})^{m}]

i = 1 \sum n E [(ξ_{i} - p_{i})^{m}]

θ (lo g (1 + \frac{n}{θ}) - 1 + \frac{θ}{n + θ}) + \frac{n}{2 ( θ + n )} - 1

θ (lo g (1 + \frac{n}{θ}) - 1 + \frac{θ}{n + θ}) + \frac{n}{2 ( θ + n )} - 1

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Error bounds for the normal approximation to

the length of a Ewens partition††thanks: This work was partly supported by Japan Society for the Promotion of Science KAKENHI Grant Number 16H02791, 18K13454.

Koji Tsukuda Graduate School of Arts and Sciences, The University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo 153-8902, Japan.

Abstract

Let $K(=K_{n,\theta})$ be a positive integer-valued random variable whose distribution is given by ${\rm P}(K=x)=\bar{s}(n,x)\theta^{x}/(\theta)_{n}$ $(x=1,\ldots,n)$ , where $\theta$ is a positive number, $n$ is a positive integer, $(\theta)_{n}=\theta(\theta+1)\cdots(\theta+n-1)$ and $\bar{s}(n,x)$ is the coefficient of $\theta^{x}$ in $(\theta)_{n}$ for $x=1,\ldots,n$ . This formula describes the distribution of the length of a Ewens partition, which is a standard model of random partitions. As $n$ tends to infinity, $K$ asymptotically follows a normal distribution. Moreover, as $n$ and $\theta$ simultaneously tend to infinity, if $n^{2}/\theta\to\infty$ , $K$ also asymptotically follows a normal distribution. In this paper, error bounds for the normal approximation are provided. The result shows that the decay rate of the error changes due to asymptotic regimes.

1 Introduction

Consider a nonnegative integer-valued random variable $K(=K_{n,\theta})$ that follows

[TABLE]

where $\theta$ is a positive value, $n$ is a positive integer, $(\theta)_{n}=\theta(\theta+1)\cdots(\theta+n-1)$ and $\bar{s}(n,x)$ is the coefficient of $\theta^{x}$ in $(\theta)_{n}$ . This distribution is known as the falling factorial distribution (Watterson, 1974a, equation (2.22)), STR1F (i.e., the Stirling family of distributions with finite support related to the Stirling number of the first kind) (Sibuya, 1986, 1988), and the Ewens distribution (Kabluchko, Marynych and Sulzbach, 2016). The formula (1.1) describes the distribution of the length of a Ewens partition, which is a standard model of random partitions. A random partition is called a Ewens partition when the distribution of the partition is given by the Ewens sampling formula. The Ewens sampling formula and (1.1) appear in a lot of scientific fields and have been extensively studied; see, e.g., Johnson, Kotz and Balakrishnan (1997, Chapter 41) or Crane (2016). In the context of population genetics, (1.1) was discussed in Ewens (1972) as the distribution of the number of allelic types included in a sample of size $n$ from the infinitely-many neutral allele model with scaled mutation rate $\theta$ ; see also Durrett (2008, Section 1.3). Moreover, in the context of nonparametric Bayesian inference, (1.1) describes the law of the number of distinct values in a sample from the Dirichlet process; see, e.g., Ghosal and van der Vaart (2017, Section 4.1). Furthermore, as introduced in Sibuya (1986), (1.1) relates to several statistical or combinatorial topics such as permutations, sequential rank order statistics and binary search trees.

Simple calculations imply that

[TABLE]

and

[TABLE]

as $n\to\infty$ . Let $\tilde{F}_{n,\theta}(\cdot)$ be the distribution function of the random variable

[TABLE]

standardized by the leading terms of the mean and variance, and $\Phi(\cdot)$ be the distribution function of the standard normal distribution. By calculating the moment generating function of $Z_{n,\theta}$ , Watterson (1974b) proved that $Z_{n,\theta}$ converges in distribution to the standard normal distribution; that is, $\tilde{F}_{n,\theta}(x)\to\Phi(x)$ as $n\to\infty$ for any $x\in\mathbb{R}$ . For the history concerning this result, we refer readers to Arratia and Tavaré (1992, Remark after Theorem 3). In particular, when $\theta=1$ Goncharov (1944) proved that $\tilde{F}_{n,1}(x)\to\Phi(x)$ for any $x\in\mathbb{R}$ . From a theoretical perspective, it is important to derive error bounds for the approximation. Yamato (2013) discussed the first-order Edgeworth expansion of $\tilde{F}_{n,\theta}(\cdot)$ via the Poisson approximation (Arratia and Tavaré, 1992, Remark after Theorem 3) and proved that $\|\tilde{F}_{n,\theta}-\Phi\|_{\infty}=O\left(1/\sqrt{\log{n}}\right)$ , where $\|\cdot\|_{\infty}$ is the $\ell^{\infty}$ -norm defined by

[TABLE]

for a bounded function $f:\mathbb{R}\ni x\mapsto f(x)$ . Note that when $\theta=1$ , Hwang (1998, Example 1) showed that $\|\tilde{F}_{n,1}-\Phi\|_{\infty}=O\left(1/\sqrt{\log{n}}\right)$ . Kabluchko, Marynych and Sulzbach (2016) derived the Edgeworth expansion of the probability function of $K$ , and provided the first-order Edgeworth expansion of $\tilde{F}_{n,\theta}(\cdot)$ .

As the standardization of $Z_{n,\theta}$ comes from (1.2), the normal approximation only works well when $n$ is sufficiently large with respect to $\theta$ . However, this assumption has limited validity in practical cases, so it is important to consider alternative standardized variables; see, e.g., Yamato (2013) and Yamato, Nomachi and Toda (2015). In particular, we consider the random variables $X_{n,\theta}$ and $Y_{n,\theta}$ defined by

[TABLE]

where

[TABLE]

These are standardized random variables that use the exact moments and approximate moments, respectively. Denote the distribution functions of $X_{n,\theta}$ and $Y_{n,\theta}$ by $F_{n,\theta}(\cdot)$ and $G_{n,\theta}(\cdot)$ , respectively. Then, Tsukuda (2017, Theorem 2 and Remark 6) proved that, under the asymptotic regime $n^{2}/\theta\to\infty$ and $\theta\not\to 0$ as $n\to\infty$ (see subsection 1.1 for the explicit assumptions), both $F_{n,\theta}(x)$ and $G_{n,\theta}(x)$ converge to $\Phi(x)$ as $n\to\infty$ for any $x\in\mathbb{R}$ . The problem considered in this paper is to provide upper and lower bounds for the approximation errors $\|F_{n,\theta}-\Phi\|_{\infty}$ and $\|G_{n,\theta}-\Phi\|_{\infty}$ .

Remark 1.

It holds that $\mu_{0}\sim\mu_{T}$ and $\sigma_{0}\sim\sigma_{T}$ as $n\to\infty$ with $n^{2}/\theta\to\infty$ .

1.1 Assumptions and asymptotic regimes

As explained in the Introduction, the regime $n\to\infty$ with fixed $\theta$ is sometimes unrealistic. Hence, we consider asymptotic regimes in which $\theta$ increases as $n$ increases. Such regimes have been discussed in Feng (2007, Section 4) and Tsukuda (2017, 2019). We follow these studies. In this subsection, let us summarize the assumptions on $n$ and $\theta$ .

First, $\theta$ is assumed to be nondecreasing with respect to $n$ . Moreover, when we take the limit operation, $n^{2}/\theta\to\infty$ is assumed.

The following asymptotic regimes are discussed in this paper:

•

Case A: $n/\theta\to\infty$

•

Case B: $n/\theta\to c$ , where $0<c<\infty$

•

Case C: $n/\theta\to 0$

•

Case C1: $n/\theta\to 0$ and $n^{2}/\theta\to\infty$

Remark 2.

Feng (2007)** was apparently the first to consider the asymptotic regimes in which $n$ and $\theta$ simultaneously tend to infinity. Specifically, Cases A, B, and C were considered by Feng (2007, Section 4). Case C1 was introduced by Tsukuda (2017).

Furthermore, let $c^{\star}$ be the unique positive root of the equation

[TABLE]

Then, we introduce a new regime, Case B⋆, as follows:

•

Case B⋆: $n/\theta\to c$ , where $0<c<\infty$ and $c\neq c^{\star}$ .

Remark 3.

Solving (1.3) numerically gives $c^{\star}=2.16258\cdots$ .

2 Main results

This section presents Theorems 2.1 and 2.4 which are the main results of this paper and their corollaries. Proofs of the results in this section are provided in Section 4.

2.1 An upper error bound

In this subsection, an upper bound for the error $\|F_{n,\theta}-\Phi\|_{\infty}$ is given in Theorem 2.1, and its convergence rate is given in Corollary 2.2. Moreover, the convergence rate of the upper bound for the error $\|G_{n,\theta}-\Phi\|_{\infty}$ is given in Corollary 2.3.

We now present the first main theorem of this paper.

Theorem 2.1.

Assume that there exists $n_{0}(=n_{0}(\theta))$ such that

[TABLE]

for all $n\geq n_{0}$ . Then, it holds that

[TABLE]

for all $n\geq n_{0}$ , where $C$ is a constant not larger than 0.5591 and

[TABLE]

Remark 4.

Under our asymptotic regime ( $n^{2}/\theta\to\infty$ ), (2.1) is valid for sufficiently large $n$ .

Remark 5.

The constant $C$ in Theorem 2.1 is the universal constant appearing in the Berry–Esseen theorem.

Theorem 2.1 and asymptotic evaluations of the numerator and denominator of $\gamma_{1}$ yield the following corollary.

Corollary 2.2.

In Cases A, B, and C1, it holds that

[TABLE]

Using Corollary 2.2, we can obtain the following convergence rate of the error bound for the normal approximation to $Y_{n,\theta}$ .

Corollary 2.3.

It holds that

[TABLE]

2.2 Evaluation of the decay rate

In this subsection, a lower bound for the error $\|F_{n,\theta}-\Phi\|_{\infty}$ is given in Theorem 2.4. Together with Theorem 2.1, this theorem yields the decay rate of $\|F_{n,\theta}-\Phi\|_{\infty}$ , as stated in Corollary 2.5.

We now present the second main theorem of this paper.

Theorem 2.4.

(i) Assume that there exists $n_{1}(=n_{1}(\theta))$ such that, for all $n\geq n_{1}$ , (2.1), ${\rm var}(K)\geq 1$ and

[TABLE]

Then, it holds that

[TABLE]

for all $n\geq n_{1}$ , where $D$ is some constant,

[TABLE]

and

[TABLE]

(ii) Assume that there exists $n_{2}(=n_{2}(\theta))$ such that, for all $n\geq n_{2}$ , (2.1), ${\rm var}(K)\geq 1$ and

[TABLE]

Then, it holds that

[TABLE]

for all $n\geq n_{2}$ , where $D$ is some constant, $\gamma_{3}$ is as defined in (2.5), and

[TABLE]

Remark 6.

Under our asymptotic regime ( $n^{2}/\theta\to\infty$ ), ${\rm var}(K)\geq 1$ is valid for sufficiently large $n$ . In Case A, (2.3) is valid for sufficiently large $n$ . In Case B⋆, if $c>c^{\star}$ then (2.3) is valid for sufficiently large $n$ , and if $c<c^{\star}$ then (2.6) is valid for sufficiently large $n$ . In Case C1, (2.6) is valid for sufficiently large $n$ .

Remark 7.

The constant $D$ in Theorem 2.4 is the universal constant introduced by Hall and Barbour (1984). Note that this constant was denoted as $C$ in their theorem.

As a corollary to Theorems 2.1 and 2.4, we can make the following statement regarding the decay rate of $\|F_{n,\theta}-\Phi\|_{\infty}$ .

Corollary 2.5.

It holds that

[TABLE]

3 Some preliminary results

3.1 A representation of $K$ by a Bernoulli sequence

Consider an independent Bernoulli random sequence $\{\xi_{i}\}_{i\geq 1}(=\{\xi_{i,\theta}\}_{i\geq 1})$ defined by

[TABLE]

Then,

[TABLE]

that is, $\mathcal{L}(K)$ equals $\mathcal{L}(\sum_{i=1}^{n}\xi_{i})$ ; see, e.g., Johnson, Kotz and Balakrishnan (1997, equation (41.12)) or Sibuya (1986, Proposition 2.1). By virtue of this relation, and after some preparation, we will prove the results presented in Section 2. To use the Berry–Esseen-type theorem for independent random sequences (see Lemma B.1), we will evaluate the sum of the second- and third-order absolute central moments of $\{\xi_{i}\}_{i=1}^{n}$ . That is, we will evaluate

[TABLE]

and

[TABLE]

To derive a lower bound result, we will evaluate

[TABLE]

and

[TABLE]

Remark 8.

It follows from the binomial theorem that

[TABLE]

for any $m=2,3,\ldots$ .

3.2 Evaluations for moments

In this subsection, we evaluate several sums of moments of $\{\xi_{i}\}_{i=1}^{n}$ .

Lemma 3.1.

(i) It holds that

[TABLE]

(ii) If $n^{2}/\theta\to\infty$ then it holds that

[TABLE]

(iii) In particular, it holds that

[TABLE]

Proof of Lemma 3.1.

(i) The desired inequality is an immediate consequence of (3.2) and Lemma A.1. (ii) As

[TABLE]

for any $x>0$ , it holds that

[TABLE]

whereas the remainder does not diverge to $\pm\infty$ . This implies the assertion. (iii) The assertion is a direct consequence of (ii) (for Case C, the result follows from the Taylor expansion of $\log\left(1+x\right)-1+1/(x+1)$ as $x\to 0$ ). ∎

Lemma 3.2.

(i) It holds that

[TABLE]

(ii) If $n^{2}/\theta\to\infty$ , then it holds that

[TABLE]

(iii) In particular, it holds that

[TABLE]

Proof of Lemma 3.2.

(i) The desired inequality is an immediate consequence of (LABEL:K3am) and Lemma A.1. (ii) As

[TABLE]

for any $x>0$ , it holds that

[TABLE]

whereas the remainder does not diverge to $\pm\infty$ . This implies the assertion. (iii) The assertion is a direct consequence of (ii) (for Case C, the result follows from the Taylor expansion of $\log\left(1+x\right)-5/3+3/(x+1)-2/(x+1)^{2}+2/\{3(x+1)^{3}\}$ as $x\to 0$ ). ∎

Lemma 3.3.

(i) It holds that

[TABLE]

(ii) In Case A, B⋆, or C, it holds that

[TABLE]

Proof of Lemma 3.3.

(i) The desired inequality is an immediate consequence of (3.4) and Lemma A.1. (ii) In Case A, the assertion holds because

[TABLE]

whereas the remainder does not diverge to $\pm\infty$ . In Case B⋆, the assertion holds because

[TABLE]

and $\theta\to\infty$ , whereas the remainder does not diverge to $\pm\infty$ . In Case C, the assertion holds because

[TABLE]

whereas the remainder terms do not diverge to $\pm\infty$ . ∎

Lemma 3.4.

It holds that

[TABLE]

Proof of Lemma 3.4.

The assertion is an immediate consequence of (LABEL:K2m2) and Lemma A.1. ∎

Remark 9.

*The asymptotic value of the RHS in (3.7) is given by $\theta/3$ (Case A),

$\theta\left[{1}/{3}-{1}/{(c+1)}+{1}/{(c+1)^{2}}-1/\{3(c+1)^{3}\}\right]$ (Case B), or ${n^{3}}/{(3\theta^{2})}+2$ (Case C).*

4 Proofs of the results in Section 2

4.1 Proof of the results in Subsection 2.1

In this subsection, we provide proofs of the results in Subsection 2.1.

Proof of Theorem 2.1.

Let $n$ be an arbitrary integer such that $n\geq n_{0}$ . From (3.1), Lemma B.1 yields that

[TABLE]

where $C$ is the constant appearing in Lemma B.1. Additionally, Lemmas 3.1-(i) and 3.2-(i) yield that

[TABLE]

∎

Proof of Corollary 2.2.

In Case A, B, or C1, it holds that

[TABLE]

Hence, Theorem 2.1, Lemmas 3.1 and 3.2 yield that

[TABLE]

This completes the proof. ∎

Proof of Corollary 2.3.

From

[TABLE]

and the triangle inequality, it follows that

[TABLE]

The first term on the RHS in (LABEL:pc2t1) is

[TABLE]

from Corollary 2.2. The second term on the RHS in (LABEL:pc2t1) is bounded above by

[TABLE]

from Lemma A.2-(i). This is because $|\mu_{T}-\mu_{0}|=O(1)$ (Lemma A.1) and $\sigma_{0}^{2}\sim\theta(\log(1+n/\theta)-1+\theta/(n+\theta))$ (Lemma 3.1). The third term of the RHS in (LABEL:pc2t1) is bounded above by

[TABLE]

from Lemma A.2-(ii). This is because, from $\sigma_{0}^{2}\geq 1\geq n/(n+\theta)$ for $n\geq n_{1}$ and

[TABLE]

(see Lemma 3.1-(i)), it follows that

[TABLE]

for $n\geq n_{1}$ . Note that the LHS and RHS of (4.2) are

[TABLE]

This completes the proof. ∎

4.2 Proof of the results in Subsection 2.2

In this subsection, we provide proofs of the results in Subsection 2.2.

Proof of Theorem 2.4.

(i) Let $n$ be an arbitrary integer such that $n\geq n_{1}$ . As $|\xi_{i}-p_{i}|<1\leq{\rm var}(K)$ for all $i=1,\ldots,n$ , (3.1) and Lemma B.2 yield that

[TABLE]

where $D$ is the constant appearing in Lemma B.2. Additionally, Lemmas 3.1-(i) and 3.3-(i) yield that

[TABLE]

Moreover, Lemmas 3.1-(i) and 3.4-(i) yield that

[TABLE]

This completes the proof of (i).

(ii) Let $n$ be an arbitrary integer such that $n\geq n_{2}$ . From the same reason as (i), (3.1) and Lemma B.2 yields (4.3). Additionally, Lemmas 3.1-(i) and 3.3-(i) yield that

[TABLE]

Moreover, Lemmas 3.1-(i) and 3.4-(i) yield (4.4). This completes the proof of (ii). ∎

Proof of Corollary 2.5.

In Case A, it follows from

[TABLE]

that

[TABLE]

Moreover, it holds that

[TABLE]

Hence, Corollary 2.2 and Theorem 2.4 yield the desired result in Case A.

In Case B⋆, it follows from

[TABLE]

that

[TABLE]

Moreover, it holds that

[TABLE]

As

[TABLE]

either $n_{2}$ or $n_{3}$ exist in Case B⋆. Hence, Corollary 2.2 and Theorem 2.4 yields the desired result in Case B⋆.

In Case C1, it follows from

[TABLE]

that

[TABLE]

Moreover, it holds that

[TABLE]

Hence, Corollary 2.2 and Theorem 2.4 yield the desired result in Case C1.

This completes the proof. ∎

5 Concluding remarks

In this paper, we evaluated the approximation errors $\|F_{n,\theta}-\Phi\|_{\infty}$ and $\|G_{n,\theta}-\Phi\|_{\infty}$ . Deriving decay rates for $\|F_{n,\theta}-\Phi\|_{\infty}$ when $n/\theta\to c^{\star}$ (i.e., Case B with $c=c^{\star}$ ) and for $\|G_{n,\theta}-\Phi\|_{\infty}$ is left for future research. Moreover, as normal approximations are refined by the Edgeworth expansion, it is also important to derive the Edgeworth expansion under our asymptotic regimes.

Appendix A Some evaluations

The following lemma is used in the main body.

Lemma A.1.

Let $\theta$ be a positive value and $n$ be a positive integer. (i) It holds that

[TABLE]

(ii) It holds that

[TABLE]

for any positive integer $k$ .

Proof.

For (i), see Tsukuda (2017, Proof of Proposition 1). For (ii), the conclusion follows from

[TABLE]

for any positive integer $k$ . This completes the proof. ∎

The next lemma provides some basic results on the standard normal distribution function.

Lemma A.2.

(i) For any $\alpha(\in\mathbb{R})$ , it holds that

[TABLE]

(ii) For any positive $\beta(\in\mathbb{R})$ , it holds that

[TABLE]

Proof.

(i) For some $\delta$ between 0 and $\alpha$ , it holds that

[TABLE]

(ii) As

[TABLE]

we prove the assertion for $\beta\geq 1$ and $\beta\leq 1$ , separately. First, we consider the case $\beta\geq 1$ . For $x=0$ , it holds that $|\Phi(\beta x)-\Phi(x)|=0$ . For $x>0$ ,

[TABLE]

For $x<0$ ,

[TABLE]

Next, we consider the case $(0<)\beta\leq 1$ . For $x=0$ , it holds that $|\Phi(\beta x)-\Phi(x)|=0$ . For $x>0$ ,

[TABLE]

For $x<0$ ,

[TABLE]

This completes the proof. ∎

Appendix B Error bounds for normal approximations

B.1 The Berry–Esseen-type theorem for independent sequences

In this subsection, we introduce the Berry–Esseen-type theorem for independent sequences. For further details, see Tyurin (2012).

Let $\{X_{i}\}_{i\geq 1}$ be a sequence of independent random variables, and ${\rm E}[X_{i}]=0$ , ${\rm E}[X_{i}^{2}]=\sigma_{i}^{2}(>0)$ , ${\rm E}[|X_{i}|^{3}]=\beta_{i}<\infty$ for all $i=1,2,\ldots$ . The quantity $\varepsilon_{n}={\sum_{i=1}^{n}\beta_{i}}/{(\sum_{i=1}^{n}\sigma_{i}^{2})^{3/2}}$ is called the Lyapunov fraction. We denote the distribution function of ${\sum_{i=1}^{n}X_{i}}/{(\sum_{i=1}^{n}\sigma_{i}^{2})^{1/2}}$ by $F^{X}_{n}$ . Then, the following result holds.

Lemma B.1 (Tyurin (2012)).

There exists a universal constant $C$ such that

[TABLE]

for all positive integers $n$ , where $C$ does not exceed $0.5591$ .

Remark 10.

Here, we introduce the result given by Tyurin (2012). There have been many studies in which Berry–Esseen-type results are derived; see, e.g., Chen, Goldstein and Shao (2011, Chapter 3).

B.2 Lower bound

In this subsection, we introduce the result given by Hall and Barbour (1984) that considers reversing the Berry–Esseen inequality.

Let $\{Y_{i}\}_{i\geq 1}$ be a sequence of independent random variables satisfying ${\rm E}[Y_{i}]=0$ and ${\rm E}[Y_{i}^{2}]=\sigma_{i}^{2}(>0)$ for all $i$ , and $\sum_{i=1}^{n}\sigma_{i}^{2}=1$ . We denote the distribution function of $\sum_{i=1}^{n}Y_{i}$ by $F^{Y}_{n}$ . Letting

[TABLE]

the following result holds.

Lemma B.2 (Hall and Barbour (1984)).

There exists a universal constant $D$ such that

[TABLE]

As

[TABLE]

we use the RHS as a lower bound. This bound is sufficient in Cases A, B⋆, and C1 to show the decay rate of $\|F_{n,\theta}-\Phi\|_{\infty}$ .

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Arratia and Tavaré (1992) Arratia, R.; Tavaré, S. (1992). Limit theorems for combinatorial structures via discrete process approximations. Random Structures Algorithms 3, no.3, 321–345.
2Chen, Goldstein and Shao (2011) Chen, L. H. Y.; Goldstein, L.; Shao, Q.-M. (2011). Normal approximation by Stein’s method . Probability and its Applications (New York). Springer, Heidelberg. xii+405 pp.
3Crane (2016) Crane, H. (2016). The ubiquitous Ewens sampling formula. Statist. Sci. 31, no.1, 1–19.
4Durrett (2008) Durrett, R. (2008). Probability models for DNA sequence evolution. Second edition. Probability and its Applications (New York). Springer, New York.
5Ewens (1972) Ewens, W. J. (1972). The sampling theory of selectively neutral alleles. Theoret. Population Biology 3, 87–112; erratum, ibid. 3 (1972), 240; erratum, ibid. 3 (1972), 376.
6Feng (2007) Feng, S. (2007). Large deviations associated with Poisson–Dirichlet distribution and Ewens sampling formula. Ann. Appl. Probab. 17, no. 5–6, 1570–1595.
7Ghosal and van der Vaart (2017) Ghosal, S.; van der Vaart, A. (2017). Fundamentals of nonparametric Bayesian inference. Cambridge Series in Statistical and Probabilistic Mathematics, 44. Cambridge University Press, Cambridge.
8Goncharov (1944) Goncharov, V. L. (1944). Some facts from combinatorics. Izv. Akad. Nauk SSSR , Ser. Mat. 8, 3–48.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Error bounds for the normal approximation to

Abstract

1 Introduction

Remark 1**.**

1.1 Assumptions and asymptotic regimes

Remark 2**.**

Remark 3**.**

2 Main results

2.1 An upper error bound

Theorem 2.1**.**

Remark 4**.**

Remark 5**.**

Corollary 2.2**.**

Corollary 2.3**.**

2.2 Evaluation of the decay rate

Theorem 2.4**.**

Remark 6**.**

Remark 7**.**

Corollary 2.5**.**

3 Some preliminary results

3.1 A representation of KKK by a Bernoulli sequence

Remark 8**.**

3.2 Evaluations for moments

Lemma 3.1**.**

Proof of Lemma 3.1.

Lemma 3.2**.**

Proof of Lemma 3.2.

Lemma 3.3**.**

Proof of Lemma 3.3.

Lemma 3.4**.**

Proof of Lemma 3.4.

Remark 9**.**

4 Proofs of the results in Section 2

4.1 Proof of the results in Subsection 2.1

Proof of Theorem 2.1.

Proof of Corollary 2.2.

Proof of Corollary 2.3.

4.2 Proof of the results in Subsection 2.2

Proof of Theorem 2.4.

Proof of Corollary 2.5.

5 Concluding remarks

Appendix A Some evaluations

Lemma A.1**.**

Proof.

Lemma A.2**.**

Proof.

Appendix B Error bounds for normal approximations

B.1 The Berry–Esseen-type theorem for independent sequences

Lemma B.1** (Tyurin (2012)).**

Remark 10**.**

B.2 Lower bound

Lemma B.2** (Hall and Barbour (1984)).**

Remark 1.

Remark 2.

Remark 3.

Theorem 2.1.

Remark 4.

Remark 5.

Corollary 2.2.

Corollary 2.3.

Theorem 2.4.

Remark 6.

Remark 7.

Corollary 2.5.

3.1 A representation of $K$ by a Bernoulli sequence

Remark 8.

Lemma 3.1.

Lemma 3.2.

Lemma 3.3.

Lemma 3.4.

Remark 9.

Lemma A.1.

Lemma A.2.

Lemma B.1 (Tyurin (2012)).

Remark 10.

Lemma B.2 (Hall and Barbour (1984)).