The impatient collector

Anis Amri (IECL); Philippe Chassaing (IECL)

arXiv:1906.11012·math.PR·June 27, 2019

The impatient collector

Anis Amri (IECL), Philippe Chassaing (IECL)

PDF

Open Access

TL;DR

This paper studies the shape of the completion curve in the coupon collector problem under the condition of completing the collection unusually quickly, and applies findings to automata theory.

Contribution

It introduces the asymptotic shape of the completion curve conditioned on rapid collection, extending classical results and deriving a new formula for automata analysis.

Findings

01

Characterizes the asymptotic completion curve under fast collection conditions

02

Provides a new derivation of Koršunov's formula for automata

03

Enhances understanding of collection dynamics under atypical scenarios

Abstract

In the coupon collector problem with $n$ items, the collector needs a random number of tries $T_{n} ≃ n ln n$ to complete the collection. Also, after $n t$ tries, the collector has secured approximately a fraction $ζ_{\infty} (t) = 1 - e^{- t}$ of the complete collection, so we call $ζ_{\infty}$ the (asymptotic) \emph{completion curve}. In this paper, for $ν > 0$ , we address the asymptotic shape $ζ (ν, .)$ of the completion curve under the condition $T_{n} \leq (1 + ν) n$ , i.e. assuming that the collection is \emph{completed unlikely fast}. As an application to the asymptotic study of complete accessible automata, we provide a new derivation of a formula due to Kor\v{s}unov.

Equations666

ζ_{\infty} (t) = 1 - e^{- t}

ζ_{\infty} (t) = 1 - e^{- t}

E [T_{n}] = n H_{n} \sim n ln n .

E [T_{n}] = n H_{n} \sim n ln n .

T_{n}

T_{n}

N = (1 + ν) n, ν = \frac{N - n}{n} .

N = (1 + ν) n, ν = \frac{N - n}{n} .

F (x) = exp (- 1 - x - W_{0} (- (1 + x) e^{- 1 - x})) .

F (x) = exp (- 1 - x - W_{0} (- (1 + x) e^{- 1 - x})) .

y^{'} = F (\frac{x - y}{y}), y (1 + ν) = 1.

y^{'} = F (\frac{x - y}{y}), y (1 + ν) = 1.

N, n lim P_{N, n} ([a, 1 + ν] sup ∣ ζ_{n} - ζ (ν, .) ∣ \geq ε) = 0.

N, n lim P_{N, n} ([a, 1 + ν] sup ∣ ζ_{n} - ζ (ν, .) ∣ \geq ε) = 0.

ω = (ω_{k})_{k \geq 1}

ω = (ω_{k})_{k \geq 1}

y_{ℓ} (ω) = # {ω_{k} \vline 1 \leq k \leq ℓ},

y_{ℓ} (ω) = # {ω_{k} \vline 1 \leq k \leq ℓ},

T_{k} (ω) = in f {ℓ \geq 1 \vline y_{ℓ} (ω) = k} .

T_{k} (ω) = in f {ℓ \geq 1 \vline y_{ℓ} (ω) = k} .

Y_{n} (t, ω) = y_{⌊ t ⌋} (ω), t \geq 0,

Y_{n} (t, ω) = y_{⌊ t ⌋} (ω), t \geq 0,

ζ_{n} (t, ω) = n^{- 1} Y_{n} (n t, ω) .

ζ_{n} (t, ω) = n^{- 1} Y_{n} (n t, ω) .

E_{n} [ζ_{n} (t)] = 1 - (1 - \frac{1}{n})^{⌊ n t ⌋} ≃ ζ_{\infty} (t),

E_{n} [ζ_{n} (t)] = 1 - (1 - \frac{1}{n})^{⌊ n t ⌋} ≃ ζ_{\infty} (t),

n lim ζ_{n} (t, .) = ζ_{\infty} (t) .

n lim ζ_{n} (t, .) = ζ_{\infty} (t) .

P_{N, n} ([a, 1 + Λ (N, n)] sup ∣ ζ_{n} - ζ (Λ (N, n), .) ∣ \geq C n^{- 1/3}) \leq n^{1/3} e^{- l n^{2} n /2} .

P_{N, n} ([a, 1 + Λ (N, n)] sup ∣ ζ_{n} - ζ (Λ (N, n), .) ∣ \geq C n^{- 1/3}) \leq n^{1/3} e^{- l n^{2} n /2} .

P_{n} (T_{n} \leq N) = n! {n N} n^{- N}

P_{n} (T_{n} \leq N) = n! {n N} n^{- N}

ξ (Λ) = (1 - e^{- ξ (Λ)}) (1 + Λ),

ξ (Λ) = (1 - e^{- ξ (Λ)}) (1 + Λ),

+ \infty lim J (Ξ) = 0,

+ \infty lim J (Ξ) = 0,

ρ = e^{- ξ},

ρ = e^{- ξ},

λ (m, ℓ)

λ (m, ℓ)

ξ (m, ℓ)

v

r (m, ℓ) = \frac{{ ℓ - 1 m - 1 }}{{ ℓ m }}, ρ (λ) = e^{- ξ} .

r (m, ℓ) = \frac{{ ℓ - 1 m - 1 }}{{ ℓ m }}, ρ (λ) = e^{- ξ} .

\forall m \geq ℓ \geq 0, {ℓ m} = ℓ {ℓ m - 1} + {ℓ - 1 m - 1},

\forall m \geq ℓ \geq 0, {ℓ m} = ℓ {ℓ m - 1} + {ℓ - 1 m - 1},

\frac{ℓ { ℓ m - 1 }}{{ ℓ m }} = 1 - r (m, ℓ) .

\frac{ℓ { ℓ m - 1 }}{{ ℓ m }} = 1 - r (m, ℓ) .

∣ r (m, ℓ) - ρ (λ) ∣ \leq \frac{C _{1}}{ℓ} .

∣ r (m, ℓ) - ρ (λ) ∣ \leq \frac{C _{1}}{ℓ} .

ψ (m, ℓ) = \frac{1}{2 π} \frac{m !}{ℓ !} (\frac{e ^{ξ} - 1}{ξ ^{1 + λ}})^{ℓ} \frac{π}{v ℓ} .

ψ (m, ℓ) = \frac{1}{2 π} \frac{m !}{ℓ !} (\frac{e ^{ξ} - 1}{ξ ^{1 + λ}})^{ℓ} \frac{π}{v ℓ} .

\psi(m,\ell)=\frac{m!(e^{\xi}-1)^{\ell}}{\ell!\xi^{m}\sqrt{2\pi m\Big{(}1-\frac{m}{\ell}e^{-\xi}\Big{)}}}.

\psi(m,\ell)=\frac{m!(e^{\xi}-1)^{\ell}}{\ell!\xi^{m}\sqrt{2\pi m\Big{(}1-\frac{m}{\ell}e^{-\xi}\Big{)}}}.

{ℓ m} \sim ψ (m, ℓ) .

{ℓ m} \sim ψ (m, ℓ) .

χ (m, ℓ) = \frac{{ ℓ m } - ψ ( m , ℓ )}{ψ ( m , ℓ )},

χ (m, ℓ) = \frac{{ ℓ m } - ψ ( m , ℓ )}{ψ ( m , ℓ )},

\frac{{ ℓ m } - ψ ( m , ℓ )}{ψ ( m , ℓ )} \leq \frac{C _{2}}{ℓ} .

\frac{{ ℓ m } - ψ ( m , ℓ )}{ψ ( m , ℓ )} \leq \frac{C _{2}}{ℓ} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicssemigroups and automata theory · Advanced Combinatorial Mathematics · Algorithms and Data Compression

Full text

The impatient collector

Anis Amri Institut Élie Cartan, Université de Lorraine Email: [email protected]

Philippe Chassaing Institut Élie Cartan, Université de Lorraine Email: [email protected]

Abstract

In the coupon collector problem with $n$ items, the collector needs a random number of tries $T_{n}\simeq n\ln n$ to complete the collection. Also, after $nt$ tries, the collector has secured approximately a fraction $\zeta_{\infty}(t)=1-e^{-t}$ of the complete collection, so we call $\zeta_{\infty}$ the (asymptotic) completion curve. In this paper, for $\nu>0$ , we address the asymptotic shape $\zeta(\nu,.)$ of the completion curve under the condition $T_{n}\leq\left(1+\nu\right)n$ , i.e. assuming that the collection is completed unlikely fast. As an application to the asymptotic study of complete accessible automata, we provide a new derivation of a formula due to Koršunov [7, 8].

1 Introduction
1.1 Main result
1.2 Context: coupon collector problem, Stirling numbers and random allocation
1.3 Asymptotics for the Stirling numbers of the second kind
2 The asymptotic behavior of the completion curve
2.1 A random walk related to Stirling numbers
2.2 Azuma inequality
2.3 Euler scheme
3 Coupon and automata
3.1 Koršunov’s formula
3.2 Basics on automata
3.3 Reduction to the NorthEast corner
3.4 Random walks and Pollaczek-Khintchine’s formula
3.5 Tail probabilities and Hoeffding’s inequality
3.6 The profile of an accessible automaton
4 Saddle-point method and Stirling numbers
4.1 Generating function and Cauchy formula
4.2 Central term
4.3 Tail
5 Appendix: Some special functions
5.1 $\xi$ as an implicit function of $\lambda$
5.2 Large deviation for the coupon collector
5.3 Properties of the limit path.
5.4 Small variations of $\rho$
5.5 Proof of Theorem 4
5.5.1 Small variations of $\xi$
5.5.2 Final argument
5.6 Explicit bounds for the second order asymptotics of ${m\brace\ell}$
5.6.1 Taylor coefficients
5.6.2 Upper bound for $C_{\texttt{000}}$
5.6.3 Upper bound for $C_{\texttt{001}}$
5.6.4 Upper bound for $C_{\texttt{0020}}$ and for $C_{\texttt{0021}}$
5.6.5 Upper bound for $K_{\ell}^{(1)}$
5.6.6 Conclusion

1 Introduction

1.1 Main result

This section is intended as a concise introduction to the main results, at the price of, eventually, lacking details, for instance on the way we round real numbers where integers are expected. Details and context are given in the next section. In the standard coupon collector problem with $n$ items, our concern is the completion curve $\zeta_{\infty}$ : after $nt$ tries, according to [9, pp. 4-5], the collector has secured approximately a fraction

[TABLE]

of the complete collection111In [9, pp. 4-5], the coupons outside the collection are seen as the empty cells in a random allocation scheme.. Furthermore, the coupon collector needs a random number of tries $T_{n}$ to complete the collection, with expectation:

[TABLE]

In this paper, for any given $\nu>0$ , we address the asymptotic shape $\zeta_{\nu}$ of the completion curve conditioned to the event $\mathcal{I}(\nu,n)$ :

[TABLE]

i.e. when the collection is completed much faster than in the classical model, hence the title. Since $(1+\nu)n=o\left(\mathbb{E}\left[T_{n}\right]\right)$ , one expects that the conditioning event $\mathcal{I}(\nu,n)$ has an exponentially small probability, see Section 5.2. Define $\nu(N,n)=\nu$ through the relations:

[TABLE]

Formal definitions are given in the next section, but let us, for now, define the random variable $\zeta_{n}(t,\omega)$ as the fraction of the complete collection secured by the collector after $nt$ tries. Let $W_{0}$ denote the principal branch of the Lambert W-function (i.e. the inverse of $x\mapsto xe^{x}$ ), and set:

[TABLE]

Let $\zeta(\nu,.)$ denote the unique solution, on $(0,\;1+\nu]$ , of the Cauchy problem:

[TABLE]

The graph of $\zeta(\nu,.)$ stays in the set $\{1+\nu\geq x>y>0\}$ , and satisfies $\lim_{t\downarrow 0}\zeta\left(u,t\right)=0$ , see Section 5.3.

Let $\mathbb{P}_{N,n}$ denote the conditional probability distribution of the coupon problem, given that $T_{n}\leq N$ . The asymptotic completion curve of the impatient collector is as follows :

Theorem 1.

For any $a,\varepsilon,\nu>0$ , when $N,n\rightarrow+\infty$ with $N/n\rightarrow 1+\nu$ , i.e. with $\lim\Lambda(N,n)=\nu$ , we have

[TABLE]

Theorem 1 is an extension of Theorem 2 (see next section), to the conditional case: $\zeta_{n}$ converges in probability to $\zeta(\nu,.)$ , uniformly in any interval $[a,1+\nu]$ . Thus $\zeta(\nu,.)$ is the $\nu$ -analog of $\zeta_{\infty}$ . In the next section, we give a stronger result, in which convergence in probability is given with an explicit bound on the error, cf. Theorem 3. In Section 3, we discuss some applications of this result to random finite automata, including a new (to our knowledge) derivation of a formula by Koršunov [7]. Finally, in Sections 4 and 5 we precise (see Theorem 6) a classic asymptotic formula, due to Good, for Stirling numbers of the second kind, providing a bound that is key for our results, but could also be of independent interest.

1.2 Context: coupon collector problem, Stirling numbers and random allocation

Let us define more precisely the classical model (resp. the conditioned model), that we shall call the patient model (resp. the impatient model). In the patient model, we consider a sequence

[TABLE]

of uniform i.i.d. integers in $[\![1,n]\!]$ . Let $\mathbb{P}_{n}$ denote the corresponding probability distribution on the set $[\![1,n]\!]^{\mathbb{N}}$ of infinite sequences. For $\ell\geq 0$ , let

[TABLE]

denote the size of the collection after the $k$ th try, or the number of nonempty cells after the $k$ th allocation, so that $T_{n}$ can also be defined as follows: for $1\leq k\leq n$ ,

[TABLE]

Then, set:

[TABLE]

so that the completion curve is defined as:

[TABLE]

One finds easily:

[TABLE]

but also, more precisely, as a consequence of [9, Ch. 1.1-3],

Theorem 2.

In the patient model, in probability, for any $t\geq 0$ ,

[TABLE]

As opposed to the patient model, in the impatient model, we consider the conditional distribution of $\omega$ given that $T_{n}(\omega)\leq N$ : then only the prefix $\omega_{[N]}=(\omega_{1},\omega_{2},\dots,\omega_{N})$ of $\omega$ matters. In the impatient model, $\omega_{[N]}$ is uniformly distributed on the $n!{N\brace n}$ sequences that are surjections on $[\![1,n]\!]$ , a small subset $\Omega_{N,n}$ of $[\![1,n]\!]^{N}$ . Here, as usual, ${m\brace\ell}$ denotes the number of partitions of a set of $m$ elements in $\ell$ nonempty subsets, called Stirling number of the second kind. Thus $\mathbb{P}_{N,n}$ , the conditional probability distribution of the coupon problem, given that $T_{n}\leq N$ , is the uniform distribution on $\Omega_{N,n}$ . A stronger version of Theorem 2 is as follows:

Theorem 3.

For any $a>0$ , and for $n_{0}$ large enough, there exists $C=C(n_{0},a)>0$ such that, for $n\geq n_{0}$ ,

[TABLE]

The expression of $C(n_{0},a)>0$ is given at Section 5.6.6. If we assume that $\Lambda(N,n)$ stays away from 0 and $+\infty$ , then, according to the asymptotic analysis of Stirling numbers of the second kind, to be found in [6], the conditioning event has an exponentially small probability:

[TABLE]

in which $J$ is discussed in more detail in Section 5.2. Let us just mention, now, that $\Xi=\xi(\Lambda)$ is the unique positive solution of

[TABLE]

that $\xi(\Lambda)=-\ln\left(F((N-n)/n)\right)$ , and that $J$ is decreasing and satisfies

[TABLE]

which entails that $J$ is positive. Together with

[TABLE]

the implicit function $\xi$ is known to play a special rôle in the asymptotic behavior of ${N\brace n}$ , see Section 4.

1.3 Asymptotics for the Stirling numbers of the second kind

First we need to set some notations. For some integers $m\geq\ell\geq 1$ , the Stirling number of the second kind, denoted by ${m\brace\ell}$ , is the number of partitions of a set of $m$ elements into $\ell$ non-empty subsets. By convention ${0\brace 0}=0$ , and for $m\geq 1$ we have ${m\brace 0}=0$ . Let $W_{0}$ denote the principal branch of the Lambert W-function (i.e. the inverse of $x\mapsto xe^{x}$ ), and set:

[TABLE]

We set:

[TABLE]

The Stirling numbers of the second kind satisfy the following recurrence relation

[TABLE]

so that

[TABLE]

In Section 5.5, we prove that for $m$ , $\ell$ large, $r\left(m,\ell\right)$ depends mostly on the ratio $m/\ell$ :

Theorem 4.

For any $\delta\in(0,1)$ , there exist $\ell_{0},C_{1}=C_{1}(\ell_{0},\delta)$ , both positive, such that, for any $\ell\geq\ell_{0},\lambda\in(\delta,\delta^{-1})$ ,

[TABLE]

This bound proves to be crucial to our aims, for $r\left(m,\ell\right)$ and $1-r\left(m,\ell\right)$ can be seen as transition probabilities for a random walk closely related to the completion curve $\zeta_{n}$ , cf. Proposition 1. At Section 5.6.6, we describe $C_{1}$ . To prove Theorem 4, we need a refinement of the asymptotic study of ${m\brace\ell}$ , originally made in [6]: set

[TABLE]

In Good [6], $\psi$ takes the alternative form

[TABLE]

As a first step toward Theorem 4, Good, followed by many others, established that $\psi(m,\ell)$ is an estimate of the corresponding Stirling number:

Theorem 5 ([6]).

When $\ell$ and $m$ both grow towards $+\infty$ , with $m=\Theta(\ell)$ ,

[TABLE]

Though [6, (3)] hints at an asymptotic expansion for the relative error:

[TABLE]

it does not really provide a bound for $\chi$ , while such a bound is needed to prove Theorem 4. So Sections 4 and 5 are devoted to the proof of the following bound, of independent interest :

Theorem 6.

For any $\delta\in(0,1)$ , there exist $\ell_{0},C_{2}=C_{2}(\ell_{0},\delta)$ , both positive, such that for any $\ell\geq\ell_{0}$ ,and for $\lambda(m,\ell)\in\left(\delta,\tfrac{1}{\delta}\right)$ ,

[TABLE]

2 The asymptotic behavior of the completion curve

2.1 A random walk related to Stirling numbers

In this section, with the help of Theorem 6, we prove Theorem 3, about the asymptotic behavior of the completion curve of an impatient coupon collector. For a suitable elementary (small) step $\tilde{h}$ , to be defined later in the section, we shall prove that

[TABLE]

in which $\sigma_{\ell}=o\left(\tilde{h}\right)$ , while, by definition,

[TABLE]

Then $\zeta_{n}$ is the result of an Euler scheme with rounding errors $\sigma_{\ell}$ . As such, $\zeta_{n}$ provides a stochastic approximation for $\zeta$ , in the spirit of [2, 4].

Actually, time-reversed versions of $\zeta$ and $\zeta_{n}$ , that start at time $1+\Lambda$ and end at time 0, are more convenient, for the approximations of Stirling numbers that we use are much worse for small arguments, making the convergence trickier when $(t,\zeta(t))$ and $(t,\zeta_{n}(t))$ are close to $(0,0)$ . The bound on $\sigma_{\ell}$ is obtained through probabilistic and combinatorial tools applied to the discrete version of $\zeta_{n}$ , before it is rescaled: for any surjection $\omega$ , consider a time-reversed version $Z^{(n)}$ of the completion curve $Y_{n}$ of $\omega$ , defined, for $t\in[0,N]$ , by

[TABLE]

Actually the corresponding point of the curve has coordinates $W_{t}=(N-t,Z^{(n)}_{t})$ , and under $\mathbb{P}_{N,n}$ , the probability distribution of $W=(W_{k})_{k\in[\![0,N]\!]}$ has a slick description in terms of Stirling numbers of the second kind.

Proposition 1.

$W$ * is a Markov chain starting at $(N,n)$ , with transition probabilities described, for $0\leq\ell\leq m$ , by :*

[TABLE]

In other words, $Z^{(n)}$ is an inhomogeneous Markov chain, with increments $\Delta_{k+1}=Z^{(n)}_{k+1}-Z^{(n)}_{k}$ satisfying

[TABLE]

Proof.

Let us compute the probability $p_{z}$ of a sample path

[TABLE]

for $Z$ , in which $z_{0}=n$ and $z_{N-m}=\ell$ : the restriction to $[\![1,m]\!]$ of any surjection $\omega$ resulting in $z$ has $\ell$ elements in its image, leading to ${m\brace\ell}n_{\downarrow\ell}$ choices for this restriction, then at each step $z_{k}\rightarrow z_{k-1}$ we have either $z_{k}$ choices for $\omega(k-1)$ if $y_{k}=z_{k}-z_{k-1}=0$ , or $n-z_{k}$ choices for $\omega(k-1)$ if $y_{k}=-1$ . The second case happens $n-\ell$ times exactly, and produces a factor $n-\ell!$ . Thus

[TABLE]

while, if $z.\ell-1$ denotes the path $\left(z_{0},z_{1},\ldots,z_{N-m},\ell-1\right)$ —seen as a word—, we have, by the same formula, since $y_{N-m+1}=-1$ :

[TABLE]

Thus the expression

[TABLE]

depends only on the final part of the sample path, on the couple $\left(Z^{(n)}_{N-m},Z^{(n)}_{N-m+1}\right)=(\ell,\ell-1)$ . As a consequence, $W$ satisfies the Markov property, and $r(m,\ell)$ is its transition probability, as expected.∎

2.2 Azuma inequality

Theorem 3 is a consequence of the following chain of approximations:

[TABLE]

and its proof results from bounds for the errors in this chain of approximation, as explained before. The first error is bounded with the help of the Azuma-Hoeffding inequality, as usual when the approximation stems from the law of large numbers, while the bound for the second error, given by Theorem 4, follows from the saddle-point method, as explained in Section 4. In order to use an Euler scheme, let us now divide the path into a sequence of, approximately, $\left(1+\Lambda\right)\times n^{\beta}$ infinitesimal intervals, each of these intervals being a sequence of $h=\lfloor n^{\alpha}\rfloor$ steps, $\alpha+\beta=1,\alpha,\beta>0$ . Consider then an integer $t$ of the form $jh$ , $j\in\mathbb{N}$ , so that $t$ is the beginning of some interval, and $t+h$ is the end of the same interval. Then

[TABLE]

in which:

[TABLE]

the last equality due to (9). Rescaling time and space by a factor $1/n$ , we set $\tilde{h}=h/n$ and

[TABLE]

Finally, for $\eta,\delta\in(0,1)$ , we set

[TABLE]

in such a way that, according to Theorem 6, $|\ell\chi(m,\ell)|$ is uniformly bounded for $(m,\ell)$ in $n\mathfrak{W}_{\eta,\delta}$ , as long as $n$ is large enough, and the same holds true for Theorem 4. Now, for $x\in[\eta,1+\Lambda]$ , by geometric considerations,

[TABLE]

Section 1.3 entails that

Lemma 1.

For $n$ large enough, and for $\eta,\delta\in(0,1)$ , if $n^{-1}W_{jh}\in\mathfrak{W}_{\eta,2\delta}$ and $N-(j+1)h\geq\eta n$ , we have

[TABLE]

Proof.

Recall that $t=jh$ . If

[TABLE]

then

[TABLE]

but, if $n^{-1}W_{t+s}\in\mathfrak{W}_{\eta,\delta}$ , we obtain, below, that

[TABLE]

entailing (11). Relation (12) follows from the Taylor inequality for $\rho$ , provided that both $n^{-1}W_{t}$ and $n^{-1}W_{t+s}$ belong to $\mathfrak{W}_{\eta,\delta}$ :

[TABLE]

and, since $W_{t+s}$ meets the conditions in Theorem 4,

[TABLE]

while, according to Section 5.4,

[TABLE]

For $n$ large enough,

[TABLE]

yielding successively (12), then (11). ∎

Also, for $t,k\geq 0$ , let $\mathcal{F}_{k}$ denote the $\sigma$ -algebra $\sigma(Z_{1},\;Z_{2},\ldots,Z_{t+k})$ , and

[TABLE]

The sequence $(\mathcal{M}_{k})$ is a martingale with respect to the filtration $\mathcal{F}$ and for any $k$ , $|\mathcal{M}_{k+1}-\mathcal{M}_{k}|\leq 1$ , thus Azuma’s inequality gives :

[TABLE]

in which $\mathcal{M}_{h}=A(j,h)$ . For $u=n^{\alpha/2}\ln n$ , we obtain:

[TABLE]

Set

[TABLE]

The previous bounds lead to

Proposition 2.

For $n$ large enough, the set $\mathfrak{H}_{n}$ satisfies:

[TABLE]

2.3 Euler scheme

Thus, for $\omega\notin\mathfrak{H}_{n}$ , i.e. but for a probability at most $\mathcal{O}\left(n^{\beta}e^{-\tfrac{\ln^{2}n}{2}}\right)$ , $\left(\zeta_{n}(t)\right)_{0\leq t\leq k}$ is obtained through an Euler scheme with step $\tilde{h}=n^{-1}h\simeq n^{\alpha-1}$ and rounding error $\sigma_{j}$ such that

[TABLE]

For the choice $\alpha=2/3$ , $\beta=1/3$ , and for $n$ large enough, depending on the choice of $(\ell_{0},\eta,\delta)$ , we obtain that

[TABLE]

Then we can see $\zeta(\ell\tilde{h})$ , resp. $\zeta_{n}(\ell\tilde{h})$ , as the solution of the ODE at time $\ell\tilde{h}$ (resp. the output of the Euler scheme after $\ell$ steps), and set

[TABLE]

Then, following [3] and according to Section 5.4, provided that the points $M_{n,\ell}=\left(\ell\tilde{h},\zeta_{n}(\ell\tilde{h})\right)$ and $M_{\ell}=\left(\ell\tilde{h},\zeta(\ell\tilde{h})\right)$ belong to $\mathfrak{W}_{\eta,2\delta}$ , and that $1+\Lambda-(\ell+1)\tilde{h}\geq\eta$ , we can write

[TABLE]

in which

[TABLE]

and

[TABLE]

The bounds for the supremums in (13) are obtained in Section 5.5, see Proposition 10. For $\ell=0$ , $M_{n,0}=M_{0}=\left(1+\Lambda,1\right)\in\mathfrak{W}_{a,4\delta}$ for $\delta$ small enough. Consider the bound (48) obtained for $\lambda$ at Section 5.3. It entails that, for $x\in[a,1+\Lambda]$ ,

[TABLE]

so that $\left(x,\zeta(x)\right)\in\mathfrak{W}_{a,4\delta}$ for $4\delta\leq\min\left(\tfrac{a\Lambda}{(1+\Lambda)^{2}},\Lambda^{-1}\right)$ . Thus $M_{\ell}\in\mathfrak{W}_{a,4\delta}\subset\mathfrak{W}_{a,2\delta}$ if $\ell\tilde{h}\in[a,1+\Lambda]$ . Now, for $x\in[a,1+\Lambda]$ ,

[TABLE]

Assume that, for $k\leq\ell$ , $M_{n,k-1}\in\mathfrak{W}_{a,2\delta}$ , so that we can write :

[TABLE]

Then

[TABLE]

for $n$ large enough, depending on $(\delta,a)$ , but not on $\ell$ , since :

[TABLE]

for $\ell\leq(1+\Lambda)n^{\beta}\simeq(1+\Lambda)\tilde{h}^{-1}$ . Relations (16) and (17) entail that $M_{n,\ell}\in\mathfrak{W}_{a,2\delta}$ so that (14) holds true and, in turn, $M_{n,\ell+1}\in\mathfrak{W}_{a,2\delta}$ , if necessary. It follows, recursively, that, for any $\ell\leq(1+\Lambda)n^{\beta}$ ,

[TABLE]

that is, at the ends of any infinitesimal interval, the error $\left|\zeta_{n}-\zeta\right|$ is bounded accordingly. Between these ends the error can be larger by at most half the length of this infinitesimal interval, i.e. by $n^{-1/3}/2$ , since both $\zeta_{n}$ and $\zeta$ are non increasing with slope smaller than 1. Finally, for $n$ large enough and $\omega\notin\mathfrak{H}_{n}$ , i.e. but for a probability at most $\mathcal{O}\left(n^{-\ln n/2\ +\beta}\right)$ , on the interval $[a,1+\Lambda]$ ,

[TABLE]

3 Coupon and automata

3.1 Koršunov’s formula

In $1978$ , Koršunov [7, 8] proves a formula for the asymptotic enumeration of accessible complete and deterministic automata (ACDA) with $n$ states over a $k$ -letters alphabet. Later Nicaud [13] proves that ACDA are in bijection with a subset $\mathcal{A}_{k,n}$ of $\Omega_{kn+1,n}$ , though he uses a different terminology : surjections are represented by boxed diagrams, and ACDA by Dyck boxed diagram. We recall briefly the definitions of these combinatorial objects in the next subsection. In this paper we assume that $k\geq 2$ and we set $N=kn+1$ . With these notations, we can rephrase Koršunov’s result as follows :

Theorem 7.

[7, 8]**

[TABLE]

In the notations of [10], $1-k\rho(k)=(1-\rho(k))E_{k}$ . In Section 3.2, we describe $\mathcal{A}_{k,n}$ following the lines of [13], then in Sections 3.3,3.4,3.5 we give a probabilistic proof of Theorem 7 : with the help of Theorem 3 and of the representation of ACDA, taken from [13], Theorem 7 reduces to the Pollaczeck-Khinchine formula for a simple random walk. In Section 3.6 we explain how Theorem 3 extends to ACDA.

3.2 Basics on automata

In this section, we recall briefly some vocabulary on words and automata, taken from [11, Section 1.3], then we describe the representation of ACDA by boxed diagrams, following [13]. Let $\mathcal{A}$ be a finite totally ordered set, called alphabet. The elements of $\mathcal{A}$ are called letters or also symbols. A finite word $w$ on the alphabet $\mathcal{A}$ is a finite sequence $w=w_{1}w_{2}\ldots w_{n}$ of elements of $\mathcal{A}$ . The set of words is endowed with the operation of concatenation, also called product, in which two words $u=u_{1}u_{2}\ldots u_{p}$ and $v=v_{1}v_{2}\ldots v_{q}$ give the word $uv=u_{1}u_{2}\ldots u_{p}v_{1}v_{2}\ldots v_{q}$ . This operation is associative, and it has a neutral element, the empty word, denoted by $\emptyset$ . The length of a word $u$ , denoted $|u|$ , is the number of letters in the word $u$ (so that $|\emptyset|=0$ ). We denote by $\mathcal{A}^{*}$ the set of finite words on the alphabet $\mathcal{A}$ .

Definition 1.

A deterministic and complete automaton $\mathfrak{A}$ is a quintuplet $(\mathcal{A},Q,\delta,I,F)$ consisting of:

•

an alphabet $\mathcal{A}$ , such that $\#\mathcal{A}=k$ ,

•

a set $Q$ of states, such that $\#Q=n$ ,

•

an initial state $q_{0}$ ,

•

a transition function, $\delta$ , that takes as argument a state and a symbol and returns a state, $\delta:Q\times\mathcal{A}\rightarrow Q$ ,

•

a set of final states $F\subset Q$ .

The transition function $\delta$ has a straightforward extension to $Q\times\mathcal{A}^{*}$ , that describes a path from a state $q$ to another state $\delta(q,w)$ through a sequence $w$ of letters (=edges) in a directed graph related to $\delta$ , see the figure below. For instance, for $w=w_{1}w_{2}\in\mathcal{A}^{2}$ ,

[TABLE]

Definition 2.

A deterministic finite automaton $\mathfrak{A}$ is accessible when for each state $q$ of $\mathfrak{A}$ , there exists a word $u\in\mathcal{A}^{*}$ such that $\delta(q_{0},u)=q$ .

Definition 3.

A word $u$ is recognized by an automaton when $\delta(q_{0},u)\in F$ . The language recognized by an automaton is the set of words that it recognizes.

Two representations of an ACDA. Consider the ACDA $\mathfrak{A}=(\mathcal{A},Q,\delta,I,F)$ given by the alphabet $\mathcal{A}=\{a,b,c\}$ , $Q=\{q_{0},q_{1},q_{2},q_{3}\}$ , $I=q_{0}$ and $F=\{q_{3}\}$ . The transition table of $\mathfrak{A}$ is a first representation of $\delta$ , for instance :

[TABLE]

The symbol $\rightarrow$ marks the initial state, here it is $q_{0}$ . The symbols $*$ mark the final state(s) (here there is only one final state, $q_{3}$ ).

Another representation is through a directed graph with edges labelled by $\mathcal{A}$ and whose set of vertices is $Q$ : for $(a,q,r)\in\mathcal{A}\times Q^{2}$ a directed edge $(q,r)$ with label $a$ is present in the graph of $\mathfrak{A}$ if and only if $\delta(q,a)=r$ . Then each vertex of the graph has out-degree $k$ , and there is a path from $q$ to $r$ in the graph if and only if there exists a word $u\in\mathcal{A}^{*}$ such that $\delta(q,u)=r$ , hence the term accessible. Only the initial state has an ingoing edge with no starting point, and the final states have an outgoing edge with no endpoint.

$q_{0}$ start $q_{1}$$q_{2}$$q_{3}$ \pgfmathresultpt $b$ \pgfmathresultpt $a$ \pgfmathresultpt $c$ \pgfmathresultpt $a$ \pgfmathresultpt $b,c$ \pgfmathresultpt $a,b$ \pgfmathresultpt $c$ \pgfmathresultpt $a,b,c$

The accessibility of an automaton $\mathfrak{A}=(\mathcal{A},Q,\delta,q_{0},F)$ depends only on its transition structure $\mathfrak{D}=(\mathcal{A},Q,\delta,q_{0})$ , not on its final states, thus one can discuss the accessibility of a complete deterministic transition structure (CDTS) : $(\delta,q_{0})$ can be seen as a map $\delta^{*}$ from the set of edges $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ , including thus $Q\times\mathcal{A}$ plus the starting edge $\rightarrow$ , such that $\delta^{*}\left(\rightarrow\right)=q_{0}$ and $\delta^{*}|_{Q\times\mathcal{A}}=\delta$ . The CDTS is accessible only if its transition function $\delta^{*}$ is a surjection, that is, $\delta^{*}$ has to belong to $\Omega_{N,n}$ and this has to be the connection between the impatient collector and ACDA. However, two problems arise:

•

though $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ has $kn+1=N$ elements, a total order would be handy to identify $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ with $[\![1,N]\!]$ , and $\delta^{*}$ with an element of $\Omega_{N,n}$ ;

•

the surjectivity of $\delta^{*}$ is not sufficient to insure the connexity of $\mathfrak{D}$ .

It turns out that the answer to the second point is also an answer to the first point : as usual for the connexity of graphs, a necessary and sufficient condition of connexity is a positivity condition for a path related to the breadth-first-search of the corresponding graph, and this breadth-first search also provides a total order on $Q$ , which, with the alphabetic order on $\mathcal{A}$ , induces a lexicographic order on $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ , allowing to identify $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ with $[\![1,N]\!]$ . The path, then, is the completion curve for $\delta^{*}$ once $\{\rightarrow\}\cup\left(Q\times\mathcal{A}\right)$ is identified to $[\![1,N]\!]$ .

More precisely, the search starts from the initial vertex, $\delta^{*}(\rightarrow)$ , of the first directed edge $\rightarrow$ , end vertex relabelled $1$ for it is the first piece in the collection. Then one explores the edges starting from $q_{0}=1$ , in the alphabetic order, from $\delta(q_{0},a_{1})$ to $\delta(q_{0},a_{n})$ , and when this exploration is over, either there exists no new piece in the collection, meaning that $q_{0}$ is a connected component by itself, and meaning that $\mathfrak{D}$ is not accessible, or there exists some new piece. Thus the completion curve of an accessible CDTS must satisfy $y_{k+1}\geq 2$ . The $y_{k+1}-1$ new vertices, at this stage, are of the form

[TABLE]

and they are sorted (and explored) according to the alphabetic order for the letters $a\in\mathcal{A}$ . They are also relabeled $2,3,\dots,y_{k+1}$ . Similarly, after the exploration of the neighbours of the second piece $q_{1}=2$ of the collection, we need $y_{2k+1}\geq 3$ , else $\{q_{0},q_{1}\}$ would be a connected component. In general, the CDTS is accessible if and only if the completion curve $y$ of $\delta^{*}$ satisfies

[TABLE]

Now, according to [13], the boxed diagram of $\mathcal{D}$ is just the completion curve of $\delta^{\ast}$ , decorated with one mark in each column, at height $x_{i}\leq y_{i}$ , meaning that $\delta^{*}(i)=x_{i}$ .

Example: In Figure 1, we see how the breadth-first search of the graph produces a labeling of the vertices and edges, which, in turn, dictates the order of the search: the ends of the edges starting from a given vertex are searched in the alphabetic order, and the vertices are searched according to their order of apparition during the breadth-first search, beginning with the starting vertex. The 24 CDTS, obtained by permutation of the symbols $\{\mathfrak{T},\mathfrak{W},\mathfrak{S},\mathfrak{X}\}$ as labels of the vertices of our example on the left, would produce the same labeling as is pictured on the left. Note that the correspondance edge-endpoint is a surjection $\tilde{\omega}$ from $[\![1,13]\!]$ to $[\![1,4]\!]$ , with a special property: the partition $P_{\mathfrak{D}}=\left(\tilde{\omega}^{-1}(1),\tilde{\omega}^{-1}(2),\tilde{\omega}^{-1}(3),\tilde{\omega}^{-1}(4)\right)$ , here e.g.

[TABLE]

is necessarily sorted in increasing order of the smallest elements of the parts.

Similarly, a sequence $\omega\in\Omega_{N,n}$ (a surjection) can be matched with a boxed diagram in $\mathcal{S}_{N,n}$ in exactly $n!$ ways, as follows : according to the coupon collector metaphor, the collection process produces an order among the elements of the collection:

[TABLE]

denotes the $k$ th element of $[\![1,n]\!]$ to enrich the collection ; $\sigma_{\omega}$ is a random uniform permutation of $[\![1,n]\!]$ . Setting

[TABLE]

we obtain that

[TABLE]

Thus $(y_{i},\tilde{\omega}_{i})_{1\leq i\leq N}$ is a boxed diagram associated with the surjection $\omega$ , or with any surjection of the form $\tau\circ\omega$ , with $\tau\in\mathfrak{S}_{n}$ . That is, if $\tau\in\mathfrak{S}_{n}$ , $\tau\circ\omega$ produces the same boxed diagram as $\omega$ , and there exists exactly $n!$ elements of $\Omega_{N,n}$ with the same boxed diagram. Finally, a random uniform surjection $\omega\in\Omega_{N,n}$ produces a random uniform boxed diagram, while a random uniform CDTS produces a random uniform boxed diagram satisfying additionally the constraint (18) (such a boxed diagram is also called a $k$ -Dyck boxed diagram). Thus there is a correspondance in which each boxed diagram is related to $n!$ different surjections of $\Omega_{N,n}$ , and a similar (though different) correspondance in which a $k-$ Dyck boxed diagram is related to $n!$ different CDTS. Note that, according to (18),

[TABLE]

meaning that $y_{nk+1}$ does not satisfy inequality (18).

3.3 Reduction to the NorthEast corner

Now $\mathcal{A}_{k,n}$ denotes the subset of elements $\omega\in\Omega_{N,n}$ meeting the condition (18). Then Theorem 7 is equivalent to an assertion on the asymptotic behaviour of the Markov chain $Z^{(n)}$ studied at Section 2.1 : the probability that the sample path of $Z^{(n)}$ crosses the line $y=x/k$ outside its endpoints $(0,0)$ and $(kn+1,n)$ converges to $k\rho(k)\in(0,1)$ .

This probabilistic formulation of Koršunov’s formula hints at the relation with Theorem 3 : as Figure 3 shows, Theorem 3, with Proposition 9, prevents these crossings outside the close vicinity of the endpoints, but for a small probability. For $n$ large enough, a crossing inside the interval $I_{2}=[\![an,kn-2Ck^{2}n^{1/3}]\!]$ violates the convergence to the limit path at the rate given by Theorem 3, thus such crossings happen with a probability smaller than $n^{1/3}e^{-\ln^{2}n/2}$ . As a consequence, the line of proof for Theorem 7 goes according to the following steps: set

[TABLE]

and let the event that a crossing happens inside the interval $I_{j}$ be denoted $\Upsilon_{n,j}$ . Then

[TABLE]

in which, due to Theorem 3,

[TABLE]

We also have:

Proposition 3.

If $a$ is small enough,

[TABLE]

As a consequence, the asymptotic behaviour of the profile in the NorthEast corner should give simultaneously the limit of $\mathbb{P}_{N,n}\left(\Upsilon_{n,3}\right)$ , and Koršunov’s formula. This is the topic of the next sections.

Proof.

An alternative formulation of the condition $\overline{\Upsilon_{n,1}}$ is as follows: $y_{\ell k+1}\geq\ell+1$ *holds true for * $0\leq\ell\leq an$ . The proof of (21) has two steps :

[TABLE]

and

[TABLE]

Now:

[TABLE]

But

[TABLE]

for $a$ small enough, in which case we have :

[TABLE]

as expected.

For (23), note that :

[TABLE]

But, under $\mathbb{P}_{n}$ , $\left(T_{k}-T_{k-1}\right)_{1\leq k\leq n}$ is a sequence of independent random variables (with geometric distributions and respective expectations $n/(n+1-k)$ ), so, according to [14]:

[TABLE]

or, equivalently,

[TABLE]

which is relation (23). ∎

3.4 Random walks and Pollaczek-Khintchine’s formula

In this section, and the next one, we shall prove that

[TABLE]

Relation (24) results from the Pollaczek-Khinchine formula, as we shall see now: $\Upsilon_{n,3}$ relates to a crossing of the line $y=x/k$ by $Z^{(n)}$ before time $2Ck^{2}n^{1/3}$ . But, before time $2Ck^{2}n^{1/3}$ , i.e. for $(m,\ell)$ close to $(kn,n)$ , due to Proposition 1 and Theorem 4, the transition probabilities $r(m,\ell)$ of $Z^{(n)}$ are close to the constant $\rho(k)$ , so that we expect $Z^{(n)}$ to behave, early, like a random walk $Z$ starting at $n$ , with step distribution

[TABLE]

Since $\rho(k)<1/k$ , the trend is that $Z$ does not cross the line, and if it does at all, the crossing has to take place early, hence we expect the crossing probability of Z to be the limit of $\mathbb{P}_{N,n}\left(\Upsilon_{n,3}\right)$ . In the next section, we shall discuss the convergence (to $Z$ ) of $Z^{(n)}$ , and its speed. In this section, we compute the crossing probability $\mathbb{P}\left(\Upsilon\right)$ of $Z$ , in which:

[TABLE]

Proposition 4.

[TABLE]

Proof.

It is convenient to make some time and space changes to represent our crossing probability in more familar terms, i.e. in terms of a new random walk $S$ on the integers, with negative drift, starting from 0, and such that $\overline{\Upsilon}=\left\{\max S_{n}=0\right\}$ holds true (or such that $\overline{\Upsilon}$ and $\tilde{\Upsilon}=\left\{\max S_{n}=0\right\}$ are closely related events, actually). If we set, for $j\geq-1$ ,

[TABLE]

then $S$ is a random walk with step distribution

[TABLE]

with drift

[TABLE]

starting from 1 at time -1, and:

[TABLE]

Thus

[TABLE]

But we know, from the Pollaczek-Khinchine formula (cf. [1, Corollary 6.6]) that $\mathbb{P}\left(\tilde{\Upsilon}\right)$ is the stationary distribution $\pi_{0}$ at 0 of the Lindsey process with step $\mu_{k}$ . For $\ell\geq 1$ , let $t_{\ell}$ denote the average time needed by the Lindsey process (or, indifferently, by the random walk $S$ ) to hit 0 starting from position $\ell$ : Wald’s identity gives that

[TABLE]

On the other hand, if $t_{0}$ is the expected time of the first return to 0, starting from 0, then, by the Markov property,

[TABLE]

and finally:

[TABLE]

Thus, as expected, $\mathbb{P}\left(\overline{\Upsilon}\right)=\left(1-\rho\left(k\right)\right)\mathbb{P}\left(\tilde{\Upsilon}\right)=-d_{k}=1-k\rho(k)$ . ∎

3.5 Tail probabilities and Hoeffding’s inequality

For some process $X=(X_{i})_{i\geq 0}$ , let $X_{[\![\ell,m]\!]}$ denote the section $(X_{i})_{\ell\leq i\leq m}$ of the sample path $X$ . First, let us bound the distance between the random walk $Z^{(n)}$ of Proposition 1, and $Z$ :

Proposition 5.

Under $\mathbb{P}_{N,n}$ , $Z^{(n)}$ converges to $Z$ in distribution. Moreover, for $\alpha\in(0,1)$ , there exists $C_{\alpha}>0$ such that for $n$ large enough :

[TABLE]

Proof.

We shall use that if

[TABLE]

then:

[TABLE]

Consider a sample path $z=\left(z_{j}\right)_{0\leq j\leq s}$ in which $z_{0}=n$ . Let $\Delta_{j}=z_{j}-z_{j+1}\in\{0,1\}$ denote its $j$ th increment. Under $\mathbb{P}_{N,n}$ , as a consequence of Proposition 1, for any given $s$ ,

[TABLE]

while

[TABLE]

For $\alpha\in(0,1)$ and $n$ large enough, and for a suitable choice of $\eta,\delta\in(0,1)$ , $n^{-1}W_{t}$ belongs to $\mathfrak{W}_{\eta,2\delta}$ , so that, according to (12), for $t=N-1$ and $0\leq\ell\leq m\leq n^{\alpha}$ ,

[TABLE]

Since the probability of a given sample path of $Z$ , resp. $Z^{(n)}$ , is a product of terms $\alpha_{i}$ or $\beta_{i}$ of the following form

[TABLE]

in which $-\Delta\in\{0,-1\}$ is the increment for some step of the random walks of $Z$ , resp. $Z^{(n)}$ , and as a consequence of (26) and (25) (with $\theta=1$ ), the probabilities of these sample paths of length $s\leq n^{\alpha}$ differ by at most

[TABLE]

That a set $A\subset[\![0,n]\!]^{s+1}$ has at most $2^{s}$ admissible elements starting at position $n$ entails that Proposition 5 holds true with the choice $C_{\alpha}=\dfrac{8}{\eta}$ . ∎

Let $\mathbb{Z}^{\star}$ (resp. $\mathbb{Z}^{\infty}$ ) denote the set of finite (resp. finite or infinite) words on the alphabet $\mathbb{Z}$ , and for a finite word $\omega=\omega_{0}\omega_{1}\omega_{2}\dots\omega_{s}$ , set $|\omega|=s+1$ . For $0\leq s<t\leq+\infty$ , let us define the crossing set $\Upsilon(s,t)$ as follows:

[TABLE]

so that, for instance,

[TABLE]

The next proposition completes the proof of Theorem 7 :

Proposition 6.

[TABLE]

Proof.

We shall prove successively that:

[TABLE]

for $s_{n}=\upsilon\ln n$ , in which $\upsilon\ln 2\leq 1-\alpha$ . First, Proposition 5 entails (28) at once. Relations (27) and (29) both follow from Hoeffding’s inequality. For (27) it is rather straightforward : if we set $\beta(k)=\rho(k)-\tfrac{1}{k}$ , relation (44) entails that $\beta(k)<0$ , so that

[TABLE]

Thus the probability of a crossing at some point after time $s_{n}$ satisfies

[TABLE]

Similarly

[TABLE]

but here we cannot use Hoeffding’s inequality directly, though $Z^{(n)}_{0}-Z^{(n)}_{\ell}$ is a sum of Bernoulli random variables, for these Bernoulli random variables are not independent. However, we can build, on the same probability space, a copy of $Z^{(n)}$ and a random walk $\hat{Z}$ in such a way that, for $\ell\leq 2k^{2}n^{1/3}$ , $Z^{(n)}_{m}\leq\hat{Z}_{m}$ and $\hat{Z}$ ’s drift is smaller than $1/k$ , using a sequence $U=\left(U_{m}\right)_{m\geq 1}$ of independent random variables, uniform on $(0,1)$ . Set $b=\dfrac{1}{k}+\dfrac{\beta\left(k\right)}{2}$ and

[TABLE]

For $\widehat{Z}^{(n)}$ and ${Z}^{(n)}$ to have the same distribution, due to Proposition 1 , we need to set $\widehat{Z}^{(n)}_{0}={Z}^{(n)}_{0}=n$ and

[TABLE]

For $0\leq n-\widehat{Z}^{(n)}_{m}\leq m\leq n^{\alpha}$ , and for $\alpha>1/3$ , if we choose $n$ large enough, so that $n^{\alpha}>2k^{2}n^{1/3}$ , and so that

[TABLE]

we can use (26) to obtain that

[TABLE]

Hence

[TABLE]

the second inequality due to Hoeffding’s inequality. Relation (29) follows. ∎

3.6 The profile of an accessible automaton

Jointly with Theorem 7, Theorem 3 has a straightforward consequence : the completion curve of a uniform ADCA has the same limit curve. More precisely, if $\mathbb{Q}_{k,n}$ denotes the uniform distribution on ADCA with $n$ vertices and $k$ letters, or, equivalently $\mathbb{Q}_{k,n}$ is the conditional probability given $\mathcal{A}_{k,n}$ :

[TABLE]

then

Lemma 2.

For a sequence of events $\left(B_{n}\right)_{n\geq n_{0}}$ ,

[TABLE]

Thus, Theorem 3 translates to large automata at once, and we obtain

Theorem 8.

For any $a>0$ , there exists $C_{3}(n_{0},\varepsilon)>0$ such that, for $n\geq n_{0}$ ,

[TABLE]

Recall that $f_{\Lambda}$ is defined at the end of Section 1.2.

4 Saddle-point method and Stirling numbers

This section is devoted to the proof of Theorem 6.

4.1 Generating function and Cauchy formula

Recall the notations:

[TABLE]

According to [6, (6)] or [5, Example III.11, p.179], we have :

[TABLE]

in which

[TABLE]

By the Cauchy formula,

[TABLE]

in which

[TABLE]

to be compared to the asymptotic equivalent to ${m\brace\ell}$ given by [6, (3)], $\psi(m,\ell)$ , that satisfies:

[TABLE]

We expect that $|g(\theta)|\leq 1$ for any $\theta$ , or $|B(\xi e^{i\theta})|\leq B(\xi)$ , since $B(\xi z)$ , as a power series in $z$ , has positive coefficients. We also expect that, around 0,

[TABLE]

and more precisely, since $\xi$ is a saddle-point, we expect that

[TABLE]

According to (46),

[TABLE]

which entails that

[TABLE]

Set

[TABLE]

in which a suitable choice of $\theta_{0}$ is made later, so that:

[TABLE]

In the next sections, in order to prove Theorem 6, we obtain that

[TABLE]

4.2 Central term

In this section we obtain a saddlepoint bound for $K_{\ell}^{(0)}$ , following [5]. We write

[TABLE]

in which :

[TABLE]

is the characteristic function of any Poisson random variable $Z$ with expectation $\xi$ . For our aims, we need a precise estimation of $g$ , obtained through the Taylor-Laplace inequality, see Section 5.6. There, we prove that, for suitable constants $(v,\tau,\gamma)$ ,

[TABLE]

in which, according to Section 5.6.2,

[TABLE]

Note that, according to (46),

[TABLE]

We can write

[TABLE]

Now

[TABLE]

Thus, $\theta_{0}\sqrt{\ell}$ has to be large for $K_{\ell}^{(01)}$ to be $o\left(\ell^{-3/2}\right)$ . On the other hand,

[TABLE]

in which, for $\tilde{\gamma}=\gamma-\tfrac{v^{2}}{2}$ ,

[TABLE]

With the help of (53), since $T(\lambda)$ is bounded for $\lambda\in(\delta,\delta^{-1})$ , we obtain that

[TABLE]

in which $C_{\texttt{000}}$ is discussed at Section 5.6.2. For $K_{\ell}^{(000)}$ to be small, $\theta_{0}\sqrt{\ell}$ cannot be too large:

[TABLE]

yields that

[TABLE]

and that, for $\lambda\in(\delta,\delta^{-1})$ , and $\ell\geq e^{3/\delta}$ ,

[TABLE]

Now

[TABLE]

in which the dependence of $(C_{\texttt{000}},C_{\texttt{001}},\tau,\gamma,\tilde{\gamma})$ on $\lambda$ is studied at Section 5.6, in order to complete the proof of Theorem 6. Finally

[TABLE]

For inequality (38), note that, due to inequalities (54), if $\lambda(m,\ell)\in(\delta,\tfrac{1}{\delta})$ , then $\ell|\tilde{\gamma}|\theta_{0}^{4}\leq\ln 2$ for $\ell$ large enough. For the first term, since

[TABLE]

and $\tau\in i\mathbb{R}$ , we have

[TABLE]

for $\ell$ large enough. Also:

[TABLE]

Thus

[TABLE]

4.3 Tail

As for $K_{\ell}^{(1)}$ , relation (34) yields that:

[TABLE]

Set:

[TABLE]

Following [12, Lemma 1& 2], we prove that

Lemma 3.

For $\theta\in[-\pi,\pi]$ ,

[TABLE]

or equivalently

[TABLE]

Proof.

For $k\geq 0$ , set:

[TABLE]

so that:

[TABLE]

and:

[TABLE]

But

[TABLE]

Thus

[TABLE]

For $\theta\in[-\pi,\pi]$ ,

[TABLE]

leading to:

[TABLE]

∎

Thus, according to Section 5.6.5, for $\ell$ large enough,

[TABLE]

Finally we are ready to prove Theorem 6.

Proof.

Inequality (40) holds true for $\ell$ large enough, and, on $(0,+\infty)$ , its coefficients $C_{i}$ are positive continuous functions of $\lambda$ , thus, for $\lambda(m,\ell)\in[\delta,\tfrac{1}{\delta}]$ , they are bounded. One can deal similarly with inequality (41) (see Section 5.6.5). ∎

5 Appendix: Some special functions

5.1 $\xi$ as an implicit function of $\lambda$

We have seen that $\xi$ is an essential parameter in the asymptotic behaviour of the Stirling number

[TABLE]

in which $\lambda=\lambda(m,\ell)$ is defined by $\lambda=\tfrac{m-\ell}{\ell}$ , and $\xi$ is an implicit function of $\lambda$ , defined by

[TABLE]

For instance, the completion curve $\zeta_{\Lambda}$ of Theorem 3 is defined in terms of $\Lambda=\lambda(N,n)$ and in terms of $\Xi=\xi(\Lambda)$ . Thus we need to list some of the properties of $\xi$ that are of interest in our proofs, not all of them being straightforward, for instance in order to prove Theorem 4 in Section 5.5.

Proposition 7.

The function $\xi$ is increasing, nonnegative and concave, and $\lambda\rightarrow\xi(\lambda)-\lambda$ is increasing, nonnegative and concave as well. Also, we have:

[TABLE]

Proof.

Proof of (42). Relation (2) entails

[TABLE]

at once. Since $\xi\geq 0$ ,

[TABLE]

so, from

[TABLE]

we deduce that

[TABLE]

In order to prove that $2\lambda\geq\xi$ , we need to prove that

[TABLE]

but the last inequality holds true for any positive number $\xi$ , as a consequence of

[TABLE]

Proof of monotony and concavity of $\xi$ and $\xi-\lambda$ . Note that

[TABLE]

entails $\xi(0)=0$ . For $\xi^{\prime}\left(0\right)=2$ we have no additional trouble: when $\xi,\lambda\rightarrow 0_{+}$ ,

[TABLE]

Now, from the implicit function theorem, we obtain:

[TABLE]

entailing that $\xi$ , and $\xi-\lambda$ as well, are increasing. Then

[TABLE]

so that

[TABLE]

It follows that $\xi$ and $\xi-\lambda$ are concave. Finally, (43) is an easy consequence of (2), and (44) follows from:

[TABLE]

and from (42).∎

We also need that :

Lemma 4.

[TABLE]

Proof.

The relation (46) can be written successively:

[TABLE]

the last one being clearly true. ∎

5.2 Large deviation for the coupon collector

Since, for $\lambda>0$ , we have:

[TABLE]

Theorem 5 entails that

[TABLE]

in which

[TABLE]

One can write :

[TABLE]

and finally

[TABLE]

Also:

[TABLE]

Thus, $J$ is decreasing and

[TABLE]

which entails that $J$ is positive.

5.3 Properties of the limit path.

The properties of the sample path $\zeta=f_{\lambda(x_{0},y_{0})}$ solution of

[TABLE]

between $(0,0)$ and $(x_{0},y_{0})$ matter to our saddle-point estimates for the Stirling numbers, since these estimates are valid only when the sample path is far away from $\partial A$ , or, equivalently, when $x$ is large enough and $\lambda$ is far from $\{0,+\infty\}$ . We are specially interested by the solution $\zeta_{\Lambda}$ obtained on the interval $[0,1+\Lambda]$ when $(x_{0},y_{0})=(1+\Lambda,1)$ , for it is the asymptotic completion curve mentionned in Theorem 3. In this section, we prove that $\zeta$ satisfies

[TABLE]

and stays away from $\partial A$ , if $x$ is large enough, as shown in Figure 5. This follows from the variations of $\lambda$ along the curve $y=\zeta(x)$ , where we have:

[TABLE]

Proposition 8.

The solution $\zeta$ to (47) going through $(x_{0},y_{0})$ satisfies, for $0<x\leq x_{0}$ ,

[TABLE]

Proof.

From (47), we obtain the differential equation for $\lambda(x)$ :

[TABLE]

**Lower bounds. ** Relations (50) and (46) yields that

[TABLE]

thus, for $0<x\leq x_{0}$ ,

[TABLE]

leading to the lower bounds of (49).

**Upper bounds. ** With (42) and (50) together, we obtain:

[TABLE]

leading to the upper bounds in Proposition 8. ∎

These estimates are also useful in the proof of Koršunov’s formula, in which we need that a strip close to the limit path intersects the forbidden zone $\{y\leq x/k\}$ only close to its endpoints $(0,0)$ and $(k,1)$ . We have :

Proposition 9.

For $0<\varepsilon\leq\dfrac{1}{2(k+1)}$ , and $2k\varepsilon\leq x\leq k-2k^{2}\varepsilon$ ,

[TABLE]

Proof.

Due to (49), we only need to prove that

[TABLE]

when $x$ is in the interval, and it follows easily from the fact that

[TABLE]

holds true at the endpoints. ∎

5.4 Small variations of $\rho$

In this section, we bound the variations of $\rho$ in order to obtain the accuracy of the Euler scheme used in Theorem 3, cf. (13), and also to obtain the precision of the approximation of the completion curve by a random walk in the proof of Koršunov’s formula. Since $\rho=e^{-\xi}$ , according to (45)

[TABLE]

Thus, according to (50), a sample path $\zeta$ solution of (47) satisfies

[TABLE]

Similarly, anywhere in the domain,

[TABLE]

since $\dfrac{\xi(\lambda+1-\xi)}{(\xi-\lambda)}\leq 2$ holds true, for it reduces to $2(e^{\xi}-1-\xi)\geq\xi^{2}$ . Thus we have:

Proposition 10.

If $\{(x,\zeta(x)),(x,y)\}\subset\mathfrak{W}_{\eta,\delta}$ ,

[TABLE]

5.5 Proof of Theorem 4

5.5.1 Small variations of $\xi$

For $m>\ell\geq 2$ , $\lambda>0$ and $\tilde{\lambda}=\tfrac{m-1}{\ell-1}-1=\lambda+\tfrac{\lambda}{\ell-1}$ , $\xi=\xi\left(\lambda\right)$ , $\tilde{\xi}=\xi\left(\tilde{\lambda}\right)$ , we set, for any real function $f$ ,

[TABLE]

Then we have:

Proposition 11.

For $m>\ell\geq 2$ , $\lambda>0$ , we have

[TABLE]

Proof.

We need a bound for

[TABLE]

Thus,

[TABLE]

In order to bound $\left|\ln\tilde{\xi}-\ln\xi\right|$ , after some computations starting from:

[TABLE]

we obtain:

[TABLE]

since

[TABLE]

yielding a bound, for the second derivative, that entails the desired result. ∎

5.5.2 Final argument

Now we can use Theorem 6 to bound the error $\varpi$ in the approximation of the transition probability $r\left(m,\ell\right)$ by $\rho\left(\lambda\right)=e^{-\xi}$ :

[TABLE]

Set :

[TABLE]

so that :

[TABLE]

First :

[TABLE]

Since

[TABLE]

we have

[TABLE]

According to Theorem 6,

[TABLE]

thus, for $\ell$ large enough, $\chi\left(m-1,\ell-1\right)\geq-\tfrac{1}{2}$ and

[TABLE]

Now, for some $u\in[0,1]$ ,

[TABLE]

in which $\theta\left(m,\ell\right)$ , that turns out to be $\mathcal{O}\left(\frac{1}{\ell}\right)$ , is defined as follows :

[TABLE]

We write

[TABLE]

with

[TABLE]

The factor $\ell$ in $A$ is the reason why we need the second order approximations of Section 5.5.1. Now we see, from Proposition 11, that :

[TABLE]

so that

[TABLE]

But, more precisely, Proposition 11 entails:

[TABLE]

that is, $A=\mathcal{O}\left(\tfrac{1}{\ell}\right)$ . Now, for $B$ , since $\lambda\rightarrow\xi\left(\lambda\right)$ , $\lambda\rightarrow\xi\left(\lambda\right)-\lambda$ , $\lambda\rightarrow\ln\left(1+\lambda\right)$ , are increasing and concave, then $\lambda\rightarrow\ln\left(\xi\left(\lambda\right)-\lambda\right)$ is increasing and concave too, being composed with an increasing and concave function, so, due to Taylor-Lagrange formula, all these functions satisfy:

[TABLE]

and that yields:

[TABLE]

Also, for $\ell\geq\tfrac{1}{2}$ ,

[TABLE]

so that, using (46),

[TABLE]

and, using (46) again, for $(m,\ell)\in\mathfrak{W}_{3,\delta}$ ,

[TABLE]

so that, for $\delta\leq\lambda\leq\delta^{-1}$ and $\ell\geq 20\delta^{-3}$ , we have $|\theta|\leq 1$ and

[TABLE]

For instance, this holds true for $(m,\ell)\in\mathfrak{W}_{40\delta^{-4},\delta}$ .

5.6 Explicit bounds for the second order asymptotics of ${m\brace\ell}$

In this section, we provide detailed computations in order to bound $|\chi(m,\ell)|$ , thus completing the proof of Theorem 6.

5.6.1 Taylor coefficients

As usual, the derivatives of a characteristic function such as $\Phi$ are bounded as follows:

[TABLE]

thus, due to (36), we need the first moments of the Poisson distribution, given by the Touchard polynomials:

[TABLE]

in order to compute the coefficients in the Taylor-Laplace formula for $g$ , for the derivatives of $g$ are obtained through the Leibniz rule, as follows:

[TABLE]

This gives the coefficients in the Taylor-Laplace inequality:

[TABLE]

that is:

[TABLE]

computations needed in order to bound the coefficients in (40).

5.6.2 Upper bound for $C_{\texttt{000}}$

Finally, for $\theta\in\mathbb{R}$ , the fifth derivative is bounded as follows:

[TABLE]

in which we use again and again $\lambda\leq\xi\leq 1+\lambda$ and $\lambda\left(2+\lambda\right)\leq\left(1+\lambda\right)^{2}$ , cf. Figure 6.

Thus we just proved that:

[TABLE]

in which $T\left(\lambda\right)=C_{4}\left(\lambda\right)/120$ . This leads to $C_{\texttt{000}}\leq T\left(\lambda\right)/3,$ contributing to the bound for $|\chi(m,\ell)|$ through

[TABLE]

so that $\lambda\in(\delta,\delta^{-1})$ entails

[TABLE]

In the next sections, we shall also use the following inequalities:

[TABLE]

5.6.3 Upper bound for $C_{\texttt{001}}$

The choice $\tilde{\gamma}=\gamma-\tfrac{v^{2}}{2}$ insures that

[TABLE]

so that

[TABLE]

Then $C_{\texttt{001}}=C_{5}/3$ , and, as a function of $\lambda$ , $C_{\texttt{001}}$ is bounded for $\lambda\in[\delta,\delta^{-1}]$ , for any choice of $\delta\in(0,1)$ , just like $T\left(\lambda\right)$ , $C_{\texttt{000}}$ , and the other coefficients in relation (40). More precisely, for $C_{5}$ , for instance, we have

[TABLE]

For $\delta\leq\lambda\leq\delta^{-1}$ , and $\ell$ large enough, we have $\theta_{0}=\dfrac{\ln\ell}{\sqrt{\ell}}\leq\dfrac{\delta^{3}}{2\left(1+\delta\right)^{4}}$ , thus $|\theta|\leq\theta_{0}$ entails that

[TABLE]

and , as a consequence,

[TABLE]

Then

[TABLE]

but, since $\theta_{0}\leq 1$ ,

[TABLE]

and

[TABLE]

Finally

[TABLE]

and $C_{\texttt{001}}=\left(1+\lambda\right)^{12}$ does the trick. Finally, $\lambda\in(\delta,\delta^{-1})$ entails that the corresponding contribution to $|\chi(m,\ell)|$ is bounded as follows :

[TABLE]

5.6.4 Upper bound for $C_{\texttt{0020}}$ and for $C_{\texttt{0021}}$

First, inequality (39) holds true, for instance, when

[TABLE]

while inequality (38) holds true if $\ell|\tilde{\gamma}|\theta_{0}^{4}\leq\ln 2$ , for instance if

[TABLE]

Then

[TABLE]

holds true if one chooses:

[TABLE]

Finally, $\lambda\in(\delta,\delta^{-1})$ entails that the corresponding contribution to $|\chi(m,\ell)|$ is bounded as follows :

[TABLE]

Similarly

[TABLE]

5.6.5 Upper bound for $K_{\ell}^{(1)}$

According to (41),

[TABLE]

we have

[TABLE]

as desired, provided that:

[TABLE]

Due to the variations of $h$ , $\xi$ , this amounts to:

[TABLE]

5.6.6 Conclusion

Finally, for $\lambda\in\left(\delta,\delta^{-1}\right)$ and $\ell$ large enough (be more precise),

[TABLE]

more precisely,

[TABLE]

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Asm [03] Sören Asmussen, Applied probability and queues , second ed., Applications of Mathematics (New York), vol. 51, Springer-Verlag, New York, 2003, Stochastic Modelling and Applied Probability. MR 1978607
2Ben [99] Michel Benaïm, Dynamics of stochastic approximation algorithms , Séminaire de Probabilités, XXXIII, Lecture Notes in Math., vol. 1709, Springer, Berlin, 1999, pp. 1–68. MR 1767993
3Con [65] S. D. Conte, Elementary numerical analysis: An algorithmic approach , Mc Graw-Hill Book Co., New York-Toronto, Ont.-London, 1965. MR 0202267
4Duf [97] Marie Duflo, Random iterative models , Applications of Mathematics (New York), vol. 34, Springer-Verlag, Berlin, 1997, Translated from the 1990 French original by Stephen S. Wilson and revised by the author. MR 1485774
5FS [09] Philippe Flajolet and Robert Sedgewick, Analytic combinatorics , Cambridge University Press, Cambridge, 2009. MR 2483235
6Goo [61] I. J. Good, An asymptotic formula for the differences of the powers at zero , Ann. Math. Statist. 32 (1961), 249–256. MR 0120204
7Kor [78] A. D. Koršunov, Enumeration of finite automata , Problemy Kibernet. (1978), no. 34, 5–82, 272. MR 517814
8Kor [86] , On the number of nonisomorphic strongly connected finite automata , Elektron. Informationsverarb. Kybernet. 22 (1986), no. 9, 459–462. MR 862029

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The impatient collector

Abstract

Contents

1 Introduction

1.1 Main result

Theorem 1**.**

1.2 Context: coupon collector problem, Stirling numbers and random allocation

Theorem 2**.**

Theorem 3**.**

1.3 Asymptotics for the Stirling numbers of the second kind

Theorem 4**.**

Theorem 5** ([6]).**

Theorem 6**.**

2 The asymptotic behavior of the completion curve

2.1 A random walk related to Stirling numbers

Proposition 1**.**

Proof.

2.2 Azuma inequality

Lemma 1**.**

Proof.

Proposition 2**.**

2.3 Euler scheme

3 Coupon and automata

3.1 Koršunov’s formula

Theorem 7**.**

3.2 Basics on automata

Definition 1**.**

Definition 2**.**

Definition 3**.**

3.3 Reduction to the NorthEast corner

Proposition 3**.**

Proof.

3.4 Random walks and Pollaczek-Khintchine’s formula

Proposition 4**.**

Proof.

3.5 Tail probabilities and Hoeffding’s inequality

Proposition 5**.**

Proof.

Proposition 6**.**

Proof.

3.6 The profile of an accessible automaton

Lemma 2**.**

Theorem 8**.**

4 Saddle-point method and Stirling numbers

4.1 Generating function and Cauchy formula

4.2 Central term

4.3 Tail

Lemma 3**.**

Proof.

Proof.

5 Appendix: Some special functions

5.1 ξ\xiξ as an implicit function of λ\lambdaλ

Proposition 7**.**

Proof.

Lemma 4**.**

Proof.

5.2 Large deviation for the coupon collector

5.3 Properties of the limit path.

Proposition 8**.**

Proof.

Proposition 9**.**

Proof.

5.4 Small variations of ρ\rhoρ

Proposition 10**.**

5.5 Proof of Theorem 4

5.5.1 Small variations of ξ\xiξ

Proposition 11**.**

Proof.

5.5.2 Final argument

5.6 Explicit bounds for the second order asymptotics of {mℓ}{m\brace\ell}{ℓm​}

5.6.1 Taylor coefficients

5.6.2 Upper bound for C000C_{\texttt{000}}C000​

5.6.3 Upper bound for C001C_{\texttt{001}}C001​

Theorem 1.

Theorem 2.

Theorem 3.

Theorem 4.

Theorem 5 ([6]).

Theorem 6.

Proposition 1.

Lemma 1.

Proposition 2.

Theorem 7.

Definition 1.

Definition 2.

Definition 3.

Proposition 3.

Proposition 4.

Proposition 5.

Proposition 6.

Lemma 2.

Theorem 8.

Lemma 3.

5.1 $\xi$ as an implicit function of $\lambda$

Proposition 7.

Lemma 4.

Proposition 8.

Proposition 9.

5.4 Small variations of $\rho$

Proposition 10.

5.5.1 Small variations of $\xi$

Proposition 11.

5.6 Explicit bounds for the second order asymptotics of ${m\brace\ell}$

5.6.2 Upper bound for $C_{\texttt{000}}$

5.6.3 Upper bound for $C_{\texttt{001}}$

5.6.4 Upper bound for $C_{\texttt{0020}}$ and for $C_{\texttt{0021}}$

5.6.5 Upper bound for $K_{\ell}^{(1)}$