The $k$-cut model in deterministic and random trees

Gabriel Berzunza; Xing Shi Cai; Cecilia Holmgren

arXiv:1907.02770·math.PR·October 19, 2020·Electron. J. Comb.


TL;DR

This paper studies the k-cut number in various rooted trees, showing that after rescaling, it converges in distribution or probability, revealing universal behaviors across different tree models.

Contribution

It extends existing results by proving convergence of moments for the k-cut number in conditioned Galton-Watson trees and other tree types, regardless of offspring distribution.

Findings

1. All moments of the $k$-cut number converge after rescaling in conditioned Galton-Watson trees.

2. After rescaling, the $k$-cut number converges in probability to a constant in various trees of logarithmic height.

3. The results hold for both deterministic and random tree models.

Abstract

The $k$-cut number of rooted graphs was introduced by Cai et al. as a generalization of the classical cutting model by Meir and Moon. In this paper, we show that all moments of the $k$-cut number of conditioned Galton-Watson trees converge after proper rescaling, which implies convergence in distribution to the same limit law regardless of the offspring distribution of the trees. This extends the result of Janson. Using the same method, we also show that the $k$-cut number of various random or deterministic trees of logarithmic height converges in probability to a constant after rescaling, such as random split-trees, uniform random recursive trees, and scale-free random trees.

Equations

$$\mathbb{E}[\xi]=1\quad\text{and}\quad 0<\sigma^{2}\coloneqq\mathrm{Var}(\xi)<\infty.$$

$$\sigma^{-1/k}n^{-1+\frac{1}{2k}}\mathcal{K}(\mathbb{T}_{n})\,{\buildrel d\over{\rightarrow}}\,Z_{\rm CRT},\quad\text{as }n\rightarrow\infty,$$

$$\eta_{k,q}\coloneqq q!\int_{0}^{\infty}\cdots\int_{0}^{\infty}y_{1}(y_{1}+y_{2})\cdots(y_{1}+\cdots+y_{q})\,e^{-\frac{(y_{1}+\cdots+y_{q})^{2}}{2}}F_{q}({\bf y}_{q})\;{\rm d}y_{q}\cdots{\rm d}y_{1},$$

$$F_{q}({\bf y}_{q})\coloneqq\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}\exp\left(-\frac{y_{1}x_{1}^{k}+y_{2}x_{2}^{k}+\cdots+y_{q}x_{q}^{k}}{k!}\right){\rm d}x_{q}\cdots{\rm d}x_{2}\,{\rm d}x_{1}.$$

$$\eta_{k,1}=2^{-\frac{1}{2k}}\frac{(k!)^{\frac{1}{k}}}{k}\Gamma\left(\frac{1}{k}\right)\Gamma\left(1-\frac{1}{2k}\right).$$

$$\eta_{k,q}=q!\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}\mathbb{E}\left[\exp\left(-\frac{\sum_{i=1}^{q}(L_{i}^{\rm CRT}-L_{i-1}^{\rm CRT})x_{i}^{k}}{k!}\right)\right]\;{\rm d}\overleftarrow{{\bf x}}_{q},$$

$$G_{r,v}<\min\{G_{k,u}:u\in\mathbb{T}_{n}\text{ and }u\text{ is a strict ancestor of }v\}.$$

$$I_{r,v}\coloneqq\llbracket G_{r,v}<\min\{G_{k,u}:u\in\mathbb{T}_{n}\text{ and }u\text{ is a strict ancestor of }v\}\rrbracket,$$

$$\mathcal{K}(\mathbb{T}_{n})\overset{d}{=}\sum_{1\leq r\leq k}\mathcal{K}_{r}(\mathbb{T}_{n}),$$

$$\left(n^{-1/2}V_{n}(2(n-1)t),\,t\in[0,1]\right)\,{\buildrel d\over{\rightarrow}}\,2\sigma^{-1}B^{\rm ex},\quad\text{as }n\rightarrow\infty.$$

$$\mathbb{E}[\mathcal{K}_{r}(\mathbb{T}_{n})\mid\mathbb{T}_{n}]\approx\frac{C_{r,k}}{n^{-1+\frac{r}{2k}}}\int_{0}^{1}\left(\frac{V_{n}(2(n-1)t)}{\sqrt{n}}\right)^{-\frac{r}{k}}{\rm d}t\approx\frac{C_{r,k}}{n^{-1+\frac{r}{2k}}}\left(\frac{\sigma}{2}\right)^{\frac{r}{k}}\int_{0}^{1}\frac{{\rm d}t}{B^{\rm ex}(t)^{r/k}},$$

$$\sigma^{-r/k}n^{-1+\frac{r}{2k}}\mathbb{E}[\mathcal{K}_{r}(\mathbb{T}_{n})]\sim C_{r,k}\,\mathbb{E}\left[\int_{0}^{1}(2B^{\rm ex}(t))^{-r/k}\,{\rm d}t\right],\quad\text{as }n\rightarrow\infty,$$

$$L_{f}(t_{1},\dots,t_{q})\coloneqq\sum_{i=1}^{q}f(t_{(i)})-\sum_{i=1}^{q-1}\inf_{t\in[t_{(i)},t_{(i+1)}]}f(t),$$

$$D_{f}(t_{1})\coloneqq L_{f}(t_{1})\quad\text{and}\quad D_{f}(t_{1},\dots,t_{q})\coloneqq L_{f}(t_{1},\dots,t_{q})-L_{f}(t_{1},\dots,t_{q-1}),\quad\text{for }q\geq 2.$$

$$G_{f}({\bf t}_{q},{\bf x}_{q})\coloneqq\exp\left(-\frac{D_{f}(t_{1})x_{1}^{k}+\cdots+D_{f}(t_{1},\dots,t_{q})x_{q}^{k}}{k!}\right),$$

$$m_{0}(f)\coloneqq 1\quad\text{and}\quad m_{q}(f)\coloneqq q!\int_{0}^{1}\int_{0}^{1}\cdots\int_{0}^{1}\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}\overleftarrow{{\bf x}}_{q}\;{\rm d}\overleftarrow{{\bf t}}_{q},\quad\text{for }q\geq 1,$$

$$\int_{[0,\infty)}x^{q}\,\nu_{f}({\rm d}x)=m_{q}(f),\quad\text{for }q\in\mathbb{Z}_{\geq 0}.$$

$$H_{f,q}({\bf t}_{q})\coloneqq\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}\overleftarrow{{\bf x}}_{q}.$$

$$H_{f,q}({\bf t}_{q})=\int_{0}^{\infty}\int_{x_{q}}^{\infty}\cdots\int_{x_{2}}^{\infty}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}{\bf x}_{q},$$

$$H_{f,q}({\bf t}_{q})=\int_{[0,\infty)^{q}}\exp\left(-\frac{1}{k!}\sum_{i=1}^{q}D_{f}(t_{1},\dots,t_{i})\Big(\sum_{j=i}^{q}w_{j}\Big)^{k}\right){\rm d}{\bf w}_{q},$$

$$H_{f,q}({\bf t}_{q})$$

$$0\leq m_{q}(f)\leq q!\,\Gamma(1+1/k)^{q}\,\Gamma(1+k)^{q/k}\left(\int_{0}^{1}f(t)^{-1/k}\,{\rm d}t\right)^{q}.$$

$$V_{n}(i)\coloneqq d_{n}(\psi_{n}(i)),\quad i\in\{0,\dots,2(n-1)\},$$

$$\widetilde{V}_{n}(t)\coloneqq V_{n}(2(n-1)t)\quad\text{and}\quad\widehat{V}_{n}(t)\coloneqq\lceil V_{n}(2(n-1)t)\rceil,$$

$$\max_{v\in\mathbb{T}_{n}}d_{n}(v)=\sup_{t\in[0,2(n-1)]}V_{n}(t)=\sup_{t\in[0,1]}\widetilde{V}_{n}(t).$$

$$n^{-q}a_{n}^{-q/k}\,\mathbb{E}[\mathcal{K}_{1}(\mathbb{T}_{n})^{q}]\rightarrow m_{q}(f),$$

$$G_{n}({\bf v}_{q},{\bf x}_{q})\coloneqq\exp\left(-\frac{D_{n}(v_{1})x_{1}^{k}+\cdots+D_{n}(v_{1},\dots,v_{q})x_{q}^{k}}{k!}\right),$$

$$\Gamma(k,x)=\int_{x}^{\infty}t^{k-1}e^{-t}\,{\rm d}t,\quad\text{for }x\geq 0.$$

$$\mathbb{P}({\rm Gamma}(k)>x)^{D_{n}(v_{1},\dots,v_{q})}=\left(\frac{\Gamma(k,x)}{\Gamma(k)}\right)^{D_{n}(v_{1},\dots,v_{q})}=\left(1+O\left(a_{n}^{\frac{1}{2k}}\right)\right)\exp\left(-\frac{D_{n}(v_{1},\dots,v_{q})x^{k}}{k!}\right),$$


Full text

The $k$ -cut model in deterministic and random trees

Gabriel Berzunza, Xing Shi Cai and Cecilia Holmgren

Department of Mathematics, Uppsala University, Sweden

Abstract

The $k$-cut number of rooted graphs was introduced by Cai et al. [12] as a generalization of the classical cutting model by Meir and Moon [30]. In this paper, we show that all moments of the $k$-cut number of conditioned Galton-Watson trees converge after proper rescaling, which implies convergence in distribution to the same limit law regardless of the offspring distribution of the trees. This extends the result of Janson [25]. Using the same method, we also show that the $k$-cut number of various random or deterministic trees of logarithmic height converges in probability to a constant after rescaling, such as random split-trees, uniform random recursive trees, and scale-free random trees.

Key words and phrases: $k$ -cut, cutting, conditioned Galton-Watson trees, split trees, preferential attachment trees

1 Introduction and main result

In order to measure how difficult it is to destroy a resilient network, Cai et al. [12] introduced a generalization of the cut model of Meir and Moon [30] where each vertex (or edge) needs to be cut $k\in{\mathbb{N}}$ times (instead of only once) before it is destroyed. More precisely, consider that the resilient network is a rooted tree ${\mathbb{T}}_{n}$ with $n\in{\mathbb{N}}$ vertices. We assume that sibling vertices in ${\mathbb{T}}_{n}$ are ordered. (Such trees are sometimes referred to as plane trees.) We destroy it by removing its vertices as follows:

Step 1: Choose a vertex uniformly at random from the component that contains the root and cut the selected vertex once.

Step 2: If this vertex has been cut $k$ times, remove the vertex together with the edges attached to it from the tree.

Step 3: If the root has been removed, then stop. Otherwise, go to Step 1.

We let $\mathcal{K}_{k}({\mathbb{T}}_{n})$ denote the (random) total number of cuts needed to end this procedure, which we call the $k$-cut number, i.e., $\mathcal{K}_{k}({\mathbb{T}}_{n})$ models how much effort it takes to destroy the network. (For simplicity, we will omit the subscript $k$ and write $\mathcal{K}({\mathbb{T}}_{n})$.) It should be clear that one can define analogously an edge deletion version of the previous algorithm, where one needs to cut an edge $k$ times before removing it from the root component. Then, one would be interested in the number $\mathcal{K}_{e}({\mathbb{T}}_{n})$ of edge cuts needed to isolate the root of ${\mathbb{T}}_{n}$.
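The three steps above can be simulated directly. The following is a minimal sketch, not the authors' code: the function name `k_cut_number` and the parent-array encoding of the tree (with `-1` marking the root) are assumptions made here for illustration.

```python
import random


def k_cut_number(parent, k, rng=random):
    """Simulate one run of the vertex k-cut procedure and return the number of cuts.

    parent[v] is the parent of vertex v; the root is the vertex with parent -1.
    (This encoding is an assumption of the sketch, not notation from the paper.)
    """
    n = len(parent)
    children = [[] for _ in range(n)]
    root = None
    for v, p in enumerate(parent):
        if p == -1:
            root = v
        else:
            children[p].append(v)

    cuts_received = [0] * n
    alive = list(range(n))                     # vertices in the root component
    position = {v: i for i, v in enumerate(alive)}
    total_cuts = 0

    def drop(v):
        # Swap-remove v from the list of alive vertices in O(1).
        i = position.pop(v)
        last = alive.pop()
        if last != v:
            alive[i] = last
            position[last] = i

    while True:
        # Step 1: cut a uniformly chosen vertex of the root component once.
        v = alive[rng.randrange(len(alive))]
        cuts_received[v] += 1
        total_cuts += 1
        if cuts_received[v] == k:
            # Step 3: stop as soon as the root has been removed.
            if v == root:
                return total_cuts
            # Step 2: removing v disconnects its whole subtree from the root.
            stack = [v]
            while stack:
                u = stack.pop()
                if u in position:
                    drop(u)
                    stack.extend(children[u])
```

For example, `k_cut_number([-1, 0, 1, 2, 3], 2)` draws one sample of the $2$-cut number of a path with five vertices; the result always lies between $k$ and $kn$.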

The case $k=1$ (i.e., the traditional cutting model of Meir and Moon [30]) has been well-studied by several authors. More precisely, Meir and Moon estimated the first and second moment of the $1$ -cut number in the cases when ${\mathbb{T}}_{n}$ is a Cayley tree [30] and a recursive tree [31]. Subsequently, several weak limit theorems for the $1$ -cut number have been obtained for Cayley trees (Panholzer [33, 34]), complete binary trees (Janson [24]), conditioned Galton-Watson trees (Janson [25] and Addario-Berry et al. [1]), recursive trees (Drmota et al. [16], Iksanov and Möhle [23]), binary search trees (Holmgren [19]) and split trees (Holmgren [20]). In the general case $k\geq 1$ , the authors in [12] established first moment estimates of $\mathcal{K}({\mathbb{T}}_{n})$ for families of deterministic and random trees, such as paths, complete binary trees, split trees, random recursive trees and conditioned Galton-Watson trees. In particular, the authors in [12] have proven a weak limit theorem for $\mathcal{K}({\mathbb{T}}_{n})$ when ${\mathbb{T}}_{n}$ is a path consisting of $n$ vertices. More recently, Cai and Holmgren [11] also obtained a weak limit theorem in the case when ${\mathbb{T}}_{n}$ is a complete binary tree.

In this work, we continue the investigation of this general cutting-down procedure in conditioned Galton-Watson trees and show that ${\mathcal{K}}({\mathbb{T}}_{n})$ , after a proper rescaling, converges in distribution to a non-degenerate random variable. More precisely, let $\xi$ be a non-negative integer-valued random variable such that

$$\mathbb{E}[\xi]=1\quad\text{and}\quad 0<\sigma^{2}\coloneqq\mathrm{Var}(\xi)<\infty.\tag{1}$$

We further assume that the distribution of $\xi$ is aperiodic. This last condition is to avoid unnecessary complications, but our results can be extended to the periodic case. We then consider a Galton-Watson process with (critical) offspring distribution $\xi$. Let ${\mathbb{T}}_{n}$ be the family tree conditioned on its number of vertices being $n\in{\mathbb{N}}$, provided that this conditioning makes sense. The main result of this paper is the following. We write ${\,{\buildrel d\over{\rightarrow}}\,}$ to denote convergence in distribution. (In the rest of the paper CRT stands for Continuum Random Tree.)

Theorem 1.

Let $k\in{\mathbb{N}}$ . Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). Then,

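$$\sigma^{-1/k}n^{-1+\frac{1}{2k}}\mathcal{K}(\mathbb{T}_{n})\,{\buildrel d\over{\rightarrow}}\,Z_{\rm CRT},\quad\text{as }n\rightarrow\infty,$$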

where $Z_{\rm CRT}$ is a non-degenerate random variable whose law is determined entirely by its moments: ${\mathbb{E}}[Z_{\rm CRT}^{0}]=1$ , and for $q\in{\mathbb{N}}$ , ${\mathbb{E}}[Z_{\rm CRT}^{q}]=\eta_{k,q}$ with

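$$\eta_{k,q}\coloneqq q!\int_{0}^{\infty}\cdots\int_{0}^{\infty}y_{1}(y_{1}+y_{2})\cdots(y_{1}+\cdots+y_{q})\,e^{-\frac{(y_{1}+\cdots+y_{q})^{2}}{2}}F_{q}({\bf y}_{q})\;{\rm d}y_{q}\cdots{\rm d}y_{1},$$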

where ${\bf y}_{q}=(y_{1},\dots,y_{q})\in{\mathbb{R}}_{+}^{q}$ and

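$$F_{q}({\bf y}_{q})\coloneqq\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}\exp\left(-\frac{y_{1}x_{1}^{k}+y_{2}x_{2}^{k}+\cdots+y_{q}x_{q}^{k}}{k!}\right){\rm d}x_{q}\cdots{\rm d}x_{2}\,{\rm d}x_{1}.$$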

Furthermore, if ${\mathbb{E}}[\xi^{p}]<\infty$ for every $p\in{\mathbb{Z}}_{\geq 0}$ , then for every $q\in{\mathbb{Z}}_{\geq 0}$ , $\sigma^{-q/k}n^{-q+q/2k}{\mathbb{E}}[{\mathcal{K}}({\mathbb{T}}_{n})^{q}]\rightarrow{\mathbb{E}}[Z_{{\rm CRT}}^{q}]$ as $n\rightarrow\infty$ .

In the case $k=1$ , Theorem 1 reduces to a $Z_{{\rm CRT}}$ having a Rayleigh distribution with density $xe^{-x^{2}/2}$ , for $x\in\mathbb{R}_{+}$ . More precisely, one can verify that $\eta_{1,q}=2^{q/2}\Gamma(1+q/2)$ , for $q\in{\mathbb{Z}}_{\geq 0}$ , which are the moments of a random variable with the Rayleigh distribution; in this paper $\Gamma(\cdot)$ denotes the well-known gamma function. As we mentioned earlier, the case $k=1$ has been shown in [25, Theorem 1.6] (or Addario-Berry et al. [1]). We henceforth assume throughout this paper that $k\geq 2$ .
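The identification of the case $k=1$ with the Rayleigh distribution can be sanity-checked numerically. The sketch below compares the closed form $\eta_{1,q}=2^{q/2}\Gamma(1+q/2)$ with a midpoint-rule approximation of $\int_{0}^{\infty}x^{q}\,xe^{-x^{2}/2}\,{\rm d}x$; the function names, the truncation point `upper` and the number of `steps` are choices made here for illustration only.

```python
import math


def rayleigh_moment_numeric(q, upper=12.0, steps=20000):
    """Midpoint-rule approximation of the q-th moment of the density x*exp(-x^2/2).

    The integral is truncated at `upper`; the neglected tail is negligible
    for small q because the integrand decays like exp(-x^2/2).
    """
    h = upper / steps
    total = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        total += x ** (q + 1) * math.exp(-x * x / 2.0)
    return total * h


def eta_1q(q):
    """Closed form 2^(q/2) * Gamma(1 + q/2) stated in the text for k = 1."""
    return 2.0 ** (q / 2.0) * math.gamma(1.0 + q / 2.0)


if __name__ == "__main__":
    for q in range(6):
        print(q, rayleigh_moment_numeric(q), eta_1q(q))
```

The two columns printed by the usage loop should agree to several decimal places, which is a numerical check rather than a proof.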

It is also important to mention that we could not find a simpler expression (in general) for the moments $\eta_{k,q}$ except for some particular instances. For $q=1$ , we have

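$$\eta_{k,1}=2^{-\frac{1}{2k}}\frac{(k!)^{\frac{1}{k}}}{k}\Gamma\left(\frac{1}{k}\right)\Gamma\left(1-\frac{1}{2k}\right).$$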

Then Theorem 1 provides a proof of [12, Lemma 4.10], where an estimation of the first moment of ${\mathcal{K}}({\mathbb{T}}_{n})$ was first announced but whose proof was left to the reader. One can also compute with the help of Mathematica the second moment of $Z_{{\rm CRT}}$ or other particular examples. However, the expressions are too involved and we decided not to include them.
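The closed form for $\eta_{k,1}$ can be recovered from the definition of $\eta_{k,q}$ at $q=1$, where $F_{1}(y)=\int_{0}^{\infty}\exp(-yx^{k}/k!)\,{\rm d}x$. The following is a short sketch of that computation; the substitutions $u=yx^{k}/k!$ and $s=y^{2}/2$ are auxiliary variables introduced here and do not appear in the paper.

```latex
\begin{align*}
F_{1}(y) &= \int_{0}^{\infty}\exp\left(-\frac{y x^{k}}{k!}\right){\rm d}x
          = \left(\frac{k!}{y}\right)^{1/k}\frac{1}{k}\int_{0}^{\infty}u^{\frac{1}{k}-1}e^{-u}\,{\rm d}u
          = \frac{(k!)^{1/k}}{k}\,\Gamma\left(\frac{1}{k}\right)y^{-1/k},\\
\eta_{k,1} &= \int_{0}^{\infty}y\,e^{-y^{2}/2}F_{1}(y)\,{\rm d}y
            = \frac{(k!)^{1/k}}{k}\,\Gamma\left(\frac{1}{k}\right)\int_{0}^{\infty}y^{1-\frac{1}{k}}e^{-y^{2}/2}\,{\rm d}y\\
           &= \frac{(k!)^{1/k}}{k}\,\Gamma\left(\frac{1}{k}\right)\int_{0}^{\infty}(2s)^{-\frac{1}{2k}}e^{-s}\,{\rm d}s
            = 2^{-\frac{1}{2k}}\frac{(k!)^{1/k}}{k}\,\Gamma\left(\frac{1}{k}\right)\Gamma\left(1-\frac{1}{2k}\right).
\end{align*}
```

For $k=1$ the last expression equals $\sqrt{\pi/2}$, the mean of the Rayleigh distribution, in agreement with $\eta_{1,1}=2^{1/2}\Gamma(3/2)$.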

On the other hand, let $(U_{1},\dots,U_{q})$ be $q$ i.i.d. leaves of a Brownian CRT and define the vector $(L_{0}^{\rm CRT},L_{1}^{\rm CRT},\dots,L_{q}^{\rm CRT})$ where $L_{0}^{\rm CRT}=0$ and $L_{i}^{\rm CRT}$ is the total length of the minimal subtree of a Brownian CRT which connects its root and the leaves $U_{1},\dots,U_{i}$; see [3, Lemma 21], from which one can deduce explicitly the distribution of $(L_{0}^{\rm CRT},L_{1}^{\rm CRT},\dots,L_{q}^{\rm CRT})$. From the proof of Theorem 1, we obtain, for $q\in{\mathbb{N}}$, that

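$$\eta_{k,q}=q!\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}\mathbb{E}\left[\exp\left(-\frac{\sum_{i=1}^{q}(L_{i}^{\rm CRT}-L_{i-1}^{\rm CRT})x_{i}^{k}}{k!}\right)\right]\;{\rm d}\overleftarrow{{\bf x}}_{q},$$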

where $\overleftarrow{{\bf x}}_{q}=(x_{q},\dots,x_{1})\in{\mathbb{R}}_{+}^{q}$. This suggests that it ought to be possible to build the random variable $Z_{\rm CRT}$ by some construction that can be interpreted as the $k$-cut model on the Brownian CRT defined by Aldous [2, 3]. The appearance of the Brownian CRT in this framework should not come as a surprise since it is well-known that if we assign length $n^{-1/2}$ to each edge of the Galton-Watson tree $\mathbb{T}_{n}$, then the latter converges weakly to a Brownian CRT as $n\rightarrow\infty$. We believe that this connection can be exploited even more than the one used in this work in order to obtain the precise distribution of $Z_{\rm CRT}$. For example, ideas from [6] and [1] could be useful to answer this question.

The approach used in this work consists of implementing an extension of the idea of Janson [25], which was used in [12], in order to study the $k$ -cut model on deterministic and random trees. The authors in [12] introduced an equivalent model that allows them to define ${\mathcal{K}}({\mathbb{T}}_{n})$ in terms of the number of records in ${\mathbb{T}}_{n}$ when vertices are assigned random labels. More precisely, let $(E_{i,v})_{i\geq 1,v\in{\mathbb{T}}_{n}}$ be a sequence of independent exponential random variables with parameter $1$ ; ${\rm Exp}(1)$ for short. Let $G_{r,v}\coloneqq\sum_{1\leq i\leq r}E_{i,v}$ , for $r\in{\mathbb{N}}$ and $v\in{\mathbb{T}}_{n}$ . Clearly, $G_{r,v}$ has a gamma distribution with parameters $(r,1)$ , which we denote by Gamma $(r)$ . Imagine that each vertex $v\in{\mathbb{T}}_{n}$ has an alarm clock and $v$ ’s clock fires at times $(G_{r,v})_{r\geq 1}$ . If we cut a vertex when its alarm clock fires, then due to the memoryless property of exponential random variables, we are actually choosing a vertex uniformly at random to cut. However, this also means that we are cutting vertices that have already been removed from the tree. Thus, for a cut on vertex $v$ at time $G_{r,v}$ (for some $r\in\{1,\dots,k\}$ ) to be counted in ${\mathcal{K}}({\mathbb{T}}_{n})$ , none of its strict ancestors can already have been cut $k$ times, i.e.,

$$G_{r,v}<\min\{G_{k,u}:u\in\mathbb{T}_{n}\text{ and }u\text{ is a strict ancestor of }v\}.$$

When the previous event happens, we say that $G_{r,v}$ , or simply $v$ , is an $r$ -record and let

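$$I_{r,v}\coloneqq\llbracket G_{r,v}<\min\{G_{k,u}:u\in\mathbb{T}_{n}\text{ and }u\text{ is a strict ancestor of }v\}\rrbracket,$$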

where $\llbracket\cdot\rrbracket$ denotes the Iverson bracket, i.e., $\llbracket S\rrbracket=1$ if the statement $S$ is true and $\llbracket S\rrbracket=0$ otherwise. Let ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ be the number of $r$ -records, i.e., ${\mathcal{K}}_{r}({\mathbb{T}}_{n})\coloneqq\sum_{v\in{\mathbb{T}}_{n}}I_{r,v}$ . Then, it should be clear that

$$\mathcal{K}(\mathbb{T}_{n})\overset{d}{=}\sum_{1\leq r\leq k}\mathcal{K}_{r}(\mathbb{T}_{n}),\tag{6}$$

where $\overset{d}{=}$ denotes equal in distribution.
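The record formulation lends itself to a one-pass simulation: give every vertex $k$ successive alarm times $G_{1,v}<\dots<G_{k,v}$ and count an $r$-record whenever $G_{r,v}$ is smaller than the $k$-th alarm of every strict ancestor. The sketch below is an illustration under assumptions made here: the function name `record_counts` and the parent-array encoding (with `parent[0] == -1` and `parent[v] < v`, so that parents are processed before children) are not from the paper.

```python
import random


def record_counts(parent, k, rng=random):
    """Return [K_1, ..., K_k], the numbers of r-records for r = 1, ..., k.

    Assumes vertex 0 is the root (parent[0] == -1) and parent[v] < v otherwise.
    """
    n = len(parent)
    counts = [0] * k
    last_alarm = [0.0] * n            # G_{k,v}: time of the k-th alarm of v
    threshold = [float("inf")] * n    # min of G_{k,u} over strict ancestors u of v
    for v in range(n):
        p = parent[v]
        if p != -1:
            threshold[v] = min(threshold[p], last_alarm[p])
        t = 0.0
        for r in range(k):
            t += rng.expovariate(1.0)  # t is now G_{r+1,v}, a Gamma(r+1) variable
            if t < threshold[v]:
                counts[r] += 1         # v is an (r+1)-record
        last_alarm[v] = t
    return counts
```

Summing the returned list gives one sample with the same distribution as the $k$-cut number. Since the alarm times of a vertex are increasing, the counts are non-increasing in $r$, and the root always contributes one record for every $r$.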

Loosely speaking, we then consider the well-known depth-first search walk or contour function $V_{n}=(V_{n}(t),t\in[0,2(n-1)])$ of the (ordered) tree ${\mathbb{T}}_{n}$ as depicted in Figure 1, that is, $V_{n}(t)$ is “the depth of the $t$-th vertex” visited in this walk; this will be made precise in the next section. As is well-known (see Aldous [3, Theorem 23 with Remark 2] or [29, Theorem 1]), when $\mathbb{T}_{n}$ is a conditioned Galton-Watson tree with offspring distribution satisfying (1), we have that

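$$\left(n^{-1/2}V_{n}(2(n-1)t),\,t\in[0,1]\right)\,{\buildrel d\over{\rightarrow}}\,2\sigma^{-1}B^{\rm ex},\quad\text{as }n\rightarrow\infty,$$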

in $C([0,1],{\mathbb{R}}_{+})$, with its usual topology, and where $B^{\rm ex}=(B^{\rm ex}(t),t\in[0,1])$ is a standard normalized Brownian excursion. It has been shown in [12, Lemma 2.1] that ${\mathbb{E}}[I_{r,v}]\sim C_{r,k}d_{n}(v)^{-r/k}$, for some (explicit) constant $C_{r,k}>0$, where $d_{n}(v)$ is the depth of the vertex $v\in\mathbb{T}_{n}$. (Here, for two sequences of non-negative real numbers $(A_{n})_{n\geq 1}$ and $(B_{n})_{n\geq 1}$ such that $B_{n}>0$, we write $A_{n}\sim B_{n}$ if $A_{n}/B_{n}\rightarrow 1$ as $n\rightarrow\infty$.) Let $\circ$ denote the root of $\mathbb{T}_{n}$. Thus, informally

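$$\mathbb{E}[\mathcal{K}_{r}(\mathbb{T}_{n})\mid\mathbb{T}_{n}]\approx\frac{C_{r,k}}{n^{-1+\frac{r}{2k}}}\int_{0}^{1}\left(\frac{V_{n}(2(n-1)t)}{\sqrt{n}}\right)^{-\frac{r}{k}}{\rm d}t\approx\frac{C_{r,k}}{n^{-1+\frac{r}{2k}}}\left(\frac{\sigma}{2}\right)^{\frac{r}{k}}\int_{0}^{1}\frac{{\rm d}t}{B^{\rm ex}(t)^{r/k}},$$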

when $n$ is large. One then expects that

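$$\sigma^{-r/k}n^{-1+\frac{r}{2k}}\mathbb{E}[\mathcal{K}_{r}(\mathbb{T}_{n})]\sim C_{r,k}\,\mathbb{E}\left[\int_{0}^{1}(2B^{\rm ex}(t))^{-r/k}\,{\rm d}t\right],\quad\text{as }n\rightarrow\infty,$$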

which coincides with the right-hand side of (3) when $r=q=1$. Note that this informal computation suggests that ${\mathbb{E}}\left[{\mathcal{K}}_{r}({\mathbb{T}}_{n})\right]=O(n^{1-\frac{r}{2k}})$, for $r\in\{1,\dots,k\}$. (Here, for two sequences of non-negative real numbers $(A_{n})_{n\geq 1}$ and $(B_{n})_{n\geq 1}$ such that $B_{n}>0$, we write $A_{n}=O(B_{n})$ if $\limsup_{n\rightarrow\infty}A_{n}/B_{n}<\infty$.) As a consequence, Markov’s inequality implies that $n^{-1+\frac{1}{2k}}{\mathcal{K}}_{r}({\mathbb{T}}_{n})\rightarrow 0$ in probability, as $n\rightarrow\infty$, for $r\in\{2,\dots,k\}$. As shown later, by the identity in (6), it would be enough to prove Theorem 1 for ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ instead of ${\mathcal{K}}({\mathbb{T}}_{n})$.

In the rest of the paper, Section 2 and Section 3 make the above argument precise and extend it to higher moments. This will allow us to use the method of moments for proving Theorem 1. In Section 4, we also apply the same idea to get all moments of the number of records in paths and several types of trees of logarithmic height, e.g., complete binary trees, split trees, uniform random recursive trees and scale-free trees.

2 Preliminary results

The purpose of this section is to establish a general convergence result for the number of $1$ -records ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ of a deterministic rooted ordered tree ${\mathbb{T}}_{n}$ . The results of this section can also be viewed as a generalization of those in Janson [25] and in Cai, et al. [12]. Furthermore, these results will allow us to study the convergence of ${\mathcal{K}}({\mathbb{T}}_{n})$ not only for conditioned Galton-Watson trees, but also for other classes of random trees in Section 4. We start by defining a probability measure through a continuous function in the same spirit as in [25, Theorem 1.9]. Let $I\subseteq{\mathbb{R}}_{+}$ be an interval. For a function $f:I\rightarrow{\mathbb{R}}_{+}$ and $t_{1},\dots,t_{q}\in I$ with $q\in{\mathbb{N}}$ , we define

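$$L_{f}(t_{1},\dots,t_{q})\coloneqq\sum_{i=1}^{q}f(t_{(i)})-\sum_{i=1}^{q-1}\inf_{t\in[t_{(i)},t_{(i+1)}]}f(t),$$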

where $t_{(1)},\dots,t_{(q)}$ are $t_{1},\dots,t_{q}$ arranged in nondecreasing order. Notice that $L_{f}(t_{1},\dots,t_{q})$ is symmetric in $t_{1},\dots,t_{q}$ and that $L_{f}(t)=f(t)$ for $t\in I$ . Define

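$$D_{f}(t_{1})\coloneqq L_{f}(t_{1})\quad\text{and}\quad D_{f}(t_{1},\dots,t_{q})\coloneqq L_{f}(t_{1},\dots,t_{q})-L_{f}(t_{1},\dots,t_{q-1}),\quad\text{for }q\geq 2.$$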
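To make $L_{f}$ and $D_{f}$ concrete: $L_{f}$ adds the values of $f$ at the sorted points and subtracts the infimum of $f$ between consecutive points, and $D_{f}$ is the increment of $L_{f}$ caused by the last point. The sketch below evaluates both on a function sampled on a uniform grid. This discretisation is an assumption of the example: for a piecewise-linear interpolation between grid points, the infimum over an interval with grid endpoints is attained at a grid point. The names `L` and `D` and the index-based arguments are illustrative.

```python
def L(values, indices):
    """Discrete analogue of L_f(t_1, ..., t_q).

    values[i] plays the role of f at the i-th grid point and `indices`
    holds the grid positions of t_1, ..., t_q (in any order).
    """
    pts = sorted(indices)
    total = sum(values[i] for i in pts)
    for a, b in zip(pts, pts[1:]):
        # Subtract the minimum of f on the interval [t_(i), t_(i+1)].
        total -= min(values[a:b + 1])
    return total


def D(values, indices):
    """Discrete analogue of D_f: the increment of L_f due to the last point."""
    if len(indices) == 1:
        return L(values, indices)
    return L(values, indices) - L(values, indices[:-1])
```

With `values = [0, 1, 2, 1, 2, 1, 0]`, the call `L(values, [2, 4])` returns `3`: two points at height 2 separated by a dip to height 1 span three units of length, which is the tree picture behind the definition.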

We also consider the functional

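$$G_{f}({\bf t}_{q},{\bf x}_{q})\coloneqq\exp\left(-\frac{D_{f}(t_{1})x_{1}^{k}+\cdots+D_{f}(t_{1},\dots,t_{q})x_{q}^{k}}{k!}\right),$$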

for ${\bf x}_{q}=(x_{1},\dots,x_{q})\in{\mathbb{R}}_{+}^{q}$ and ${\bf t}_{q}=(t_{1},\dots,t_{q})\in I^{q}$ . If $I=[0,1]$ , we further define, for $q\in{\mathbb{N}}$ ,

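$$m_{0}(f)\coloneqq 1\quad\text{and}\quad m_{q}(f)\coloneqq q!\int_{0}^{1}\int_{0}^{1}\cdots\int_{0}^{1}\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}\overleftarrow{{\bf x}}_{q}\;{\rm d}\overleftarrow{{\bf t}}_{q},\quad\text{for }q\geq 1,$$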

where $\overleftarrow{{\bf x}}_{q}=(x_{q},\dots,x_{1})$ and $\overleftarrow{{\bf t}}_{q}=(t_{q},\dots,t_{1})$.

Theorem 2.

Let $k\in{\mathbb{N}}$ . Suppose that $f\in C([0,1],{\mathbb{R}}_{+})$ is such that $\int_{0}^{1}f(t)^{-1/k}{\rm d}t<\infty$ . Then there exists a unique probability measure $\nu_{f}$ on $[0,\infty)$ with finite moments given by

$$\int_{[0,\infty)}x^{q}\,\nu_{f}({\rm d}x)=m_{q}(f),\quad\text{for }q\in\mathbb{Z}_{\geq 0}.$$

Proof.

We only prove uniqueness here. The proof for existence follows along the lines of [25, Proof of Theorem 1.9, Pages 18-19] and details are left to the interested reader. Informally speaking, the idea in [25] for the proof of existence is to build a sequence of functions that satisfy the conditions of Lemma 1 below. Define the function

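$$H_{f,q}({\bf t}_{q})\coloneqq\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}\overleftarrow{{\bf x}}_{q}.$$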

By changing the order of integration, we obtain that

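$$H_{f,q}({\bf t}_{q})=\int_{0}^{\infty}\int_{x_{q}}^{\infty}\cdots\int_{x_{2}}^{\infty}G_{f}({\bf t}_{q},{\bf x}_{q})\;{\rm d}{\bf x}_{q},$$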

for ${\bf x}_{q}=(x_{1},\dots,x_{q})\in{\mathbb{R}}_{+}^{q}$ and ${\bf t}_{q}=(t_{1},\dots,t_{q})\in[0,1]^{q}$ . By making the change of variables $x_{q}=w_{q},x_{q-1}=w_{q}+w_{q-1},\dots,x_{1}=w_{q}+\cdots+w_{1}$ , we see that

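$$H_{f,q}({\bf t}_{q})=\int_{[0,\infty)^{q}}\exp\left(-\frac{1}{k!}\sum_{i=1}^{q}D_{f}(t_{1},\dots,t_{i})\Big(\sum_{j=i}^{q}w_{j}\Big)^{k}\right){\rm d}{\bf w}_{q},$$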

where ${\bf w}_{q}=(w_{1},\dots,w_{q})\in{\mathbb{R}}_{+}^{q}$ . From the inequality $(x_{1}+\cdots+x_{q})^{k}\geq x_{1}^{k}+\cdots+x_{q}^{k}$ , we observe that

[TABLE]

where for the last inequality we have used the fact that $L_{f}(t_{1},\dots,t_{i})\geq\max_{1\leq j\leq i}f(t_{j})$, for $1\leq i\leq q$. The latter follows from the symmetry of $L_{f}$; see [25, Lemma 4.1] for a proof. Then, the previous inequality allows us to conclude that

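$$0\leq m_{q}(f)\leq q!\,\Gamma(1+1/k)^{q}\,\Gamma(1+k)^{q/k}\left(\int_{0}^{1}f(t)^{-1/k}\,{\rm d}t\right)^{q}.$$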

We conclude that there exists $a>0$ such that $\sum_{q=0}^{\infty}m_{q}(f)\frac{x^{q}}{q!}<\infty$, for $0\leq x<a$. Then a probability measure with moments $m_{q}(f)$ has a finite moment generating function in a neighbourhood of $0$. It is well-known that this implies that the probability measure is unique; see, e.g., [18, Section 4.10]. ∎
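For $q=1$ the moment can be computed by hand, which shows why the integrability condition on $f^{-1/k}$ is the natural one. The following sketch reads the formula for $m_{q}(f)$ at $q=1$, where $D_{f}(t_{1})=L_{f}(t_{1})=f(t_{1})$, and uses the substitution $u=f(t)x^{k}/k!$ in the inner integral (an auxiliary step added here).

```latex
\begin{align*}
m_{1}(f) &= \int_{0}^{1}\int_{0}^{\infty}\exp\left(-\frac{f(t)\,x^{k}}{k!}\right){\rm d}x\,{\rm d}t
          = \int_{0}^{1}\left(\frac{k!}{f(t)}\right)^{1/k}\frac{1}{k}\,\Gamma\left(\frac{1}{k}\right){\rm d}t \\
         &= \Gamma\left(1+\frac{1}{k}\right)(k!)^{1/k}\int_{0}^{1}f(t)^{-1/k}\,{\rm d}t .
\end{align*}
```

Since $\Gamma(1+k)=k!$, this coincides with the upper bound $q!\,\Gamma(1+1/k)^{q}\,\Gamma(1+k)^{q/k}\big(\int_{0}^{1}f(t)^{-1/k}\,{\rm d}t\big)^{q}$ at $q=1$, so the bound is attained in that case.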

Consider a rooted ordered tree ${\mathbb{T}}_{n}$ with root $\circ$ and $n\in{\mathbb{N}}$ vertices. We now explain how ${\mathbb{T}}_{n}$ can be encoded by a continuous function. We define the so-called depth-first search function [2, page 260], $\psi_{n}:\{0,1,\dots,2(n-1)\}\rightarrow\{\,\text{vertices of}\,\,{\mathbb{T}}_{n}\}$ such that $\psi_{n}(i)$ is the $(i+1)$-th vertex visited in a depth-first walk on the tree starting from the root $\circ$. Note that $\psi_{n}(i)$ and $\psi_{n}(i+1)$ are always neighbours, and thus we extend $\psi_{n}$ to $[0,2(n-1)]$ by letting, for $0\leq i<t<i+1\leq 2(n-1)$, $\psi_{n}(t)$ be whichever of $\psi_{n}(i)$ and $\psi_{n}(i+1)$ has the larger depth (recall that the depth of a vertex $v\in{\mathbb{T}}_{n}$ is the distance, i.e., the number of edges, between $\circ$ and $v$). Let $d_{n}(v)$ be the depth of a vertex $v\in{\mathbb{T}}_{n}$. We further define the depth-first walk $V_{n}$ of ${\mathbb{T}}_{n}$ by

$$V_{n}(i)\coloneqq d_{n}(\psi_{n}(i)),\quad i\in\{0,\dots,2(n-1)\},$$

and extend $V_{n}$ to $[0,2(n-1)]$ by linear interpolation. Thus $V_{n}\in C([0,2(n-1)],{\mathbb{R}}_{+})$ . See Figure 1 for an example of $V_{n}$ . Furthermore, we normalize the domain of $V_{n}$ to $[0,1]$ by defining

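$$\widetilde{V}_{n}(t)\coloneqq V_{n}(2(n-1)t)\quad\text{and}\quad\widehat{V}_{n}(t)\coloneqq\lceil V_{n}(2(n-1)t)\rceil,$$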

for $t\in[0,1]$. Thus $\widetilde{V}_{n}\in C([0,1],{\mathbb{R}}_{+})$. Note that $d_{n}(\psi_{n}(t))=\lceil V_{n}(t)\rceil$, for $t\in[0,2(n-1)]$. Moreover,

$$\max_{v\in\mathbb{T}_{n}}d_{n}(v)=\sup_{t\in[0,2(n-1)]}V_{n}(t)=\sup_{t\in[0,1]}\widetilde{V}_{n}(t).$$
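At integer times, the depth-first search function $\psi_{n}$ and the walk $V_{n}$ can be computed with a single traversal. The sketch below is one way to do it, assuming the tree is given as ordered child lists indexed by vertex; the function name `depth_first_walk` and this encoding are choices made here for illustration.

```python
def depth_first_walk(children, root=0):
    """Return (visits, depths) for the depth-first walk of an ordered rooted tree.

    visits[i] is the (i+1)-th vertex visited, i.e. psi_n(i), and
    depths[i] is its depth, i.e. V_n(i), for i = 0, ..., 2(n-1).
    children[v] lists the children of v in their left-to-right order.
    """
    visits, depths = [root], [0]
    # Each stack entry is (vertex, iterator over its not-yet-visited children).
    stack = [(root, iter(children[root]))]
    while stack:
        child = next(stack[-1][1], None)
        if child is None:
            # All children explored: step back to the parent, if any.
            stack.pop()
            if stack:
                visits.append(stack[-1][0])
                depths.append(len(stack) - 1)
        else:
            # Step down to the next unexplored child.
            stack.append((child, iter(children[child])))
            visits.append(child)
            depths.append(len(stack) - 1)
    return visits, depths
```

For the tree with `children = [[1, 2], [3], [], []]`, the walk visits `0, 1, 3, 1, 0, 2, 0` with depths `0, 1, 2, 1, 0, 1, 0`, so it has $2(n-1)+1=7$ entries and its maximum equals the height of the tree.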

We now state the central result of this section, that is, a general limit theorem in distribution for the number of $1$ -records ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ of a deterministic rooted tree ${\mathbb{T}}_{n}$ with $n$ vertices. It is important to notice that ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ is a random variable since the $1$ -records are random. From now on, we always assume that $k\geq 2$ .

Lemma 1.

Suppose that $({\mathbb{T}}_{n})_{n\geq 1}$ is a sequence of ordered (deterministic) rooted trees, and denote the corresponding normalized depth-first walks by $\widetilde{V}_{n}$ and $\widehat{V}_{n}$ . Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ and a function $f\in C([0,1],{\mathbb{R}}_{+})$ such that

(a)

$\displaystyle a_{n}\widetilde{V}_{n}(t)\rightarrow f(t)$ , in $C([0,1],{\mathbb{R}}_{+})$ , as $n\rightarrow\infty$ .

(b)

$\displaystyle\int_{0}^{1}(a_{n}\widehat{V}_{n}(t))^{-1/k}\;{\rm d}t\rightarrow\int_{0}^{1}f(t)^{-1/k}\;{\rm d}t<\infty$ , as $n\rightarrow\infty$ .

Then, for each $q\in{\mathbb{Z}}_{\geq 0}$ ,

[TABLE]

as $n\rightarrow\infty$ , where $m_{q}(f)$ is defined in (26). Moreover, $n^{-1}a_{n}^{-1/k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}){\,{\buildrel d\over{\rightarrow}}\,}Z_{f}$ , as $n\rightarrow\infty$ , where $Z_{f}$ is a random variable with distribution $\nu_{f}$ defined by Theorem 2.

Before proving Lemma 1, we need to establish some preliminary results and to introduce some further notation. For $q\in{\mathbb{N}}$ and vertices $v_{1},\dots,v_{q}\in{\mathbb{T}}_{n}$ , let $L_{n}(v_{1},\dots,v_{q})$ be the number of edges in the subtree of ${\mathbb{T}}_{n}$ spanned by $v_{1},\dots,v_{q}$ and its root $\circ$ (i.e., the minimal number of edges that are needed to connect $v_{1},\dots,v_{q}$ and $\circ$ ). We write $D_{n}(v_{1})\coloneqq L_{n}(v_{1})$ and $D_{n}(v_{1},\dots,v_{q})\coloneqq L_{n}(v_{1},\dots,v_{q})-L_{n}(v_{1},\dots,v_{q-1})$ for $q\geq 2$ . We also consider the functional

[TABLE]

for ${\bf x}_{q}=(x_{1},\dots,x_{q})\in{\mathbb{R}}_{+}^{q}$ and ${\bf v}_{q}=(v_{1},\dots,v_{q})\in{\mathbb{T}}_{n}^{q}$ . We denote by $\Gamma(k,\cdot)$ the upper incomplete gamma function of parameter $k\in{\mathbb{N}}$ , i.e.,

$$\Gamma(k,x)\coloneqq\int_{x}^{\infty}t^{k-1}e^{-t}\;{\rm d}t,\qquad x\in{\mathbb{R}}_{+}.$$

Remark 1.

Let ${\mathbb{T}}_{n}$ be an ordered (deterministic) rooted tree with depth-first search function $\psi_{n}$ and corresponding depth-first walk $V_{n}$ . It is not difficult to see that $L_{n}$ and $L_{\lceil V_{n}\rceil}$ are connected, in the sense that $L_{n}(\psi_{n}(t_{1}),\dots,\psi_{n}(t_{q}))=L_{\lceil V_{n}\rceil}(t_{1},\dots,t_{q})$ for $t_{1},\dots,t_{q}\in[0,2(n-1)]$ ; see [25, Lemma 4.4] for a proof of this fact.
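To illustrate the quantities $L_{n}$ and $D_{n}$ on the tree itself, here is a minimal Python sketch. It is not taken from the paper and assumes that the tree is stored as a hypothetical dictionary `parent` mapping every non-root vertex to its parent. It uses the fact that each non-root vertex of the spanned subtree contributes exactly one edge, namely the edge to its parent.

```python
def spanned_edges(parent, vertices):
    """L_n(v_1, ..., v_q): edges of the subtree spanned by the vertices and the root."""
    seen = set()
    for v in vertices:
        # Climb towards the root; stop at the root (no parent entry)
        # or at a vertex whose path to the root is already counted.
        while v in parent and v not in seen:
            seen.add(v)
            v = parent[v]
    return len(seen)


def increments(parent, vertices):
    """[D_n(v_1), D_n(v_1, v_2), ...]: successive increments of L_n."""
    result, previous = [], 0
    for q in range(1, len(vertices) + 1):
        current = spanned_edges(parent, vertices[:q])
        result.append(current - previous)
        previous = current
    return result
```

With `parent = {1: 0, 2: 0, 3: 1}`, the call `increments(parent, [3, 2, 1])` returns the depth of vertex `3`, then one new edge for vertex `2`, then no new edge for vertex `1`, which already lies on the path from the root to `3`.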

Lemma 2.

Let ${\mathbb{T}}_{n}$ be an ordered (deterministic) rooted tree with $n\in{\mathbb{N}}$ vertices. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers such that $\lim_{n\rightarrow\infty}a_{n}=0$ and $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=O(a_{n}^{-1})$ . Let $\alpha\coloneqq\frac{1}{2}\left(\frac{1}{k}+\frac{1}{k+1}\right)$ and $x_{0}\coloneqq a_{n}^{\alpha}$ . Then, for $q\in{\mathbb{N}}$ and uniformly for all $x\in[0,x_{0}]$ ,

[TABLE]

where the vertices $v_{1},\dots,v_{q}\in{\mathbb{T}}_{n}$ .

Proof.

Our claim can be shown along the lines of [12, Proof of Lemma 5.1]. ∎

Recall that for two sequences of non-negative real numbers $(A_{n})_{n\geq 1}$ and $(B_{n})_{n\geq 1}$ such that $B_{n}>0$ , one writes $A_{n}=o(B_{n})$ if $\lim_{n\rightarrow\infty}A_{n}/B_{n}=0$ .

Lemma 3.

Let ${\mathbb{T}}_{n}$ be an ordered (deterministic) rooted tree with $n\in{\mathbb{N}}$ vertices. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ and $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=O(a_{n}^{-1})$ . Then the moments of ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ are given by

[TABLE]

where

[TABLE]

Proof.

For simplicity, we write $X_{q}\coloneqq{\mathcal{K}}_{1}({\mathbb{T}}_{n})^{q}$ for $q\in{\mathbb{Z}}_{\geq 0}$ and note that $X_{q}=X_{1}^{q}$ . For $q\in{\mathbb{N}}$ , we observe that

[TABLE]

where $Y_{q}\coloneqq\sum_{p=0}^{q-1}\sum_{l=0}^{p}\binom{q}{p}\binom{p}{l}(-1)^{p-l}X_{l}$ . Recall that $I_{1,v}$ is the indicator that $v\in{\mathbb{T}}_{n}$ is a $1$ -record defined in (5). By the previous identity, we have that

[TABLE]

where ${\mathcal{E}}(v_{1},\dots,v_{q})\coloneqq\{E_{1,v_{q}}<\cdots<E_{1,v_{1}}\,\,\text{and}\,\,v_{1},\dots,v_{q}\,\,\text{are all }1\text{-records}\}$ ; recall that $E_{1,v_{1}},\dots,E_{1,v_{q}}$ are independent random variables with an $\text{Exp}(1)$ distribution. To see the last identity, note that each product $I_{1,v_{1}}\cdots I_{1,v_{q}}$ occurs $q!$ times with indices permuted and for exactly one of these permutations we have that $E_{1,v_{q}}<\cdots<E_{1,v_{1}}$ .

Consider the simple case $q=2$ . Conditioning on $E_{1,v_{2}}=x_{2}<E_{1,v_{1}}=x_{1}$ , we see that $v_{1}$ and $v_{2}$ are both $1$ -records if and only if the following two events happen:

(i)

the $D_{n}(v_{1})$ ancestors of $v_{1}$ are removed after time $x_{1}$ ;

(ii)

the $D_{n}(v_{1},v_{2})$ vertices which are ancestors of $v_{2}$ but not of $v_{1}$ are removed after time $x_{2}$ .

Since $x_{2}<x_{1}$ , we note that the event (i) implies that the vertices which are ancestors of both $v_{1}$ and $v_{2}$ are removed after time $x_{1}$ , and hence also after time $x_{2}$ . Let $g(x)\coloneqq{\mathbb{P}}(\text{Gamma}(k)>x)$ for $x\in\mathbb{R}_{+}$ . Since the events (i) and (ii) are independent, we have

[TABLE]

Recall that we are assuming $k\geq 2$ . Otherwise, when $k=1$ , the above equality is not entirely correct since ${\mathcal{E}}(v_{1},v_{2})$ is impossible if $v_{2}$ is an ancestor of $v_{1}$ ; see [25, Lemma 4.3] for details in the case $k=1$ .
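For a single vertex $v$ the same conditioning argument gives ${\mathbb{P}}(v\text{ is a }1\text{-record})=\int_{0}^{\infty}e^{-x}g(x)^{d_{n}(v)}\;{\rm d}x$ , since the $d_{n}(v)$ ancestors of $v$ must all be removed after time $E_{1,v}=x$ . The Python sketch below evaluates this integral numerically. It is an illustration and not the authors' code: it relies on the standard closed form $g(x)=e^{-x}\sum_{j=0}^{k-1}x^{j}/j!$ for integer $k$ , and the function names, the truncation point `upper` and the use of the trapezoidal rule are assumptions made here for simplicity.

```python
import math


def gamma_tail(k, x):
    """g(x) = P(Gamma(k) > x) for an integer shape k >= 1."""
    term, total = 1.0, 1.0
    for j in range(1, k):
        term *= x / j          # term = x^j / j!
        total += term
    return math.exp(-x) * total


def prob_one_record(depth, k, upper=40.0, steps=200_000):
    """Trapezoidal approximation of the integral of exp(-x) * g(x)^depth over [0, upper]."""
    h = upper / steps

    def integrand(x):
        return math.exp(-x) * gamma_tail(k, x) ** depth

    total = 0.5 * (integrand(0.0) + integrand(upper))
    for i in range(1, steps):
        total += integrand(i * h)
    return h * total


def small_time_approximation(depth, k):
    """Integral of exp(-depth * x^k / k!) over [0, infinity), in closed form."""
    return math.gamma(1 + 1 / k) * (math.factorial(k) / depth) ** (1 / k)
```

For $k=1$ the integral equals $1/(d_{n}(v)+1)$ , the classical record probability. For $k\geq 2$ and large depth, the value is close to `small_time_approximation`, which replaces $g(x)^{d}$ by $\exp(-dx^{k}/k!)$ using $1-g(x)\sim x^{k}/k!$ as $x\rightarrow 0$ ; this is the heuristic reason why depths enter the moment estimates through the power $-1/k$ .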

By generalizing the previous argument to $q\in{\mathbb{N}}$ , we see that

[TABLE]

where $\overleftarrow{\bf x}_{q}=(x_{q},\dots,x_{1})\in{\mathbb{R}}_{+}^{q}$ , $x_{0}=a_{n}^{\alpha}$ and $\alpha=\frac{1}{2}\left(\frac{1}{k}+\frac{1}{k+1}\right)$ . On the one hand, Lemma 2 implies that

[TABLE]

where we have used our assumption $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=O(a_{n}^{-1})$ . On the other hand, Lemma 2 also implies that

[TABLE]

where ${\bf v}_{q}=(v_{1},\dots,v_{q})\in{\mathbb{T}}_{n}^{q}$ and

[TABLE]

this estimate can be deduced in the same way as the one for the integral $A_{2}$ . Therefore, the previous estimates and Remark 1 allow us to conclude that

[TABLE]

note that if we had not excluded the root, we would not be able to write the sum as an integral. By making the change of variables $x_{i}=a_{n}^{1/k}w_{i}$ , for $1\leq i\leq q$ , we have that

[TABLE]

Finally, our claim follows by induction on $q\in{\mathbb{N}}$ and the assumption $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ . ∎

We are now able to establish Lemma 1.

Proof of Lemma 1.

First note that by condition (a) of Lemma 1 and (38), we have $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=\sup_{t\in[0,1]}\widetilde{V}_{n}(t)=O(a_{n}^{-1})$ . Thus the conditions for Lemma 2 and Lemma 3 are satisfied.

Recall the functions $\bar{H}_{n,q}$ and $H_{f,q}$ defined in (49) and (35), respectively. We only need to show that

[TABLE]

The above convergence together with Lemma 3 implies that ${\mathbb{E}}[{\mathcal{K}}_{1}({\mathbb{T}}_{n})^{q}]=O(n^{q}a_{n}^{q/k})$ which clearly proves the first claim in Lemma 1. The second claim follows immediately from Theorem 2 and the method of moments.

We now prove the claim in (115). Recall that a sequence $(g_{n})_{n\geq 1}$ of non-negative functions on a measure space $(\Omega,{\mathcal{F}},\mu)$ with total mass $1$ , i.e., $\mu(\Omega)=1$ , is uniformly integrable if $\int_{\Omega}g_{n}\;{\rm d}\mu<\infty$ for all $n\geq 1$ and

$$\lim_{K\rightarrow\infty}\sup_{n\geq 1}\int_{\{g_{n}>K\}}g_{n}\;{\rm d}\mu=0.$$

We also recall the following useful result on uniformly integrable sequences of functions. Suppose further that $g_{n}\rightarrow g$ almost everywhere as $n\rightarrow\infty$ . By [27, Proposition 4.12], we know that

[TABLE]

Then in order to prove (115), it is enough to check the following:

(i)

The sequence $(\bar{H}_{n,q})_{n\geq 1}$ is uniformly integrable on $[0,1]^{q}$ , and

(ii)

$\bar{H}_{n,q}\rightarrow H_{f,q}$ as $n\rightarrow\infty$ .

We start by showing (i). Note that $|a_{n}\widetilde{V}_{n}(t)-a_{n}\widehat{V}_{n}(t)|\leq a_{n}$ for $t\in[0,1]$ . Then, the assumption (a) implies that $a_{n}\widehat{V}_{n}(t)\rightarrow f(t)$ and $1/(a_{n}\widehat{V}_{n}(t))^{1/k}\rightarrow(1/f(t))^{1/k}$ , for every $t\in[0,1]$ , as $n\rightarrow\infty$ . Moreover, the assumption (b) shows that $(1/(a_{n}\widehat{V}_{n}(t))^{1/k})_{n\geq 1}$ is uniformly integrable on $[0,1]$ . More generally, for every fixed $q\in{\mathbb{N}}$ and ${\bf t}_{q}=(t_{1},\dots,t_{q})$ , define the function $\widetilde{H}_{n,q}({\bf t}_{q})\coloneqq(a_{n}\widehat{V}_{n}(t_{1})\cdots a_{n}\widehat{V}_{n}(t_{q}))^{-1/k}$ . We then observe that

[TABLE]

as $n\rightarrow\infty$ . Thus the result in (116) shows that the sequence $(\widetilde{H}_{n,q})_{n\geq 1}$ is uniformly integrable on $[0,1]^{q}$ . Next notice that the inequality $\exp(-a_{n}^{1/k}(x_{1}+\cdots+x_{q}))\leq 1$ implies that $\bar{H}_{n,q}({\bf t}_{q})\leq H_{a_{n}\widehat{V}_{n},q}({\bf t}_{q})$ , where $H_{a_{n}\widehat{V}_{n},q}$ is defined in (35). Then the inequality (36) implies that there exists a constant $C_{k,q}>0$ such that $\bar{H}_{n,q}({\bf t}_{q})\leq C_{k,q}\widetilde{H}_{n,q}({\bf t}_{q})$ . Hence (i) follows by applying [18, Theorem 4.5].

Finally, we verify (ii). Recall that condition (a), together with the bound $|a_{n}\widetilde{V}_{n}(t)-a_{n}\widehat{V}_{n}(t)|\leq a_{n}$ , implies that $a_{n}\widehat{V}_{n}(t)\rightarrow f(t)$ uniformly in $t\in[0,1]$ , as $n\rightarrow\infty$ . Hence, whenever $0\leq t_{1}\leq t_{2}\leq 1$ , $\inf_{t\in[t_{1},t_{2}]}a_{n}\widehat{V}_{n}(t)\rightarrow\inf_{t\in[t_{1},t_{2}]}f(t)$ as $n\rightarrow\infty$ . Thus, for $q\in{\mathbb{N}}$ , equation (8) implies that $D_{a_{n}\widehat{V}_{n}}(t_{1},\dots,t_{q})\rightarrow D_{f}(t_{1},\dots,t_{q})$ uniformly for $t_{1},\dots,t_{q}\in[0,1]$ as $n\rightarrow\infty$ . Then, for ${\bf x}_{q}\in{\mathbb{R}}_{+}^{q}$ and ${\bf t}_{q}\in[0,1]^{q}$ ,

[TABLE]

Note that for $\varepsilon\in(0,1)$ there exists $N\in{\mathbb{N}}$ such that

[TABLE]

Moreover, note that condition (b) implies that the function on the right-hand side of the inequality is integrable on $\{{\bf x}_{q}\in{\mathbb{R}}_{+}^{q}:0\leq x_{q}\leq\cdots\leq x_{1}<\infty\}$ . Therefore, (ii) follows by the dominated convergence theorem. This finishes the proof. ∎

We can apply ideas similar to those in the proofs of Lemma 1 and Lemma 3 to estimate the mean of the number of $r$ -records ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ . We have not tried to estimate higher moments of ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ in order to obtain a limit theorem in distribution for this quantity. We believe that our methods apply, but the computations would be more involved and we do not pursue them here. Furthermore, the next results show that, for $r\in\{2,\dots,k\}$ , ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ is of smaller order than ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ and hence does not contribute (in the limit) to the distribution of the $k$ -cut number ${\mathcal{K}}({\mathbb{T}}_{n})$ .

Lemma 4.

Let ${\mathbb{T}}_{n}$ be an ordered (deterministic) rooted tree with $n\in{\mathbb{N}}$ vertices. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ and $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=O(a_{n}^{-1})$ . Then, for $r\in\{1,\dots,k\}$ ,

[TABLE]

Proof.

Note that the case $r=1$ has been proven in Lemma 3. We follow a similar strategy to prove the case $r\in\{2,\dots,k\}$ . Recall that $I_{r,v}$ is the indicator of the event that the vertex $v\in{\mathbb{T}}_{n}$ is an $r$ -record defined in (5). We observe that

[TABLE]

where $x_{0}=a_{n}^{\alpha}$ and $\alpha=\frac{1}{2}\left(\frac{1}{k}+\frac{1}{k+1}\right)$ . On the one hand, Lemma 2, with $q=1$ , implies that

[TABLE]

On the other hand, Lemma 2, with $q=1$ , also implies that

[TABLE]

where

[TABLE]

this estimate can be deduced in the same way as the one for the integral $A_{2}$ . By recalling that ${\mathcal{K}}_{r}({\mathbb{T}}_{n})=\sum_{v\in{\mathbb{T}}_{n}}I_{r,v}$ , we conclude from the previous estimates that

[TABLE]

Finally, our claim follows by making the change of variables $x=a_{n}^{1/k}w$ . ∎

Lemma 5.

Suppose that $({\mathbb{T}}_{n})_{n\geq 1}$ is a sequence of ordered (deterministic) rooted trees. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ , and a function $f\in C([0,1],{\mathbb{R}}_{+})$ such that $\widetilde{V}_{n}$ satisfies the condition (a) in Lemma 1 and that for $r\in\{1,\dots,k\}$ ,

[TABLE]

Then,

[TABLE]

Proof.

Notice that the case $r=1$ has been proved in Lemma 1. The proof of the general case $r\in\{1,\dots,k\}$ follows by a simple adaptation of the argument used in the proof of Lemma 1 for $q=1$ with the use of Lemma 4. One only needs to note that

[TABLE]

3 Proof of Theorem 1

Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). Note that in this case both the $r$ -records and the tree are random. We therefore study ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ as a random variable conditioned on ${\mathbb{T}}_{n}$ . More precisely, we first choose a random tree ${\mathbb{T}}_{n}$ . Then we keep it fixed and consider the number of $r$ -records. This gives a random variable ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ with a distribution that depends on ${\mathbb{T}}_{n}$ . We have the following lemma, which corresponds to [25, Lemma 4.8].

Lemma 6.

Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). For $r\in\{1,\dots,k\}$ , we have that ${\mathbb{E}}[{\mathcal{K}}_{r}({\mathbb{T}}_{n})]=O(n^{1-\frac{r}{2k}})$ .

Proof.

By an application of the proof of Lemma 4 with $a_{n}=n^{-1/2}$ (in particular, the equality (2)), we see that

[TABLE]

where $w_{i}({\mathbb{T}}_{n})$ denotes the number of vertices at depth $i\in{\mathbb{N}}$ in ${\mathbb{T}}_{n}$ . Notice that

[TABLE]

by the fact that $\sum_{i\geq 0}w_{i}({\mathbb{T}}_{n})=n$ . Since ${\mathbb{E}}[\xi^{2}]<\infty$ by our assumption (1), [25, Theorem 1.13] implies that for all $n,i\in{\mathbb{N}}$ , ${\mathbb{E}}[w_{i}({\mathbb{T}}_{n})]\leq Ci$ for some constant $C>0$ depending on $\xi$ only. Therefore,

[TABLE]

By taking expectation in (118), our claim follows by (119). ∎

We continue by studying the moments of the number of $1$ -records ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ . We denote by $\mu_{n}$ the (random) probability distribution of $\sigma^{-1/k}n^{-1+1/2k}{\mathcal{K}}_{1}({\mathbb{T}}_{n})$ given ${\mathbb{T}}_{n}$ . Define the random variables

[TABLE]

Notice that the moments of $\mu_{n}$ are given by $\sigma^{-q/k}n^{-q+q/2k}m_{q}({\mathbb{T}}_{n})$ . We have the following lemma that corresponds to [25, Lemma 4.9].

Lemma 7.

Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). Furthermore, fix $q\in{\mathbb{N}}$ and suppose that ${\mathbb{E}}[\xi^{q+1}]<\infty$ . Then ${\mathbb{E}}[m_{q}({\mathbb{T}}_{n})]=O(n^{q-\frac{q}{2k}})$ .

Proof.

By an application of Lemma 3 with $q\in{\mathbb{N}}$ and $a_{n}=n^{-1/2}$ (in particular, the equality (62) in its proof), we see that

[TABLE]

where $Y_{q}\coloneqq\sum_{p=0}^{q-1}\sum_{l=0}^{p}\binom{q}{p}\binom{p}{l}(-1)^{p-l}m_{l}({\mathbb{T}}_{n})$ . After a similar computation as in the proof of the inequality (36), one sees that there exists a constant $C_{k,q}>0$ such that

[TABLE]

where $\bar{m}_{1}({\mathbb{T}}_{n})\coloneqq\sum_{v\in{\mathbb{T}}_{n}\setminus\{\circ\}}d_{n}(v)^{-1/k}$ . Notice that

[TABLE]

where $w_{i}({\mathbb{T}}_{n})$ denotes the number of vertices at depth $i\in{\mathbb{N}}$ in ${\mathbb{T}}_{n}$ . Since ${\mathbb{E}}[\xi^{q+1}]<\infty$ for $q\in{\mathbb{N}}$ , [25, Theorem 1.13] implies that for all $n,i\in{\mathbb{N}}$ , ${\mathbb{E}}[w_{i}({\mathbb{T}}_{n})^{q}]\leq Ci^{q}$ for some constant $C>0$ depending on $q$ and $\xi$ only. Therefore, Minkowski’s inequality implies that

[TABLE]

By taking expectation in (121), we deduce from (122) that

[TABLE]

and our claim follows by induction on $q\in{\mathbb{N}}$ . ∎

Let $\widetilde{V}_{n}$ and $\widehat{V}_{n}$ be the normalized depth-first search walks associated with the conditioned Galton-Watson tree ${\mathbb{T}}_{n}$ . Note that in this case $\widetilde{V}_{n}$ becomes a random function on $C([0,1],{\mathbb{R}}_{+})$ . Recall that a remarkable result due to Aldous [3, Theorem 23 with Remark 2] (see also [29, Theorem 1]) shows that

[TABLE]

in $C([0,1],{\mathbb{R}}_{+})$ , with its usual topology, and where $B^{\rm ex}=(B^{\rm ex}(t),t\in[0,1])$ is a standard normalized Brownian excursion. Note that $B^{\rm ex}$ is a random element from $C([0,1],{\mathbb{R}}_{+})$ ; see for example [8] or [36].

Lemma 8.

For $r\in\{1,\dots,k\}$ , we have that $\int_{0}^{1}B^{\rm ex}(t)^{-r/k}\;{\rm d}t<\infty$ almost surely.

Proof.

One only needs to show that ${\mathbb{E}}[\int_{0}^{1}B^{\rm ex}(t)^{-r/k}\;{\rm d}t]<\infty$ . This follows by computing ${\mathbb{E}}[B^{\rm ex}(t)^{-r/k}]$ , for every $t\in[0,1]$ , from the well-known density function of $B^{\rm ex}(t)$ ; see [8, Chapter II, Equation (1.4)]. ∎
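The expectation in this proof can be computed in closed form, and the following Python sketch does so as a sanity check. It is an illustration, not part of the paper. It uses the classical density of $B^{\rm ex}(t)$ , namely $\frac{2x^{2}}{\sqrt{2\pi t^{3}(1-t)^{3}}}e^{-x^{2}/(2t(1-t))}$ for $x>0$ and $0<t<1$ , which says that $B^{\rm ex}(t)$ is distributed as $\sqrt{t(1-t)}$ times a chi random variable with three degrees of freedom, and it exchanges expectation and integral by Tonelli's theorem. The function name and the parameter `s` (playing the role of $r/k$ ) are assumptions of this sketch.

```python
import math


def expected_excursion_integral(s):
    """E[ integral over [0,1] of B^ex(t)^(-s) dt ] for 0 <= s < 2.

    Uses E[chi_3^(-s)] = 2^(-s/2) * Gamma((3-s)/2) / Gamma(3/2) and the
    Beta integral of (t(1-t))^(-s/2) over [0,1], Gamma(1-s/2)^2 / Gamma(2-s).
    """
    if not 0 <= s < 2:
        raise ValueError("the integral over t is finite only for 0 <= s < 2")
    chi_moment = 2 ** (-s / 2) * math.gamma((3 - s) / 2) / math.gamma(1.5)
    beta_integral = math.gamma(1 - s / 2) ** 2 / math.gamma(2 - s)
    return chi_moment * beta_integral
```

Since $r/k\in(0,1]$ for $r\in\{1,\dots,k\}$ , the value is finite in every case covered by Lemma 8. For $s=1$ the formula gives $\sqrt{2\pi}$ .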

Therefore, Theorem 2 and Lemma 8 imply that there exists almost surely a (unique) measure $\nu_{2B^{\rm ex}}$ with moments given by $m_{q}(2B^{\rm ex})$ . The next result provides a generalization of [25, Theorem 1.10] and it will be used in the proof of Theorem 1.

Theorem 3.

Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). Then

[TABLE]

in the space of probability measures on ${\mathbb{R}}$ . Moreover, we have that for every $q\in{\mathbb{N}}$ ,

[TABLE]

The convergences in (123), (124) and (125), for all $q\in{\mathbb{N}}$ , hold jointly. In particular, if ${\mathbb{E}}[\xi^{p}]<\infty$ for all $p\in{\mathbb{N}}$ , then for all $q\in{\mathbb{N}}$ and $l\in{\mathbb{N}}$ ,

[TABLE]

Proof.

A simple adaptation of the proof of [25, Lemma 4.7] easily shows that

[TABLE]

in $C([0,1],\mathbb{R}_{+})\times{\mathbb{R}}$ , as $n\rightarrow\infty$ . By the Skorohod coupling theorem (see e.g. [27, Theorem 4.30]), we can assume that the trees $({\mathbb{T}}_{n})_{n\geq 1}$ are defined on a common probability space such that the convergence in (127) holds almost surely. Therefore, the convergences (124) and (125) follow immediately from Lemma 1. It only remains to prove (126). Recall that we assume that ${\mathbb{E}}[\xi^{p}]<\infty$ for every $p\in{\mathbb{N}}$ . By Jensen’s inequality, we notice that $m_{q}({\mathbb{T}}_{n})^{l}\leq m_{lq}({\mathbb{T}}_{n})$ for $l,q\in{\mathbb{N}}$ . Hence Lemma 7 implies that ${\mathbb{E}}[m_{q}({\mathbb{T}}_{n})^{l}]=O(n^{lq-\frac{lq}{2k}})$ . This shows that every moment of the right-hand side of (125) stays bounded as $n\rightarrow\infty$ which implies (126). ∎

We are now able to prove Theorem 1.

Proof of Theorem 1.

Lemma 6 establishes that $\mathbb{E}[{\mathcal{K}}_{r}({\mathbb{T}}_{n})]=O(n^{1-\frac{r}{2k}})$ for $r\in\{1,\dots,k\}$ . As a consequence, Markov’s inequality implies that $n^{-1+\frac{1}{2k}}{\mathcal{K}}_{r}({\mathbb{T}}_{n})\rightarrow 0$ in probability, as $n\rightarrow\infty$ , for $r\in\{2,\dots,k\}$ . Then, by the identity in (6), it is enough to prove Theorem 1 for ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ instead of ${\mathcal{K}}({\mathbb{T}}_{n})$ . By the definition of $\mu_{n}$ and Theorem 3, for any bounded continuous function $g:{\mathbb{R}}_{+}\rightarrow{\mathbb{R}}_{+}$ ,

[TABLE]

Taking expectations, the dominated convergence theorem implies that $\sigma^{-1/k}n^{-1+1/2k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}){\,{\buildrel d\over{\rightarrow}}\,}Z_{\rm CRT}$ , as $n\rightarrow\infty$ , where $Z_{\rm CRT}$ has distribution $\nu(\cdot)={\mathbb{E}}[\nu_{2B^{\rm ex}}(\cdot)]$ . Suppose that ${\mathbb{E}}[\xi^{p}]<\infty$ for every $p\in{\mathbb{N}}$ . Lemma 7 implies that every moment of $n^{-1+1/2k}{\mathcal{K}}_{1}({\mathbb{T}}_{n})$ stays bounded as $n\rightarrow\infty$ which implies the moment convergence in Theorem 1. It remains to identify the moments of $Z_{\rm CRT}$ (or equivalently $\nu$ ). Notice that

[TABLE]

For $q\in{\mathbb{N}}$ , let $U_{1},\dots,U_{q}$ be independent random variables with the uniform distribution on $[0,1]$ . Let $Y_{1},\dots,Y_{q}$ be the first $q$ points in a Poisson process on $(0,\infty)$ with intensity $x\,{\rm d}x$ , i.e., $Y_{1},\dots,Y_{q}$ have joint density function $y_{1}\cdots y_{q}e^{-y_{q}^{2}/2}$ on $0<y_{1}<\cdots<y_{q}<\infty$ . It is well-known that $L_{2B^{\rm ex}}(U_{1},\dots,U_{q})\stackrel{{\scriptstyle d}}{{=}}Y_{q}$ , see, e.g., [25, Proof of Lemma 5.1]. Thus by recalling the definition of the function $H_{2B^{\rm ex},q}$ in (35), we see that

[TABLE]

where ${\bf U}_{q}=(U_{1},\dots,U_{q})$ , ${\bf y}_{q}=(y_{1},\dots,y_{q})\in{\mathbb{R}}_{+}^{q}$ and

[TABLE]

Finally, the expression for the moments in Theorem 1 follows by first changing the order of integration in (128) and then by making the change of variables $w_{i}=y_{i}-y_{i-1}$ for $2\leq i\leq q$ . ∎
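The points $Y_{1},\dots,Y_{q}$ used in this proof are easy to generate. The map $y\mapsto y^{2}/2$ sends a Poisson process with intensity $x\,{\rm d}x$ to a unit-rate Poisson process, so one may take $Y_{i}=\sqrt{2(E_{1}+\cdots+E_{i})}$ with i.i.d. $\text{Exp}(1)$ gaps $E_{1},E_{2},\dots$ ; a change of variables then recovers the joint density $y_{1}\cdots y_{q}e^{-y_{q}^{2}/2}$ . The Python sketch below implements this construction. The function names are hypothetical and the code is only meant as an illustration.

```python
import math
import random


def points_from_gaps(gaps):
    """Map Exp(1) gaps E_1, ..., E_q to the points Y_i = sqrt(2 (E_1 + ... + E_i))."""
    total, points = 0.0, []
    for gap in gaps:
        total += gap
        points.append(math.sqrt(2.0 * total))
    return points


def sample_points(q, rng):
    """First q points of a Poisson process on (0, infinity) with intensity x dx."""
    return points_from_gaps([rng.expovariate(1.0) for _ in range(q)])


if __name__ == "__main__":
    rng = random.Random(2020)
    # Y_1 has the Rayleigh distribution, whose mean is sqrt(pi / 2).
    samples = [sample_points(1, rng)[0] for _ in range(20_000)]
    print(sum(samples) / len(samples), math.sqrt(math.pi / 2))
```

Averaging a functional of `sample_points(q, rng)` over many draws gives a Monte Carlo estimate of the corresponding expectation with respect to $(Y_{1},\dots,Y_{q})$ .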

Following the idea of the proof of Theorem 1, we obtain the following convergence of the first moment of the number of $r$ -records ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ . This provides a proof of [12, Lemma 4.10].

Lemma 9.

Let ${\mathbb{T}}_{n}$ be a Galton-Watson tree conditioned on its number of vertices being $n\in{\mathbb{N}}$ with offspring distribution $\xi$ satisfying (1). For $r\in\{1,\dots,k\}$ , we have that

[TABLE]

Proof.

The proof follows by a simple adaptation of the argument used in the proof of Theorem 1 by using Lemma 5 (with $a_{n}=n^{-1/2}$ ), Lemma 6 and Lemma 8. One only needs to note that

[TABLE]

which follows from the well-known density function of $B^{\rm ex}(t)$ ; see [8, Chapter II, Equation (1.4)]. ∎

4 Further applications

In this section, we show that the results obtained in Section 2 can be used and extended to study the $k$ -cut model in other families of trees. Throughout, let ${\mathbb{T}}_{n}$ be a rooted tree (possibly random and not necessarily ordered) with $n\in{\mathbb{N}}$ vertices and root $\circ$ .

4.1 Paths

Lemma 10.

Let ${\mathbb{T}}_{n}$ be a path with $n$ vertices labelled $1,\dots,n$ from the root to the leaf. For $k\in\{2,3,\dots\}$ , we have that $n^{-1+1/k}{\mathcal{K}}({\mathbb{T}}_{n}){\,{\buildrel d\over{\rightarrow}}\,}Z_{\rm path}$ , as $n\rightarrow\infty$ , where $Z_{\rm path}$ is a non-degenerate random variable whose law is determined entirely by its moments: ${\mathbb{E}}[Z_{\rm path}^{q}]=m_{q}(f)$ for $q\in{\mathbb{Z}}_{\geq 0}$ , where

[TABLE]

Proof.

By [12, Theorem 1.1], we know that ${\mathbb{E}}[{\mathcal{K}}_{r}({\mathbb{T}}_{n})]=O(n^{1-\frac{r}{k}})$ , for $r\in\{1,\dots,k-1\}$ , and ${\mathbb{E}}[{\mathcal{K}}_{k}({\mathbb{T}}_{n})]=O(\ln n)$ . Then Markov’s inequality implies that $n^{-1+1/k}{\mathcal{K}}_{r}({\mathbb{T}}_{n})\rightarrow 0$ in probability, as $n\rightarrow\infty$ , for $r\in\{2,\dots,k\}$ . Thus, by the identity (6), it is enough to prove our result for ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ instead of ${\mathcal{K}}({\mathbb{T}}_{n})$ . Note that the normalized depth-first search walks $\widetilde{V}_{n}$ and $\widehat{V}_{n}$ of ${\mathbb{T}}_{n}$ , defined in (37), satisfy $n^{-1}\widetilde{V}_{n}(t)=f(t)$ and $n^{-1}\widehat{V}_{n}(t)=n^{-1}\lceil\widetilde{V}_{n}(t)\rceil$ for $t\in[0,1]$ . It should be clear that the conditions of Lemma 1 are fulfilled with $a_{n}=n^{-1}$ . Therefore, our result follows from a simple application of Lemma 1. ∎
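For the path, condition (b) of Lemma 1 can be checked by hand, and the following Python sketch does this numerically. It is an illustration under a stated assumption: the limit function is taken to be the tent function $f(t)=1-|1-2t|$ , which is the pointwise limit of $n^{-1}\widetilde{V}_{n}$ for a path. Then $\int_{0}^{1}f(t)^{-1/k}\;{\rm d}t=k/(k-1)$ for $k\geq 2$ . On the other hand, $\lceil\widetilde{V}_{n}\rceil$ takes each value $j\in\{1,\dots,n-1\}$ on a set of Lebesgue measure $1/(n-1)$ , so the integral on the left-hand side of (b) with $a_{n}=n^{-1}$ is a finite sum.

```python
def condition_b_integral(n, k):
    """Integral over [0,1] of (V_hat_n(t) / n)^(-1/k) for the path with n >= 2 vertices."""
    return sum((j / n) ** (-1.0 / k) for j in range(1, n)) / (n - 1)


def tent_limit(k):
    """Integral over [0,1] of f(t)^(-1/k) for the assumed tent function f, k >= 2."""
    return k / (k - 1)


if __name__ == "__main__":
    for n in (10, 1_000, 100_000):
        print(n, condition_b_integral(n, 2), tent_limit(2))
```

The printed values approach the limit as $n$ grows, in line with the convergence required in (b).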

Remark 2.

The convergence in distribution and moments of the $k$ -cut number of a path to $Z_{\rm path}$ has been proved in [12, Theorem 1.5] with a very different method. The contribution of Lemma 10 is the formula for computing the $q$ -th moment of the limiting variable $Z_{\rm path}$ for all $q\in{\mathbb{Z}}_{\geq 0}$ .

4.2 General trees

The next result establishes a limit in distribution for the number of $1$ -records ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ of a general (random) rooted tree in the same spirit as in Lemma 1. For $q\in{\mathbb{N}}$ , let $u_{1},\dots,u_{q}$ be a sequence of independent uniformly chosen vertices on ${\mathbb{T}}_{n}$ . Recall that $L_{n}(u_{1},\dots,u_{q})$ denotes the number of edges in the subtree of ${\mathbb{T}}_{n}$ spanned by $u_{1},\dots,u_{q}$ and its root $\circ$ (i.e., the minimal number of edges that are needed to connect $u_{1},\dots,u_{q}$ and $\circ$ ). In particular, $L_{n}(u_{1})=d_{n}(u_{1})$ is the depth of the vertex $u_{1}$ in ${\mathbb{T}}_{n}$ . In the sequel, we will often use the notation $A_{n}=O_{p}(B_{n})$ , where $(A_{n})_{n\geq 1}$ and $(B_{n})_{n\geq 1}$ are two sequences of non-negative real random variables such that $B_{n}>0$ , to indicate that $\lim_{\delta\rightarrow\infty}\limsup_{n\rightarrow\infty}{\mathbb{P}}(A_{n}>\delta B_{n})=0$ .

Theorem 4.

Let $({\mathbb{T}}_{n})_{n\geq 1}$ be a sequence of rooted trees. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ and such that

(a)

$\max_{v\in{\mathbb{T}}_{n}}L_{n}(v)=O_{\rm p}(a_{n}^{-1}).$

(b)

For every $q\in{\mathbb{N}}$ , $\displaystyle a_{n}(L_{n}(u_{1}),\dots,L_{n}(u_{1},\dots,u_{q})){\,{\buildrel d\over{\rightarrow}}\,}(\zeta_{1},\dots,\zeta_{1}+\cdots+\zeta_{q}),\hskip 2.84526pt\text{as}\hskip 2.84526ptn\rightarrow\infty$ , where $\zeta_{1},\zeta_{2},\dots$ is a sequence of i.i.d. random variables in ${\mathbb{R}}_{+}$ with no atom at $0$ .

(c)

For every $q\in{\mathbb{N}}$ , $\displaystyle{\mathbb{E}}[(a_{n}L_{n}(u_{1})\cdots a_{n}L_{n}(u_{q}))^{-1/k}\mathds{1}_{\{u_{1},\dots,u_{q}\in{\mathbb{T}}_{n}\setminus\{\circ\}\}}]\rightarrow{\mathbb{E}}[\zeta_{1}^{-1/k}]^{q}<\infty,\hskip 2.84526pt\text{as}\hskip 2.84526ptn\rightarrow\infty.$

Then $n^{-1}a_{n}^{-1/k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}){\,{\buildrel d\over{\rightarrow}}\,}Z_{\zeta}$ , as $n\rightarrow\infty$ , where $Z_{\zeta}$ is a random variable whose law is determined entirely by its moments: ${\mathbb{E}}[Z_{\zeta}^{0}]=1$ , and for $q\in{\mathbb{N}}$ ,

[TABLE]

Proof.

By the assumption (a) and Lemma 3 (in particular, the identity (62)), we see that

[TABLE]

where ${\bf v}_{q}=(v_{1},\dots,v_{q})\in{\mathbb{T}}_{n}^{q}$ , $Y_{q}\coloneqq\sum_{p=0}^{q-1}\sum_{l=0}^{p}\binom{q}{p}\binom{p}{l}(-1)^{p-l}{\mathbb{E}}[{\mathcal{K}}_{1}({\mathbb{T}}_{n})^{l}|{\mathbb{T}}_{n}]$ and

[TABLE]

with $G_{n}$ defined in (39). Then we see that

[TABLE]

where ${\bf u}_{q}=(u_{1},\dots,u_{q})$ . Suppose that we have proven that

[TABLE]

as $n\rightarrow\infty$ . Then the result follows by induction on $q\in{\mathbb{N}}$ together with the previous convergence.

We henceforth prove the claim in (140). From the result in (116), it is enough to check the following:

(i)

The sequence $(a_{n}^{-q/k}\widehat{H}_{n,q}({\bf u}_{q})\mathds{1}_{\{{\bf u}_{q}\in({\mathbb{T}}_{n}\setminus\{\circ\})^{q}\}})_{n\geq 1}$ is uniformly integrable.

(ii)

$\displaystyle a_{n}^{-q/k}\widehat{H}_{n,q}({\bf u}_{q})\mathds{1}_{\{{\bf u}_{q}\in({\mathbb{T}}_{n}\setminus\{\circ\})^{q}\}}{\,{\buildrel d\over{\rightarrow}}\,}\int_{0}^{\infty}\int_{0}^{x_{1}}\cdots\int_{0}^{x_{q-1}}\exp\left({-\frac{\zeta_{1}x^{k}_{1}+\cdots+\zeta_{q}x_{q}^{k}}{k!}}\right)\;{\rm d}\overleftarrow{\bf x}_{q}$ , as $n\rightarrow\infty$ .

We start by showing (i). Since $\exp(-(x_{1}+\cdots+x_{q}))\leq 1$ for $x_{1},\dots,x_{q}\in{\mathbb{R}}_{+}$ , we have that

[TABLE]

Hence after a similar computation as in the proof of the inequality (36), one obtains that there exists a constant $C_{k,q}>0$ such that

[TABLE]

Notice that our hypotheses (b) and (c) together with the result in (116) show that the sequence

[TABLE]

is uniformly integrable. Hence (i) follows from [18, Theorem 5.4.5].

Finally, we verify (ii). By making the change of variables $x_{i}=a_{n}^{1/k}w_{i}$ , for $1\leq i\leq q$ , we see that

[TABLE]

where ${\bf w}_{q}=(w_{1},\dots,w_{q})\in{\mathbb{R}}_{+}^{q}$ , $\overleftarrow{\bf w}_{q}=(w_{q},\dots,w_{1})$ , and

[TABLE]

with $D_{n}(u_{1})\coloneqq L_{n}(u_{1})$ and $D_{n}(u_{1},\dots,u_{q})\coloneqq L_{n}(u_{1},\dots,u_{q})-L_{n}(u_{1},\dots,u_{q-1})$ for $q\geq 2$ . Notice that $\mathds{1}_{\{{\bf u}_{q}\in({\mathbb{T}}_{n}\setminus\{\circ\})^{q}\}}{\,{\buildrel d\over{\rightarrow}}\,}1$ , as $n\rightarrow\infty$ . Thus, condition (b) implies that

[TABLE]

By the Skorohod coupling theorem (see e.g. [27, Theorem 4.30]), we can assume that the previous convergence holds almost surely together with the convergence in condition (b). Notice that for $\varepsilon\in(0,1)$ there exists $N\in{\mathbb{N}}$ such that

[TABLE]

By condition (c), notice also that the function on the right-hand side is integrable on $\{{\bf w}_{q}\in{\mathbb{R}}_{+}^{q}:0\leq w_{q}\leq\cdots\leq w_{1}<\infty\}$ . Therefore, it should be clear now that (ii) follows by the dominated convergence theorem. This concludes our proof. ∎

The next result establishes an estimate for the mean number of $r$ -records ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ of a general (random) rooted tree in the same spirit as in Lemma 5. Furthermore, it shows that, for $r\in\{2,\dots,k\}$ , ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ is of smaller order than ${\mathcal{K}}_{1}({\mathbb{T}}_{n})$ and hence does not contribute (in the limit) to the distribution of the $k$ -cut number ${\mathcal{K}}({\mathbb{T}}_{n})$ . We also believe that our methods can be used to estimate higher moments and to obtain a result analogous to Theorem 4 for ${\mathcal{K}}_{r}({\mathbb{T}}_{n})$ . We have not attempted this, since the estimate of the mean is enough for our purposes.

Lemma 11.

Let $({\mathbb{T}}_{n})_{n\geq 1}$ be a sequence of rooted trees. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}=\infty$ and such that

(a)

$\max_{v\in{\mathbb{T}}_{n}}L_{n}(v)=O_{\rm p}(a_{n}^{-1}).$

(b)

$\displaystyle a_{n}L_{n}(u_{1}){\,{\buildrel d\over{\rightarrow}}\,}\zeta_{1},\hskip 2.84526pt\text{as}\hskip 2.84526ptn\rightarrow\infty$ , where $\zeta_{1}$ is a random variable in ${\mathbb{R}}_{+}$ with no atom at $0$ .

(c)

For every $r\in\{1,\dots,k\}$ , $\displaystyle{\mathbb{E}}[(a_{n}L_{n}(u_{1}))^{-r/k}\mathds{1}_{\{u_{1}\in{\mathbb{T}}_{n}\setminus\{\circ\}\}}]\rightarrow{\mathbb{E}}[\zeta_{1}^{-r/k}]<\infty,\hskip 2.84526pt\text{as}\hskip 2.84526ptn\rightarrow\infty.$

Then, for $r\in\{1,\dots,k\}$ ,

[TABLE]

Proof.

By the assumption (a) and Lemma 4 (in particular, the identity (2)), we see that

[TABLE]

Hence

[TABLE]

Therefore, our result follows by proving that

[TABLE]

where the last integral is equal to the right-hand side of (143). Note that the case $r=1$ has been proved in Theorem 4. The proof of the general case $r\in\{1,\dots,k\}$ follows by a simple adaptation of the argument used in the proof of Theorem 4 for $q=1$ and details are left to the reader. ∎

The next lemma provides a useful way to verify condition (c) in Theorem 4.

Lemma 12.

Let ${\mathbb{T}}_{n}$ be a rooted tree. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}^{1/k}=\infty$ and such that for every $q\in{\mathbb{N}}$ ,

[TABLE]

where $\zeta_{1},\zeta_{2},\dots$ is a sequence of i.i.d. random variables in ${\mathbb{R}}_{+}$ with no atom at $0$ such that ${\mathbb{E}}[\zeta_{1}^{-1/k}]<\infty$ . Furthermore, assume that for every $q\in{\mathbb{N}}$ there exists $\delta>0$ such that for all $\varepsilon\in(0,\delta)$

[TABLE]

where $W_{i}({\mathbb{T}}_{n})$ denotes the number of vertices at depth $i\in{\mathbb{Z}}_{\geq 0}$ in ${\mathbb{T}}_{n}$ . Then the condition (c) in Theorem 4 is satisfied.

Proof.

For simplicity, we introduce the notation $X_{n,q}\coloneqq(a_{n}L_{n}(u_{1})\cdots a_{n}L_{n}(u_{q}))^{-1/k}\mathds{1}_{\{{\bf u}_{q}\in({\mathbb{T}}_{n}\setminus\{\circ\})^{q}\}}$ and $X_{q}\coloneqq(\zeta_{1}\cdots\zeta_{q})^{-1/k}$ , for $n,q\in{\mathbb{N}}$ . Consider $\delta>0$ such that for $\varepsilon\in(0,\delta)$ the property in (144) is satisfied. Define the function $\phi_{\varepsilon}:{\mathbb{R}}_{+}\rightarrow{\mathbb{R}}_{+}$ given by $\phi_{\varepsilon}=0$ on $[0,\varepsilon]$ , $\phi_{\varepsilon}=1$ on $[2\varepsilon,\infty)$ , and $\phi_{\varepsilon}$ linear on $[\varepsilon,2\varepsilon]$ . Since $\mathds{1}_{\{{\bf u}_{q}\in({\mathbb{T}}_{n}\setminus\{\circ\})^{q}\}}{\,{\buildrel d\over{\rightarrow}}\,}1$ we observe that

[TABLE]

Further, we note that $\phi_{\varepsilon}(X^{-k}_{q})\rightarrow 1$ , almost surely, as $\varepsilon\rightarrow 0$ . In order to show that condition (c) in Theorem 4 is fulfilled, it is enough to check that

[TABLE]

Notice that

[TABLE]

Since $\{X_{n,q}^{-k}\leq\varepsilon\}\subseteq\{1\leq L_{n}(u_{1})\leq\varepsilon^{1/q}a_{n}^{-1}\}\cap\cdots\cap\{1\leq L_{n}(u_{q})\leq\varepsilon^{1/q}a_{n}^{-1}\}$ , it is not difficult to see that

[TABLE]

where we have used Jensen’s inequality to obtain the second inequality. Finally, by our choice of $\varepsilon$ (recall assumption (144)), we observe that

[TABLE]

This clearly implies (145) and concludes our proof. ∎

Similarly, we also provide a useful way to verify condition (c) in Lemma 11.

Lemma 13.

Let ${\mathbb{T}}_{n}$ be a rooted tree. Suppose that there exists a sequence $(a_{n})_{n\geq 1}$ of non-negative real numbers with $\lim_{n\rightarrow\infty}a_{n}=0$ , $\lim_{n\rightarrow\infty}na_{n}=\infty$ and such that the condition (b) in Lemma 11 holds with a random variable $\zeta_{1}$ satisfying ${\mathbb{E}}[\zeta_{1}^{-r/k}]<\infty$ for every $r\in\{1,\dots,k\}$ . Furthermore, assume that for every $r\in\{1,\dots,k\}$ there exists $\delta>0$ such that for all $\varepsilon\in(0,\delta)$

[TABLE]

where $W_{i}({\mathbb{T}}_{n})$ denotes the number of vertices at depth $i\in{\mathbb{Z}}_{\geq 0}$ in ${\mathbb{T}}_{n}$ . Then the condition (c) in Lemma 11 is fulfilled.

Proof.

It should be clear that this can be shown along the lines of the proof of Lemma 12, and therefore, we omit its proof. ∎

4.3 Trees of logarithmic height

Natural examples of trees that fulfil the conditions of Theorem 4 are random trees with logarithmic height, i.e., trees ${\mathbb{T}}_{n}$ such that $\max_{v\in{\mathbb{T}}_{n}}d_{n}(v)=O_{\rm p}(\ln n)$ . Examples include random split trees, uniform random recursive trees, scale-free random trees and mixtures of complete regular trees.

4.3.1 Complete binary trees

We use the notation $\lg_{2}n=(\ln n)/(\ln 2)$ for the logarithm with base $2$ of $n\in\mathbb{N}$ . Let ${\mathbb{T}}_{n}^{\rm bi}$ be a complete binary tree with $n\in{\mathbb{N}}$ vertices, so that its height is $\lfloor\lg_{2}n\rfloor$ . Recall that ${\mathbb{T}}_{n}^{\rm bi}$ has $2^{i}$ vertices at height $i\in\{0,1,\dots,\lfloor\lg_{2}n\rfloor-1\}$ and $n-2^{\lfloor\lg_{2}n\rfloor}+1$ vertices at height $\lfloor\lg_{2}n\rfloor$ ; moreover, the vertices at height $\lfloor\lg_{2}n\rfloor$ occupy the leftmost positions among the $2^{\lfloor\lg_{2}n\rfloor}$ possible ones; see, e.g., [28, Page 401]. It should be clear that condition (a) in Theorem 4 is satisfied with $a_{n}=(\lg_{2}n)^{-1}$ . Furthermore, one readily checks that $(\lg_{2}n)^{-1}(L_{n}(u_{1}),L_{n}(u_{1},u_{2})){\,{\buildrel d\over{\rightarrow}}\,}(1,2)$ , as $n\rightarrow\infty$ . By a simple application of [5, Corollary 1], this implies that condition (b) in Theorem 4 is satisfied with $\zeta_{1}\equiv 1$ . Notice that each vertex in ${\mathbb{T}}_{n}^{\rm bi}$ has at most $2$ children. Then it should be clear that condition (c) of Theorem 4 follows from Lemma 12 since ${\mathbb{E}}[W_{i}({\mathbb{T}}_{n}^{\rm bi})]\leq 2^{i}$ for $i\in{\mathbb{Z}}_{\geq 0}$ . Therefore, Theorem 4 implies that $n^{-1}(\lg_{2}n)^{1/k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}^{\rm bi}){\,{\buildrel d\over{\rightarrow}}\,}Z_{1}$ , as $n\rightarrow\infty$ , where $Z_{1}$ is the random variable whose law is determined entirely by its moments: ${\mathbb{E}}[Z_{1}^{0}]=1$ , and for $q\in{\mathbb{N}}$ ,

[TABLE]

It should be clear that Lemma 11 and Lemma 13 imply that ${\mathbb{E}}[{\mathcal{K}}_{r}({\mathbb{T}}_{n}^{\rm bi})]=O(n(\lg_{2}n)^{-r/k})$ for $r\in\{1,\dots,k\}$ . Therefore, by the identity (6) and Markov’s inequality, $n^{-1}(\lg_{2}n)^{1/k}{\mathcal{K}}({\mathbb{T}}_{n}^{\rm bi}){\,{\buildrel d\over{\rightarrow}}\,}Z_{1}$ , as $n\rightarrow\infty$ . In fact, it follows from Lemma 14 below that $Z_{1}$ is deterministic, namely $Z_{1}\equiv(k!)^{\frac{1}{k}}\Gamma\left(1+1/k\right)$ . Therefore, we actually have

[TABLE]
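The level structure of the complete binary tree used in this subsection can be checked numerically. The following is a minimal sketch, not part of the paper: the function names `level_sizes` and `mean_depth` are hypothetical, the height is taken to be $\lfloor\lg_{2}n\rfloor$ , and the mean depth of a uniformly chosen vertex is used only as a rough proxy for the quantity $L_{n}(u_{1})$ .

```python
import math

def level_sizes(n):
    """Number of vertices at each depth of a complete binary tree on n vertices.

    Depths 0..m-1 are full (2**i vertices each), where m = floor(log2(n));
    the remaining n - 2**m + 1 vertices sit at depth m.
    """
    m = n.bit_length() - 1              # exact floor(log2(n)) for integers
    sizes = [2 ** i for i in range(m)]  # the full levels
    sizes.append(n - 2 ** m + 1)        # the last, possibly partial, level
    return sizes

def mean_depth(n):
    """Average depth of a vertex chosen uniformly at random."""
    sizes = level_sizes(n)
    return sum(i * w for i, w in enumerate(sizes)) / n

print(level_sizes(10))  # [1, 2, 4, 3]
for exponent in (10, 20, 40):
    n = 2 ** exponent - 1
    # The ratio approaches 1 slowly, with an error of order 1 / log2(n).
    print(n, mean_depth(n) / math.log2(n))
```

The slow approach of the ratio to $1$ reflects that the typical depth differs from $\lg_{2}n$ by a bounded amount, which disappears only after dividing by $\lg_{2}n$ .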

Remark 3.

As Theorem 1.1 of [11] shows, ${\mathcal{K}}({\mathbb{T}}^{\rm bi})$ , after proper shifting and rescaling, also converges to a non-degenerate limit distribution with an infinite mean. Thus it is not possible to derive the result in [11] with the method of moments which we use to derive Theorem 1 for conditioned Galton-Watson trees. The same is true for split trees, random recursive trees and scale-free trees.

Lemma 14.

For $q\in{\mathbb{N}}$ , we have that

[TABLE]

Proof.

By making the change of variables $w_{i}=x^{k}_{i}/k!$ , for $1\leq i\leq q$ , we notice that the integral at the right-hand side of (155) is equal to

[TABLE]

To see the last identity, we notice that the integral on the left-hand side is simply the probability that $G_{1}\geq G_{2}\geq\dots\geq G_{q}$ , where $G_{1},\dots,G_{q}$ are independent $\text{Gamma}(1/k,1)$ random variables, which is equal to $1/q!$ since each ordering of $G_{1},\dots,G_{q}$ is equally likely. ∎
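The symmetry argument in this proof is easy to test by simulation. The sketch below is illustrative only: the helper names `ordering_probability` and `limit_constant` are hypothetical, the Monte Carlo estimate is subject to sampling error, and the constant evaluated is the value $(k!)^{1/k}\Gamma(1+1/k)$ stated in Section 4.3.1.

```python
import math
import random

def ordering_probability(k, q, trials=100_000, seed=0):
    """Monte Carlo estimate of P(G_1 >= G_2 >= ... >= G_q) for
    independent Gamma(1/k, 1) random variables."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(trials):
        g = [rng.gammavariate(1 / k, 1.0) for _ in range(q)]
        if all(g[i] >= g[i + 1] for i in range(q - 1)):
            hits += 1
    return hits / trials

def limit_constant(k):
    """The deterministic limit (k!)^(1/k) * Gamma(1 + 1/k)."""
    return math.factorial(k) ** (1 / k) * math.gamma(1 + 1 / k)

for q in (2, 3, 4):
    # The estimate should be close to 1/q! whatever the value of k.
    print(q, ordering_probability(k=3, q=q), 1 / math.factorial(q))

for k in (1, 2, 3):
    print(k, limit_constant(k))
```

For $k=1$ the constant equals $1$ , and for $k=2$ it equals $\sqrt{\pi/2}$ , since $\Gamma(3/2)=\sqrt{\pi}/2$ .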

4.3.2 Split trees

The class of random split trees was first introduced by Devroye [13] to encompass many families of trees that are frequently used in algorithm analysis, e.g., binary search trees and tries. Its exact construction is somewhat lengthy and we refer readers to either the original algorithmic definition in [13, 21] or the more probabilistic version in [10, Section 2]. Informally speaking, a split tree ${\mathbb{T}}_{n}^{\rm sp}$ is constructed by first distributing $n\in{\mathbb{N}}$ balls among the vertices of an infinite $b$ -ary tree ( $b\in{\mathbb{N}}\setminus\{1\}$ ) and then removing all subtrees without balls. Each vertex in the infinite $b$ -ary tree is given a random non-negative split vector ${\mathcal{V}}=(V_{1},\dots,V_{b})$ such that $\sum_{i=1}^{b}V_{i}=1$ and $V_{i}\geq 0$ , drawn independently from the same distribution. These vectors affect how balls are distributed. In the study of split trees, the following condition on ${\mathcal{V}}$ is often assumed (see, e.g., Holmgren [21]):

Condition A. The split vector ${\mathcal{V}}$ is permutation invariant. Moreover, ${\mathbb{P}}(V_{1}=1)={\mathbb{P}}(V_{1}=0)=0$ , and $-\log(V_{1})$ is non-lattice.

Set $\mu\coloneqq b{\mathbb{E}}[-V_{1}\ln V_{1}]\in(0,\ln b)$ . Devroye [13] showed that $\max_{v\in{\mathbb{T}}_{n}^{\rm sp}}d_{n}(v)=O_{\rm p}(\ln n)$ , that is, condition (a) in Theorem 4 is satisfied with $a_{n}=\mu(\ln n)^{-1}$ . Berzunza et al. [7, Lemma 5 and Corollary 1] have shown that $\mu(\ln n)^{-1}(L_{n}(u_{1}),L_{n}(u_{1},u_{2})){\,{\buildrel d\over{\rightarrow}}\,}(1,2)$ , as $n\rightarrow\infty$ . By a simple application of [5, Corollary 1], this implies that condition (b) in Theorem 4 is satisfied with $\zeta_{1}\equiv 1$ . Notice that each vertex in ${\mathbb{T}}_{n}^{\rm sp}$ has at most $b$ children. Then it should be clear that condition (c) of Theorem 4 follows from Lemma 12 since ${\mathbb{E}}[W_{i}({\mathbb{T}}_{n}^{\rm sp})]\leq b^{i}$ for $i\in{\mathbb{Z}}_{\geq 0}$ . Therefore, Theorem 4 implies that $\mu^{-1/k}n^{-1}(\ln n)^{1/k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}^{\rm sp}){\,{\buildrel d\over{\rightarrow}}\,}Z_{1}$ , as $n\rightarrow\infty$ , where $Z_{1}$ is the random variable whose law is determined entirely by its moments given in (155). Furthermore, Lemma 11 and Lemma 13 imply that ${\mathbb{E}}[{\mathcal{K}}_{r}({\mathbb{T}}_{n}^{\rm sp})]=O(n(\ln n)^{-r/k})$ for $r\in\{1,\dots,k\}$ . Therefore, by the identity (6) and Markov’s inequality,

[TABLE]

4.3.3 Uniform random recursive trees

A uniform random recursive tree ${\mathbb{T}}_{n}^{\rm rr}$ is a random tree of $n\in{\mathbb{N}}$ vertices constructed recursively as follows: let ${\mathbb{T}}_{1}^{\rm rr}$ be the tree consisting of a single vertex labelled $1$ ; given ${\mathbb{T}}_{n-1}^{\rm rr}$ , choose a vertex in ${\mathbb{T}}_{n-1}^{\rm rr}$ uniformly at random and attach a vertex labelled $n$ to the selected vertex as its child, which gives ${\mathbb{T}}_{n}^{\rm rr}$ . The uniform random recursive tree is one of the most studied random tree models. It appears, for instance, as a simple epidemic model, or in computer science as a data structure. We refer to [15, Chapter 6] for background. Theorem 6.32 in [15] shows that $\max_{v\in{\mathbb{T}}_{n}^{\rm rr}}d_{n}(v)=O_{\rm p}(\ln n)$ , that is, condition (a) in Theorem 4 is satisfied with $a_{n}=(\ln n)^{-1}$ . From the results of Dobrow [14] (see also [15, Section 2.5.5]), it is not difficult to see that $(\ln n)^{-1}(L_{n}(u_{1}),L_{n}(u_{1},u_{2})){\,{\buildrel d\over{\rightarrow}}\,}(1,2)$ , as $n\rightarrow\infty$ . By a simple application of [5, Corollary 1], this implies that condition (b) in Theorem 4 is satisfied with $\zeta_{1}\equiv 1$ . By [17, Equation (11)],

[TABLE]

uniformly for $n\geq 3$ and $1\leq i\leq K\ln n$ , for all $K\geq 1$ . Then it should be clear that condition (c) of Theorem 4 follows from Lemma 12. Therefore, Theorem 4 implies that $n^{-1}(\ln n)^{1/k}{\mathcal{K}}_{1}({\mathbb{T}}_{n}^{\rm rr}){\,{\buildrel d\over{\rightarrow}}\,}Z_{1}$ , as $n\rightarrow\infty$ , where $Z_{1}$ is the random variable whose law is entirely determined by its moments given in (155). Furthermore, Lemma 11 and Lemma 13 imply that ${\mathbb{E}}[{\mathcal{K}}_{r}({\mathbb{T}}_{n}^{\rm rr})]=O(n(\ln n)^{-r/k})$ for $r\in\{1,\dots,k\}$ . Therefore, by the identity (6) and Markov’s inequality,

[TABLE]
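The recursive construction of ${\mathbb{T}}_{n}^{\rm rr}$ described at the start of this subsection can be sketched as follows. This is an illustration under stated assumptions rather than code from the paper: the function name `random_recursive_tree_depths` is hypothetical, only the depths are stored (the full tree is not needed for the check), and the depth of a vertex is used as a rough proxy for $L_{n}(u_{1})$ .

```python
import math
import random

def random_recursive_tree_depths(n, seed=None):
    """Depths of the vertices 1..n of a uniform random recursive tree.

    depth[j] is the depth of the vertex labelled j + 1; the root
    (label 1) has depth 0.
    """
    rng = random.Random(seed)
    depth = [0]
    for _ in range(2, n + 1):
        parent = rng.randrange(len(depth))  # uniform over existing vertices
        depth.append(depth[parent] + 1)     # the new vertex is its child
    return depth

n = 10_000
depths = random_recursive_tree_depths(n, seed=42)
# Both ratios stay bounded as n grows: logarithmic height, and a typical
# depth of order ln n.
print(max(depths) / math.log(n))
print(sum(depths) / n / math.log(n))
```

For moderate $n$ the second ratio is noticeably below $1$ , because the typical depth differs from $\ln n$ by a bounded additive term.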

4.3.4 Scale-free random trees

Scale-free random trees form a family of random trees that grow following a preferential attachment algorithm, and are commonly used to model complex real-world networks; see Móri [32]. A scale-free random tree ${\mathbb{T}}_{n}^{\rm sf}$ is a random tree of $n\in{\mathbb{N}}$ vertices constructed recursively as follows: Fix a parameter $\alpha\in(-1,\infty)$ , and start from the tree ${\mathbb{T}}_{1}^{\rm sf}$ that consists of a single edge connecting the vertices labelled $1$ and $2$ . Suppose that $T_{n}^{\rm sf}$ has been constructed for some $n\geq 1$ , and for every $i\in\{1,\dots,n+1\}$ , denote by ${\rm deg}_{n}(i)$ the degree of the vertex $i$ in $T_{n}^{\rm sf}$ . Then conditionally given $T_{n}^{\rm sf}$ , $T_{n+1}^{\rm sf}$ is built by adding an edge between the new vertex $n+2$ and a vertex $v_{n}$ in $T_{n}^{\rm sf}$ chosen at random according to the law

[TABLE]

The standard preferential attachment tree (also known as plane-oriented recursive tree) was made popular by Barabási and Albert [4] and it corresponds to the choice of $\alpha=0$ . On the other hand, if one lets $\alpha\rightarrow\infty$ , then the algorithm yields a uniform random recursive tree. Janson [26] showed that scale-free random trees can also be viewed as split trees with the branching factor $b=\infty$ .
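The growth rule can be sketched in a few lines. The sketch below rests on an assumption that should be checked against the displayed law: it takes the probability of attaching to vertex $i$ to be proportional to ${\rm deg}_{n}(i)+\alpha$ , which is the usual convention for this model and is consistent with the two special cases just mentioned ( $\alpha=0$ and $\alpha\rightarrow\infty$ ). The function name `scale_free_tree_parents` is hypothetical, and the quadratic running time is acceptable only for small trees.

```python
import random

def scale_free_tree_parents(n, alpha, seed=None):
    """Grow a tree with n edges on the vertices 1..n+1 and return its parent map.

    ASSUMPTION: the new vertex attaches to vertex i with probability
    proportional to deg(i) + alpha, where alpha > -1.
    """
    if alpha <= -1:
        raise ValueError("alpha must be greater than -1")
    rng = random.Random(seed)
    parent = {2: 1}            # start from the single edge {1, 2}
    degree = {1: 1, 2: 1}
    for new in range(3, n + 2):
        vertices = list(degree)
        # Every degree is at least 1, so all weights are positive.
        weights = [degree[i] + alpha for i in vertices]
        target = rng.choices(vertices, weights=weights)[0]
        parent[new] = target
        degree[target] += 1
        degree[new] = 1
    return parent

parents = scale_free_tree_parents(200, alpha=0.0, seed=7)
print(len(parents))  # 200 edges, one per non-root vertex
```

Large values of `alpha` make the weights nearly equal, so the attachment becomes close to uniform, in line with the remark about uniform random recursive trees.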

Pittel [35] showed that $\max_{v\in{\mathbb{T}}_{n}^{\rm sf}}d_{n}(v)=O_{\rm p}(\ln n)$ , that is, condition (a) in Theorem 4 is satisfied with $a_{n}=(\beta\ln n)^{-1}$ , where $\beta\coloneqq(1+\alpha)/(2+\alpha)$ . From the results of Borovkov and Vatutin [9] (see the bibliography therein for further references), it is not difficult to see that $(\beta\ln n)^{-1}(L_{n}(u_{1}),L_{n}(u_{1},u_{2})){\,{\buildrel d\over{\rightarrow}}\,}(1,2)$ , as $n\rightarrow\infty$ . By a simple application of [5, Corollary 1], this implies that condition (b) in Theorem 4 is satisfied with $\zeta_{1}\equiv 1$ . Hwang [22, Equation 8] showed that, for $\alpha=0$ , i.e., for the standard preferential attachment tree,

[TABLE]

uniformly for $1\leq i\leq K\ln n$ for all $K\geq 1$ . Thus by an argument similar to that for uniform random recursive trees, we have for $\alpha=0$ ,

[TABLE]

Open problem. To apply Theorem 4 to general scale-free trees, we need an estimate of ${\mathbb{E}}\left[W_{i}({\mathbb{T}}^{\rm sf}_{n})\right]$ for all $\alpha>-1$ , which is currently missing in the literature. Thus we leave it as an open problem to show that an estimate similar to (157) holds for all $\alpha>-1$ . This would imply that the convergence in (158) holds for all scale-free trees.

Remark 4.

In all the previous examples of Section 4.3, the limit distributions are degenerate. However, we conjecture that another normalization should yield non-degenerate limits. This is known to be the case, when $k=1$ , for complete binary trees (Janson [24]), recursive trees (Drmota et al. [16], Iksanov and Möhle [23]), binary search trees (Holmgren [19]) and split trees (Holmgren [20]). In the general case $k\geq 1$ , Cai and Holmgren [11] also obtained a weak limit theorem for complete binary trees, which supports our conjecture.

4.3.5 Mixture of regular trees

Our next example provides a method to build trees that fulfill the conditions of Theorem 4 where the random variables $\zeta_{1},\zeta_{2},\dots$ in the hypotheses are not constants. Basically, the procedure consists of gluing together trees which satisfy the assumptions of Theorem 4. In this example, we consider a mixture of complete regular trees but one may consider other families of trees as well. For a fixed integer $m\geq 1$ , let $(d_{i})_{i=1}^{m}$ denote a sequence of positive integers. Next, for $i=1,\dots,m$ , let $h_{i}:\mathbb{R}_{+}\rightarrow\mathbb{R}_{+}$ be a function with $\lim_{n\rightarrow\infty}h_{i}(n)=\infty$ . Let $T_{n_{i}}^{(d_{i})}$ be a complete $d_{i}$ -regular tree with height $\lfloor h_{i}(n)\rfloor$ . Since there are $d_{i}^{j}$ vertices at distance $j=0,1,\dots,\lfloor h_{i}(n)\rfloor$ from the root, its size is given by

$\displaystyle n_{i}=\sum_{j=0}^{\lfloor h_{i}(n)\rfloor}d_{i}^{j}.$

In particular, one can check that each tree $T_{n_{i}}^{(d_{i})}$ fulfills the assumptions in Theorem 4 with $a_{n}=(\ln n_{i})^{-1}$ and $\zeta_{1}=(\ln d_{i})^{-1}$ ; note that condition (c) in Theorem 4 follows from Lemma 12 and the fact that the number of children of each vertex is bounded. Now imagine that we merge the $m$ regular trees at a common root. This leads us to a new tree $T_{n}^{(d)}$ of size $n=\sum_{i=1}^{m}n_{i}+1-m$ . Assume further that $n_{1}\sim n_{2}\sim\cdots\sim n_{m}$ , as $n\rightarrow\infty$ . Then the probability that a vertex of $T_{n}^{(d)}$ chosen uniformly at random belongs to the tree $T_{n_{i}}^{(d_{i})}$ converges to $1/m$ as $n\rightarrow\infty$ . Consequently, one readily checks that this new tree satisfies the hypotheses in Theorem 4 with $a_{n}=(\ln n)^{-1}$ , where $\zeta_{1},\zeta_{2},\dots$ are i.i.d. random variables uniformly distributed on the set $\{1/\ln d_{1},\dots,1/\ln d_{m}\}$ .
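The value $\zeta_{1}=(\ln d_{i})^{-1}$ for a single complete regular tree can be illustrated numerically. The following sketch is not from the paper: the function names are hypothetical, it assumes $d\geq 2$ , and it uses the mean depth of a uniformly chosen vertex as a rough proxy for $L_{n}(u_{1})$ . It compares the mean depth divided by the logarithm of the size with $1/\ln d$ .

```python
import math

def complete_tree_size(d, h):
    """Size of a complete d-regular tree of height h: d**j vertices at depth j."""
    return sum(d ** j for j in range(h + 1))

def mean_depth(d, h):
    """Average depth of a vertex chosen uniformly at random."""
    total = sum(j * d ** j for j in range(h + 1))
    return total / complete_tree_size(d, h)

def rescaled_depth(d, h):
    """Mean depth divided by the natural logarithm of the size."""
    return mean_depth(d, h) / math.log(complete_tree_size(d, h))

for d in (2, 3, 5):
    # The two printed values agree up to an error of order 1 / h.
    print(d, rescaled_depth(d, 400), 1 / math.log(d))
```

Python integers are unbounded, so the sizes are computed exactly even for large heights; only the final ratio is a floating-point number.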

Acknowledgements.

This work is supported by the Knut and Alice Wallenberg Foundation, a grant from the Swedish Research Council and The Swedish Foundations’ starting grant from Ragnar Söderbergs Foundation.

Bibliography

[1] L. Addario-Berry, N. Broutin, and C. Holmgren, Cutting down trees with a Markov chainsaw, Ann. Appl. Probab. 24 (2014), no. 6, 2297–2339. MR 3262504
[2] D. Aldous, The continuum random tree. II. An overview, Stochastic analysis (Durham, 1990), London Math. Soc. Lecture Note Ser., vol. 167, Cambridge Univ. Press, Cambridge, 1991, pp. 23–70. MR 1166406
[3] D. Aldous, The continuum random tree. III, Ann. Probab. 21 (1993), no. 1, 248–289. MR 1207226
[4] A.-L. Barabási and R. Albert, Emergence of Scaling in Random Networks, Science 286 (1999), no. 5439, 509–512.
[5] J. Bertoin, Almost giant clusters for percolation on large trees with logarithmic heights, J. Appl. Probab. 50 (2013), no. 3, 603–611.
[6] J. Bertoin and G. Miermont, The cut-tree of large Galton-Watson trees and the Brownian CRT, Ann. Appl. Probab. 23 (2013), no. 4, 1469–1493. MR 3098439
[7] G. Berzunza, X. Shi Cai, and C. Holmgren, The asymptotic non-normality of the giant cluster for percolation on random split trees, arXiv e-prints (2019), arXiv:1902.08109.
[8] R. M. Blumenthal, Excursions of Markov processes, Probability and its Applications, Birkhäuser Boston, Inc., Boston, MA, 1992. MR 1138461
