Scaling limit of random forests with prescribed degree sequences

Tao Lei

arXiv:1704.02064·math.PR·April 10, 2017

Scaling limit of random forests with prescribed degree sequences

Tao Lei

PDF

TL;DR

This paper investigates the scaling limits of random plane forests with fixed degree sequences, demonstrating convergence to a continuum random tree in the Gromov-Hausdorff-Prokhorov topology, extending Aldous's framework.

Contribution

It establishes the Gromov-Hausdorff-Prokhorov convergence of large random forests with prescribed degrees to a continuum limit, using excursions of first passage bridges.

Findings

01

Convergence of random forests to Brownian Continuum Random Tree

02

Identification of the limit as a sequence of real trees encoded by excursions

03

Utilization of Lukasiewicz walks to study scaling limits

Abstract

In this paper, we consider the random plane forest uniformly drawn from all possible plane forests with a given degree sequence. Under suitable conditions on the degree sequences, we consider the limit of a sequence of such forests with the number of vertices tends to infinity in terms of Gromov-Hausdorff-Prokhorov topology. This work falls into the general framework of showing convergence of random combinatorial structures to certain Gromov-Hausdorff scaling limits, described in terms of the Brownian Continuum Random Tree, pioneered by the work of Aldous. In fact we identify the limiting random object as a sequence of random real trees encoded by excursions of some first passage bridges reflected at minimum. We establish such convergence by studying the associated Lukasiewicz walk of the degree sequences. In particular, our work is closely related to and uses the results from the…

Figures2

Click any figure to enlarge with its caption.

Equations274

U = n = 0 ⋃ \infty N^{n},

U = n = 0 ⋃ \infty N^{n},

s^{(i)} (T) = ∣ {u \in T : k_{T} (u) = i} ∣.

s^{(i)} (T) = ∣ {u \in T : k_{T} (u) = i} ∣.

s^{(i)} (F) = j = 1 \sum m s^{(i)} (T_{j}) .

s^{(i)} (F) = j = 1 \sum m s^{(i)} (T_{j}) .

i \geq 0 \sum i s^{(i)} (F) = j = 1 \sum m u \in T_{j} \sum k_{T_{j}} (u) = j = 1 \sum m (∣ T_{j} ∣ - 1)

i \geq 0 \sum i s^{(i)} (F) = j = 1 \sum m u \in T_{j} \sum k_{T_{j}} (u) = j = 1 \sum m (∣ T_{j} ∣ - 1)

n (s) := i \geq 0 \sum s^{(i)}, Δ (s) := max {i : s^{(i)} > 0} .

n (s) := i \geq 0 \sum s^{(i)}, Δ (s) := max {i : s^{(i)} > 0} .

(F_{λ}^{b r} (t), 0 \leq t \leq 1) = d (B (t), 0 \leq t \leq 1 ∣ T_{λ} = 1)

(F_{λ}^{b r} (t), 0 \leq t \leq 1) = d (B (t), 0 \leq t \leq 1 ∣ T_{λ} = 1)

E [f ((B_{l}^{b r} (t))_{0 \leq t \leq m})] = E [f ((B (t))_{0 \leq t \leq m}) \frac{p _{1 - m} ( - l - B ( m ))}{p _{1} ( - l )}]

E [f ((B_{l}^{b r} (t))_{0 \leq t \leq m})] = E [f ((B (t))_{0 \leq t \leq m}) \frac{p _{1 - m} ( - l - B ( m ))}{p _{1} ( - l )}]

E [f ((F_{λ}^{b r} (t))_{0 \leq t \leq s})] = E [(f (B (t))_{0 \leq t \leq s}) \frac{p _{1 - s}^{'} ( - λ - B ( s ))}{p _{1}^{'} ( - λ )} \mathbbm 1_{{r \leq s i n f B (r) > - λ}}]

E [f ((F_{λ}^{b r} (t))_{0 \leq t \leq s})] = E [(f (B (t))_{0 \leq t \leq s}) \frac{p _{1 - s}^{'} ( - λ - B ( s ))}{p _{1}^{'} ( - λ )} \mathbbm 1_{{r \leq s i n f B (r) > - λ}}]

d_{G H} ((X, d), (X^{'}, d^{'})) = ϕ, ϕ^{'}, Z in f d_{H}^{Z} (ϕ (X), ϕ^{'} (X^{'})),

d_{G H} ((X, d), (X^{'}, d^{'})) = ϕ, ϕ^{'}, Z in f d_{H}^{Z} (ϕ (X), ϕ^{'} (X^{'})),

d_{H}^{Z} (A, B) = in f {ϵ > 0 : A \subset B^{ϵ}, B \subset A^{ϵ}},

d_{H}^{Z} (A, B) = in f {ϵ > 0 : A \subset B^{ϵ}, B \subset A^{ϵ}},

A^{ϵ} = {z \in Z : y \in A in f d^{Z} (y, z) < ϵ} .

A^{ϵ} = {z \in Z : y \in A in f d^{Z} (y, z) < ϵ} .

d_{G H P} (X, X^{'}) = Φ, Φ^{'}, Z in f (d^{Z} (Φ (\emptyset), Φ^{'} (\emptyset^{'})) + d_{H}^{Z} (Φ (X), Φ^{'} (X^{'})) + d_{P}^{Z} (Φ_{*} μ, Φ_{*}^{'} μ^{'}))

d_{G H P} (X, X^{'}) = Φ, Φ^{'}, Z in f (d^{Z} (Φ (\emptyset), Φ^{'} (\emptyset^{'})) + d_{H}^{Z} (Φ (X), Φ^{'} (X^{'})) + d_{P}^{Z} (Φ_{*} μ, Φ_{*}^{'} μ^{'}))

d_{P}^{Z} (μ, ν) = in f {ϵ > 0 : μ (A) \leq ν (A^{ϵ}) + ϵ, ν (A) \leq μ (A^{ϵ}) + ϵ \mbox f or an y c l ose d se t A} .

d_{P}^{Z} (μ, ν) = in f {ϵ > 0 : μ (A) \leq ν (A^{ϵ}) + ϵ, ν (A) \leq μ (A^{ϵ}) + ϵ \mbox f or an y c l ose d se t A} .

d_{G H P}^{\infty} (X, X^{'}) = j \geq 1 sup d_{G H P} (X_{j}, X_{j}^{'}) .

d_{G H P}^{\infty} (X, X^{'}) = j \geq 1 sup d_{G H P} (X_{j}, X_{j}^{'}) .

L_{\infty} = {X \in K^{N} : j \to \infty lim sup d_{G H P} (X_{j}, Z) = 0} .

L_{\infty} = {X \in K^{N} : j \to \infty lim sup d_{G H P} (X_{j}, Z) = 0} .

d_{g}^{\circ} (s, t) = g (s) + g (t) - 2 m_{g} (s, t)

d_{g}^{\circ} (s, t) = g (s) + g (t) - 2 m_{g} (s, t)

m_{g} (s, t) = s \land t \leq r \leq s \lor t min g (r) .

m_{g} (s, t) = s \land t \leq r \leq s \lor t min g (r) .

F_{κ}^{↓} \mbox d (T_{γ_{l}}, l \geq 1) \mbox a s κ \to \infty,

F_{κ}^{↓} \mbox d (T_{γ_{l}}, l \geq 1) \mbox a s κ \to \infty,

((X_{κ, l})_{l \leq j}, (T_{κ, l})_{l \leq j}) \to d ((∣ γ_{l} ∣)_{l \leq j}, (T_{∣ γ_{l} ∣ e_{l}})_{l \leq j})

((X_{κ, l})_{l \leq j}, (T_{κ, l})_{l \leq j}) \to d ((∣ γ_{l} ∣)_{l \leq j}, (T_{∣ γ_{l} ∣ e_{l}})_{l \leq j})

(∣ T_{κ, l} ∣/ n_{κ})_{l \geq 1} \to d (∣ γ_{l} ∣)_{l \geq 1}

(∣ T_{κ, l} ∣/ n_{κ})_{l \geq 1} \to d (∣ γ_{l} ∣)_{l \geq 1}

S_{F} (i) = j = 1 \sum i (k_{F} (u_{j}) - 1) \mbox f or i = 1, 2, \dots, ∣ F ∣.

S_{F} (i) = j = 1 \sum i (k_{F} (u_{j}) - 1) \mbox f or i = 1, 2, \dots, ∣ F ∣.

(\frac{S _{F_{κ}} ( t n _{κ} )}{σ ( p _{κ} ) n _{κ} ^{1/2}})_{t \in [0, 1]} \to d F_{λ}^{b r}

(\frac{S _{F_{κ}} ( t n _{κ} )}{σ ( p _{κ} ) n _{κ} ^{1/2}})_{t \in [0, 1]} \to d F_{λ}^{b r}

j \to \infty lim κ \to \infty lim sup P (l > j sup diam (T_{κ, l}) > a) = 0.

j \to \infty lim κ \to \infty lim sup P (l > j sup diam (T_{κ, l}) > a) = 0.

P (h (T (s)) \geq m) \leq 7 exp (- m^{2} /608 σ^{2} (s) 1_{s}^{2})

P (h (T (s)) \geq m) \leq 7 exp (- m^{2} /608 σ^{2} (s) 1_{s}^{2})

P (S_{k} \geq λ \frac{k}{n} S_{n}) \leq exp (- \frac{3 σ ^{2} ( c )}{16 n} \cdot \frac{λk}{Δ ^{2}}) .

P (S_{k} \geq λ \frac{k}{n} S_{n}) \leq exp (- \frac{3 σ ^{2} ( c )}{16 n} \cdot \frac{λk}{Δ ^{2}}) .

j \to \infty lim κ \to \infty lim sup P (l > j sup (diam (T_{κ, l}) + mass (T_{κ, l}) + diam (T_{γ_{l}}) + mass (T_{γ_{l}})) > a) = 0.

j \to \infty lim κ \to \infty lim sup P (l > j sup (diam (T_{κ, l}) + mass (T_{κ, l}) + diam (T_{γ_{l}}) + mass (T_{γ_{l}})) > a) = 0.

j \to \infty lim κ \to \infty lim sup P (l > j sup diam (T_{κ, l}) > a) = 0, j \to \infty lim κ \to \infty lim sup P (l > j sup mass (T_{κ, l}) > a) = 0

j \to \infty lim κ \to \infty lim sup P (l > j sup diam (T_{κ, l}) > a) = 0, j \to \infty lim κ \to \infty lim sup P (l > j sup mass (T_{κ, l}) > a) = 0

j \to \infty lim P (l > j sup diam (T_{γ_{l}}) > a) = 0, j \to \infty lim P (l > j sup mass (T_{γ_{l}}) > a) = 0.

j \to \infty lim P (l > j sup diam (T_{γ_{l}}) > a) = 0, j \to \infty lim P (l > j sup mass (T_{γ_{l}}) > a) = 0.

P (l > j sup mass (T_{κ, l}) > a)

P (l > j sup mass (T_{κ, l}) > a)

R (s) = F_{λ} (s) - s^{'} \in (0, s) in f F_{λ} (s^{'})

R (s) = F_{λ} (s) - s^{'} \in (0, s) in f F_{λ} (s^{'})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Scaling limit of random forests with prescribed degree sequences

Tao Lei

Department of Mathematics and Statistics, McGill University, 805 Sherbrooke Street West, Montréal, Québec, H3A 0B9, Canada

[email protected]

(Date: March 31, 2017)

Abstract.

In this paper, we consider the random plane forest uniformly drawn from all possible plane forests with a given degree sequence. Under suitable conditions on the degree sequences, we consider the limit of a sequence of such forests with the number of vertices tends to infinity in terms of Gromov-Hausdorff-Prokhorov topology. This work falls into the general framework of showing convergence of random combinatorial structures to certain Gromov-Hausdorff scaling limits, described in terms of the Brownian Continuum Random Tree (BCRT), pioneered by the work of Aldous [6, 7, 8]. In fact we identify the limiting random object as a sequence of random real trees encoded by excursions of some first passage bridges reflected at minimum. We establish such convergence by studying the associated Lukasiewicz walk of the degree sequences. In particular, our work is closely related to and uses the results from the recent work of Broutin and Marckert [16] on scaling limit of random trees with prescribed degree sequences, and the work of Addario-Berry [3] on tail bounds of the height of a random tree with prescribed degree sequence.

2010 Mathematics Subject Classification:

60C05

1. Introduction

Scaling limits for finite graphs is a topic at the intersection of combinatorics and probability. In this paper, we investigate the Gromov-Hausdorff-Prokhorov convergence of random forests with prescribed degree sequence. Our work is a natural continuation of [16] where it is shown that under natural hypotheses on the degree sequences, after suitable normalization, uniformly random trees with given degree sequence converge to Brownian continuum random tree, with the size of trees going to infinity.

In a series of papers [6, 7, 8], Aldous introduced the concept of Brownian continuum random tree (BCRT) and showed that critical Galton-Watson tree conditioned on its size has BCRT as limiting objects. Since then, many families of graphs have been shown to have BCRT or random processes derived from BCRT as their limiting objects. For example, multi-type Galton-Watson trees [26], unordered binary trees [24], critical Erdös-Rényi random graph [4], random planar maps with a unique large face [22], random planar quadrangulations with a boundary [13].

As in [16], our combinatorial model is motivated by the metric structure of graphs with a prescribed degree sequence. This model was first introduced by Bender and Canfield [11] and by Bollobás [15] in the form of the configuration model. This model can give rise to graphs with any particular (legitimate) prescribed degree sequence (including, e.g., heavy tailed degree distributions, a feature which is observed in realistic networks but is not captured by the Erdös-Rényi random graph model).

Our main results, which are stated formally in Section 1.2, are that, under natural assumptions on degree sequences and after suitable normalization, large uniformly random forests with given degree sequence converge in distribution to the forests coded by Brownian first passage bridge, with respect to the Gromov-Hausdorff-Prokhorov topology. In order to present these results rigorously, we need the following subsection to introduce the necessary concepts and notations involved.

1.1. Definitions and Notation

Plane trees and forests

We recall the following definition of plane trees (as in e.g. [19]). Let

[TABLE]

where $\mathbb{N}=\{1,2,\cdots\}$ and $\mathbb{N}^{0}=\{\emptyset\}$ . If $u=(u_{1},u_{2},\cdots,u_{n})\in\mathcal{U}$ we write $u=u_{1}u_{2}\cdots u_{n}$ for short and let $|u|=n$ be the generation of $u$ . If $u=u_{1}\cdots u_{m},v=v_{1}\cdots v_{n}$ , we write $uv=u_{1}\cdots u_{m}v_{1}\cdots v_{n}$ for the concatenation of $u$ and $v$ .

Definition 1.1.

A rooted plane tree $\mathrm{T}$ is a subset of $\mathcal{U}$ satisfying the following conditions:

(i) $\emptyset\in\mathrm{T}$ ;

(ii) If $v\in\mathrm{T}$ and $v=uj$ for some $u\in\mathcal{U}$ and $j\in\mathbb{N}$ , then $u\in\mathrm{T}$ ;

(iii) For every $u\in\mathrm{T}$ , there exists a number $k_{\mathrm{T}}(u)\geq 0$ such that $uj\in\mathrm{T}$ if and only if $1\leq j\leq k_{\mathrm{T}}(u)$ . We call $k_{\mathrm{T}}(u)$ the degree of $u$ in $\mathrm{T}$ .

We denote the lexicographic order on $\mathcal{U}$ by $<$ (e.g. $\emptyset<11<21<22$ ). The lexicographic order on $\mathcal{U}$ induces a total order on the set of all rooted plane trees.

We call a finite sequence of finite rooted plane trees $\mathrm{F}=(\mathrm{T}_{1},\mathrm{T}_{2},\cdots,\mathrm{T}_{m})$ a rooted plane forest. For forest $\mathrm{F}$ , we let $\mathrm{F}^{\downarrow}$ be the sequence of tree components of $\mathrm{F}$ in decreasing order of size, breaking ties lexicographically (if again tied, then as the original order of appearance in $\mathrm{F}$ ).

Definition 1.2.

A degree sequence is a sequence $\mathbf{s}=(s^{(i)},i\geq 0)$ of non-negative integers with $\sum\limits_{i\geq 0}s^{(i)}<\infty$ such that $c(\mathbf{s}):=\sum\limits_{i\geq 0}(1-i)s^{(i)}>0$ . For a plane tree $\mathrm{T}$ , the degree sequence $\mathbf{s}(\mathrm{T})=(s^{(i)}(\mathrm{T}),i\geq 0)$ is given by

[TABLE]

For a plane forest $\mathrm{F}=\left(\mathrm{T}_{1},\cdots,\mathrm{T}_{m}\right)$ , the degree sequence $\mathbf{s}(\mathrm{F})=(s^{(i)}(\mathrm{F}),i\geq 0)$ is given by

[TABLE]

Note that $c(\mathbf{s}(\mathrm{T}))=1$ for any plane tree $\mathrm{T}$ . In general since

[TABLE]

and $\sum\limits_{i\geq 0}s^{(i)}(\mathrm{F})=\sum\limits_{j=1}^{m}|\mathrm{T}_{j}|$ , the number of tree components in $\mathrm{F}$ is always $c(\mathbf{s}(\mathrm{F}))$ . For any degree sequence $\mathbf{s}$ , we adopt the notations

[TABLE]

Figure 1, below, shows a plane forest with degree sequence $\mathbf{s}=(7,2,2,1,0,\cdots)$ with $s^{(i)}=0$ for $i\geq 4$ .

For any degree sequence $\mathbf{s}=(s^{(i)},i\geq 0)$ , we let $\mathrm{F}(\textbf{s})$ denote the set of all plane forests with degree sequence s. Let $\mathbb{P}_{\mathbf{s}}$ be the uniform measure on $\mathrm{F}(\textbf{s})$ and let $\mathbb{F}(\mathbf{s})$ be a random plane forest with law $\mathbb{P}_{\mathbf{s}}$ .

First passage bridge

We also need to recall the following definition of first passage bridge as in [10]. Informally, for $\lambda>0$ , the first passage bridge of unit length from 0 to $-\lambda$ , denoted $F^{br}_{\lambda}$ , is a $C[0,1]-$ valued random variable with law

[TABLE]

where $B$ is a standard Brownian motion and $T_{\lambda}:=\inf\{t:B(t)<-\lambda\}$ is the first passage time below level $-\lambda<0$ .

For $l\geq 0$ , we write $B^{br}_{l}$ for the Brownian bridge of duration 1 from 0 to $-l$ . As explained in Proposition 1 of [21], the law of the Brownian bridge $B^{br}_{l}$ is characterized by $B^{br}_{l}(1)=-l$ and the formula

[TABLE]

for all bounded measurable function $f$ , and all $0\leq m<1$ , where $p_{a}$ is the Gaussian density with variance $a$ and mean 0, that is, $p_{a}(x)=\frac{1}{\sqrt{2\pi a}}e^{-\frac{x^{2}}{2a}}$ . In a similar way the law of $F^{br}_{\lambda}$ can be defined as the law such that

[TABLE]

for all bounded measurable functions $f$ and all $0\leq s<1$ and $F^{br}_{\lambda}(1)=-\lambda$ , where $p^{\prime}_{a}$ is the derivative of $p_{a}$ . These formulae set the finite-dimensional laws of the first passage bridge. In [12] (see Section 5.1 for details) it is shown that it admits a continuous version, and that $F^{br}_{\lambda}$ is the weak limit of $F^{\epsilon}_{\lambda}$ where $(F^{\epsilon}_{\lambda}(t),0\leq t\leq 1)$ has the law of $B$ conditioned on the event $\{B(1)<-\lambda+\epsilon,~{}\inf\limits_{s\leq 1}B(s)>-\lambda-\epsilon\}$ , hence justifying the informal conditioning definition.

Gromov-Hausdorff-Prokhorov distance

We recall the definition of the Gromov-Hausdorff distance (see for example Definition 7.3.10 in [17]). Let $(X,d)$ and $(X^{\prime},d^{\prime})$ be compact metric spaces. Then the Gromov-Hausdorff distance between $(X,d)$ and $(X^{\prime},d^{\prime})$ is given by

[TABLE]

where the infimum is taken over all isometric embeddings $\phi:X\hookrightarrow Z$ and $\phi^{\prime}:X^{\prime}\hookrightarrow Z$ into some common Polish metric space $(Z,d^{Z})$ and $d_{H}^{Z}$ denotes the Hausdorff distance between compact subsets of $Z$ , that is,

[TABLE]

where $A^{\epsilon}$ is the $\epsilon-$ enlargement of $A$ :

[TABLE]

Note that strictly speaking $d_{GH}$ is not a distance since different compact metric spaces can have GH distance zero.

A rooted measured metric space $\mathcal{X}=(X,d,\emptyset,\mu)$ is a metric space $(X,d)$ with a distinguished element $\emptyset\in X$ and a finite Borel measure $\mu$ . Note that the definitions in this subsection work in more general settings, e.g. $\mu$ could be a boundedly finite Borel measure (see [2]), but for the purpose of this paper, finite measure $\mu$ is enough.

Let $\mathcal{X}=(X,d,\emptyset,\mu)$ and ${\mathcal{X}}^{\prime}=(X^{\prime},d^{\prime},\emptyset^{\prime},\mu^{\prime})$ be two compact rooted measured metric spaces, they are GHP-isometric if there exists an isometric one-to-one map $\Phi:X\rightarrow X^{\prime}$ such that $\Phi(\emptyset)=\emptyset^{\prime}$ and $\Phi_{\ast}\mu=\mu^{\prime}$ where $\Phi_{\ast}\mu$ is the push forward of measure $\mu$ to $(X^{\prime},d^{\prime})$ , that is, $\Phi_{\ast}\mu(A)=\mu(\Phi^{-1}(A))$ for $A\in\mathcal{B}(X^{\prime})$ . In this case, call $\Phi$ a GHP-isometry.

Suppose both $\mathcal{X}$ and $\mathcal{X^{\prime}}$ are compact, then define the Gromov-Hausdorff-Prokhorov distance as:

[TABLE]

where the infimum is taken over all isometric embeddings $\Phi:X\hookrightarrow Z$ and $\Phi^{\prime}:X^{\prime}\hookrightarrow Z$ into some common Polish metric space $(Z,d^{Z})$ , and $d_{P}^{Z}$ denotes the Prokhorov distance between finite Borel measures on $Z$ , that is,

[TABLE]

Let $\mathbb{K}$ denote the set of GHP-isometry classes of compact rooted measured metric spaces and we identify $\mathcal{X}$ with its GHP-isometry class. We have the following results from [2]:

Theorem 1.3 (Theorem 2.5 in [2]).

The function $d_{GHP}$ defines a metric on $\mathbb{K}$ and the space $(\mathbb{K},d_{GHP})$ is a Polish metric space.

We next define a distance between sequences of rooted measured metric spaces. For $\mathbf{X}=(\mathcal{X}_{j},j\geq 1),\mathbf{X}^{\prime}=(\mathcal{X}^{\prime}_{j},j\geq 1)$ in $\mathbb{K}^{\mathbb{N}}$ , we let

[TABLE]

If $\mathbf{X}\in\mathbb{K}^{n}$ for some $n\in\mathbb{N}$ , in order to view $\mathbf{X}$ as a member of $\mathbb{K}^{\mathbb{N}}$ , we append to $\mathbf{X}$ an infinite sequence of zero metric spaces $\mathcal{Z}$ . Here $\mathcal{Z}$ is the rooted measured metric space consisting of a single point with measure 0. Let $\mathbf{Z}=(\mathcal{Z},\mathcal{Z},\cdots)$ and

[TABLE]

By definition of GHP distance it is not hard to see that $d_{GHP}(\mathcal{X},\mathcal{Z})=\frac{\mathrm{diam}(X)}{2}+\mu(X)$ , hence $\mathbf{X}\in\mathbb{L}_{\infty}$ if and only if $\limsup\limits_{j\rightarrow\infty}\left(\mathrm{diam}(X_{j})+\mu_{j}(X_{j})\right)=0$ . It is likewise straightforward to show that $(\mathbb{L}_{\infty},d_{GHP}^{\infty})$ is a complete separable metric space.

Real trees

Next we briefly recall the concepts of real trees and real trees coded by continuous functions. A more lengthy presentation about the probabilistic aspects of real trees can be found in [20, 23].

Definition 1.4.

A compact metric space $(T,d)$ is a real tree if the following hold for every $a,b\in T$ :

(i) There is a unique isometric map $f_{a,b}$ from $[0,d(a,b)]$ into $T$ such that $f_{a,b}(0)=a$ and $f_{a,b}(d(a,b))=b$ .

(ii) If $q$ is a continuous injective map from $[0,1]$ into $T$ , such that $q(0)=a$ and $q(1)=b$ , we have $q([0,1])=f_{a,b}([0,d(a,b)])$ .

A real tree $(T,d)$ is rooted if there is a distinguished vertex (the root) $\emptyset\in T$ and we denote a rooted real tree by $(T,d,\emptyset)$ . If there is a finite Borel measure $\mu$ on $T$ , then $(T,d,\emptyset,\mu)$ is a measured rooted real tree.

Next we show a way of constructing real trees from continuous functions. Let $g:[0,\infty)\rightarrow[0,\infty)$ be a continuous function with compact support and such that $g(0)=0$ . For every $s,t\geq 0$ , let

[TABLE]

where

[TABLE]

The function $d^{\circ}_{g}$ is a pseudometric on $[0,\infty)$ . Define an equivalence relation $\sim$ on $[0,\infty)$ by setting $s\sim t$ iff $d^{\circ}_{g}(s,t)=0$ . Then let $T_{g}=[0,\infty)/\sim$ and let $d_{g}$ be the induced distance on $T_{g}$ . Then $(T_{g},d_{g})$ is a real tree (see, e.g. Theorem 2.2 in [23]).

To get an intuition of this construction, for a rooted plane tree $\mathrm{T}$ with graph distance $d_{gr}$ , let $\hat{\mathrm{T}}$ be the metric space obtained from $\mathrm{T}$ by viewing each edge as an isometric copy of the unit interval $[0,1]$ , and imagine a particle exploring the tree, starting from the root and moving at unit speed. Each time the particle leaves a vertex $u$ , it moves to the lexicographically next unvisited child of $u$ , if such a child exists; otherwise it moves to the parent of $u$ . The exploration concludes the moment the particle has visited all vertices and returned to the root. Let $C:[0,2(|\mathrm{T}|-1)]\rightarrow[0,\infty)$ be such that $C(t)$ equals to the graph distance between the particle and the root at time $t$ . $C$ is called the contour function of $\mathrm{T}$ . Then the metric space $\mathcal{T}_{C}$ constructed from $C$ is isometric to $\hat{\mathrm{T}}$ .

Let $\emptyset_{g}$ denote the equivalence class of 0. Let $p_{g}$ be the canonical projection from $[0,\infty)$ to $T_{g}$ and $\sigma_{g}=sup\{t:g(t)>0\}$ . Let $\textbf{m}_{g}$ be the push forward of the Lebesgue measure on $[0,\sigma_{g}]$ ( $(\sigma_{g},\infty)$ has measure 0) by $p_{g}$ . Then $\mathcal{T}_{g}=(T_{g},d_{g},\emptyset_{g},\textbf{m}_{g})$ is a compact measured rooted real tree. In particular, $\mathcal{T}_{g}\in\mathbb{K}$ . Let $\mathbf{e}$ denote the standard Brownian excursion, then $\mathcal{T}_{\mathbf{e}}$ is called the Brownian continuum random tree (BCRT for short).

1.2. Statement of main theorems

For $c>0$ , let $c\mathbf{e}\in C[0,\infty)$ denote the Brownian excursion of length $c$ , that is $(c\mathbf{e})(s):=\sqrt{c}\mathbf{e}(\frac{s}{c}\wedge 1)$ for $s\geq 0$ . For any probability distribution $\textbf{p}=(p^{(i)},i\geq 0)$ on $\mathbb{N}$ , let $\mu(\textbf{p})=\sum\limits_{i\geq 0}ip^{(i)}$ and $\sigma^{2}(\textbf{p})=\sum\limits_{i\geq 0}i^{2}p^{(i)}$ .

In this paper we consider a sequence of degree sequences $(\mathbf{s}_{\kappa},\kappa\in\mathbb{N})$ , where $\mathbf{s}_{\kappa}=(s_{\kappa}^{(i)},i\geq 0)$ . We assume $\mathbf{n}_{\kappa}:=\sum\limits_{i\geq 0}s_{\kappa}^{(i)}\rightarrow\infty$ and let $\mathbb{F}_{\kappa}:=\mathbb{F}(\mathbf{s}_{\kappa})$ and write $\mathbb{F}_{\kappa}^{\downarrow}=(\mathbb{T}_{\kappa,l},~{}l\geq 1)$ . We write $\textbf{p}_{\kappa}=(p_{\kappa}^{(i)},i\geq 0):=(\frac{s^{(i)}_{\kappa}}{\mathbf{n}_{\kappa}},i\geq 0)$ . For $\mathbb{F}_{\kappa}^{\downarrow}=(\mathbb{T}_{\kappa,l},~{}l\geq 1)$ , let $\mathcal{T}_{\kappa,l}$ denote the measured rooted real tree $(\mathbb{T}_{\kappa,l},\frac{\sigma_{\kappa}}{2\mathbf{n}_{\kappa}^{1/2}}d_{gr},\emptyset_{\kappa,l},\mu_{\kappa,l})$ where $\sigma_{\kappa}=\sigma(\textbf{p}_{\kappa})$ and $\mu_{\kappa,l}$ denotes the uniform measure putting mass $\frac{1}{\mathbf{n}_{\kappa}}$ on each vertex of $\mathbb{T}_{\kappa,l}$ . Let $\mathcal{F}_{\kappa}^{\downarrow}=(\mathcal{T}_{\kappa,l},l\geq 1)$ . Let $\Delta_{\kappa}:=\max\{i:s^{(i)}_{\kappa}>0\}$ . We are now prepared to state our main theorems.

Theorem 1.5.

Suppose that there exists a distribution $\textbf{p}=(p^{(i)},i\geq 0)$ on $\mathbb{N}$ with $p^{(1)}<1$ such that $\textbf{p}_{\kappa}$ converges to p coordinatewise. Suppose also that $\sigma(\textbf{p}_{\kappa})\rightarrow\sigma(\textbf{p})\in(0,\infty)$ . If $\frac{c(\mathbf{s}_{\kappa})}{\sigma(\mathbf{p}_{\kappa})\mathbf{n}_{\kappa}^{1/2}}\rightarrow\lambda\in(0,\infty)$ , then

[TABLE]

with respect to the product topology for $d_{GHP}$ where $(\gamma_{l},l\geq 1)$ are the excursions of the process $(F_{\lambda}(s)-\inf\limits_{s^{\prime}\in(0,s)}F_{\lambda}(s^{\prime}))_{0\leq s\leq 1}$ , listed in decreasing order of length.

Theorem 1.6.

Under the conditions of Theorem 1.5, suppose additionally that there exists $\epsilon>0$ such that $\Delta_{\kappa}=O(\mathbf{n}_{\kappa}^{\frac{1-\epsilon}{2}})$ . Then the convergence (1.3) holds in $(\mathbb{L}_{\infty},d_{GHP}^{\infty})$ .

Remark 1.1.

The assumptions of Theorem 1.5 imply that $\mu(\textbf{p}_{\kappa})\rightarrow\mu(\textbf{p})=1$ and that $\Delta_{\kappa}=o({\mathbf{n}_{\kappa}}^{1/2})$ . We include the proof of these facts as Lemma A.1 in the Appendix.

Remark 1.2.

The pair $((\gamma_{l},l\geq 1),(\mathcal{T}_{\gamma_{l}},l\geq 1))$ has the same law as $((\gamma_{l},l\geq 1),(\mathcal{T}_{|\gamma_{l}|\mathbf{e}_{l}},l\geq 1))$ where $(\mathbf{e}_{l},l\geq 1)$ are standard Brownian excursions, independent of each other and of $(\gamma_{l},l\geq 1)$ .

1.3. Key ingredients of the paper

Here we summarize the two key ingredients of this paper. The first element is the convergence of the large trees in (1.3), which is essentially given by the following proposition. For all $l\geq 1$ , let $X_{\kappa,l}=\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}$ .

Proposition 1.7.

Under the conditions of Theorem 1.5, for any fixed $j\geq 1$ ,

[TABLE]

as $\kappa\rightarrow\infty$ , where $(\mathbf{e}_{l})_{l\leq j}$ are independent copies of $\mathbf{e}$ , and $(\gamma_{l},l\geq 1)$ are the excursions of $(F_{\lambda}(s)-\inf\limits_{s^{\prime}\in(0,s)}F_{\lambda}(s^{\prime}))_{0\leq s\leq 1}$ ranked in decreasing order of length.

There are two parts of the convergence in (1.4). One is the convergence of the normalized sizes of large trees to lengths of excursions. This will be given by the following proposition. To state this result, we need to first introduce some notions. Let $\mathcal{C}_{0}(1)=\{x\in C([0,1],\mathbb{R}):x(0)=0\}$ For a non-negative function $g^{+}\in\mathcal{C}_{0}(1)$ , an excursion $\gamma$ of $g^{+}$ is the restriction of $g^{+}$ to a time interval $[l(\gamma),r(\gamma)]$ such that $g^{+}(l(\gamma))=g^{+}(r(\gamma))=0$ and $g^{+}(s)>0$ for $s\in(l(\gamma),r(\gamma))$ . In this case $[l(\gamma),r(\gamma)]$ is called an excursion interval of $g^{+}$ . The length of the excursion is denoted as $|\gamma|=r(\gamma)-l(\gamma)$ . For a function $g$ we write $g(s)-\min\limits_{0\leq s^{\prime}<s}g(s^{\prime})$ to denote $(g(s)-\min\limits_{0\leq s^{\prime}<s}g(s^{\prime}),~{}0\leq s\leq 1)$ . For $g\in\mathcal{C}_{0}(1)$ , sometimes we refer the excursions of $g(s)-\min\limits_{0\leq s^{\prime}<s}g(s^{\prime})$ as excursions of $g$ . Let $l^{\downarrow}_{1}=\{x=(x_{1},x_{2},\cdots):x_{1}\geq x_{2}\geq\cdots\geq 0,\sum\limits_{i}x_{i}\leq 1\}$ and endow $l^{\downarrow}_{1}$ with the topology induced by the $l_{1}$ distance: $d(x,y)=\sum\limits_{i}|x_{i}-y_{i}|$ .

Proposition 1.8.

Under the hypothesises of Theorem 1.5, we have

[TABLE]

in $l^{\downarrow}_{1}$ , where $(\gamma_{l},l\geq 1)$ are the excursions of $F_{\lambda}^{br}(s)-\min\limits_{0\leq s^{\prime}\leq s}F_{\lambda}^{br}(s^{\prime})$ ranked in decreasing order of length.

This proposition will be a corollary of the following theorem, which is the main result of Section 4. For a plane forest $\mathrm{F}$ , let $u_{1}<u_{2}<\cdots<u_{|\mathrm{F}|}$ be the nodes of $\mathrm{F}$ listed according to their lexicographical order in $\mathcal{U}$ in each tree component, with nodes of first tree listed first, then the nodes of second tree and so on. The depth-first walk, or Lukasiewicz path $S_{\mathrm{F}}$ is defined as follows. First set $S_{\mathrm{F}}(0)=0$ and then let

[TABLE]

We extend the definition of $S_{\mathrm{F}}$ to the compact interval $[0,|\mathrm{F}|]$ by linear interpolation.

Theorem 1.9.

Under the conditions of Theorem 1.5, we have

[TABLE]

in $\mathcal{C}_{0}(1)$ as $\kappa\rightarrow\infty$ .

The second part of the convergence of (1.4) is the convergence of the large trees, for which we will rely on the following result about random trees with given degree sequences from [16].

Theorem 1.10 (Theorem 1 in [16]).

Let $\{\mathbf{s}_{\kappa},\kappa\geq 1\}$ be a degree sequence such that $\mathbf{n}_{\kappa}:=n(\mathbf{s}_{\kappa})\rightarrow\infty,\Delta_{\kappa}:=\Delta(\mathbf{s}_{\kappa})=o(\mathbf{n}_{\kappa}^{1/2})$ . Suppose that there exists a distribution p on $\mathbb{N}$ with mean 1 such that $\textbf{p}_{\kappa}$ converges to p coordinatewise and such that $\sigma(\textbf{p}_{\kappa})\rightarrow\sigma(\textbf{p})\in(0,\infty)$ . Let $\mathbb{T}_{\kappa}$ be the random plane tree under $\mathbb{P}_{\mathbf{s}_{\kappa}}$ , the uniform measure on the set of plane trees with degree sequence $\mathbf{s}_{\kappa}$ . Let $\mathcal{T}_{\kappa}$ denote the measured rooted metric space $(\mathbb{T}_{\kappa},\frac{\sigma(\textbf{p}_{\kappa})}{2\mathbf{n}_{\kappa}^{1/2}}d_{gr},\emptyset_{\kappa},\mu_{\kappa})$ where $\mu_{\kappa}$ denotes the uniform measure putting mass $\frac{1}{\mathbf{n}_{\kappa}}$ on each vertex of $\mathbb{T}_{\kappa}$ . Then when $\kappa\rightarrow\infty,\mathcal{T}_{\kappa}\overset{d}{\rightarrow}\mathcal{T}_{\mathbf{e}}$ in the Gromov-Hausdorff-Prokhorov sense.

Remark 1.3.

In fact Theorem 1 in [16] is only stated in the Gromov-Hausdorff sense, that is, $(\mathbb{T}_{\kappa},\frac{\sigma(\textbf{p}_{\kappa})}{2\mathbf{n}_{\kappa}^{1/2}}d_{gr})\overset{d}{\rightarrow}(T_{\mathbf{e}},d_{\mathbf{e}},\emptyset_{\mathbf{e}})$ . But the conclusion can be strengthened to GHP convergence easily. For completeness, we include a proof of this fact in Appendix B.

The following proposition contains the additional ingredient required to prove Theorem 1.6.

Proposition 1.11.

Under the conditions of Theorem 1.6, for all $a>0$ , we have

[TABLE]

The key results leading to Proposition 1.11 include a height bound for random tree with prescribed degree sequence and a variance bound for uniformly permuted child sequences. The height bound of uniformly random tree with prescribed degree sequence is given in the following theorem.

Theorem 1.12 (Theorem 1 in [3]).

Fix a degree sequence $\mathbf{s}=(s^{(i)},i\geq 0)$ such that $\sum\limits_{i\geq 0}is^{(i)}=|\mathbf{s}|-1$ , and let $\mathbb{T}(\mathbf{s})$ be a uniformly random plane tree with degree sequence $\mathbf{s}$ . Then for all $m\geq 1$ we have

[TABLE]

where $1_{\mathbf{s}}=\frac{|\mathbf{s}|-2}{|\mathbf{s}|-1-s^{(1)}}$ .

The following probability bound on variances of uniformly permuted integer sequences allows us to control the variance of degrees of trees in random forests, and thereby apply Theorem 1.12 to prove Proposition 1.11.

Proposition 1.13.

Fix $c=(c_{1},\cdots,c_{n})\in\mathbb{N}^{n}$ and let $\pi$ be a uniformly random permutation of $\{1,\cdots,n\}$ . Set $C_{i}=c_{\pi(i)}$ for $1\leq i\leq n$ , and let $S_{j}=\sum\limits_{i\leq j}C^{2}_{i}$ for $1\leq i\leq n$ . Then for all $\lambda\geq 2$ and $1\leq k\leq n$ , with $\Delta=\max\limits_{1\leq i\leq n}C_{i}=\max\limits_{1\leq i\leq n}c_{i}$ , and $\sigma^{2}(c)=\sum\limits_{i\leq n}c^{2}_{i}=S_{n}$ , we have

[TABLE]

Now let us prove our main theorems with these key results.

Proof of Theorem 1.5 and Theorem 1.6.

By Skorokhod’s representation theorem, we may work in a probability space in which the convergence in Proposition 1.7 is almost sure. Hence Proposition 1.7 yields that for any fixed $j,~{}\sup\limits_{l\leq j}d_{GHP}(\mathcal{T}_{\kappa,l},\mathcal{T}_{|\gamma_{l}|\mathbf{e}_{l}})\overset{d}{\to}0$ . This establishes Theorem 1.5. Now to prove the convergence in $(\mathbb{L}_{\infty},d_{GHP}^{\infty})$ , it suffices to prove that for any $a>0$ ,

[TABLE]

It suffices to separately prove

[TABLE]

For this purpose, we need to control the probability that small trees having either large diameter or large mass. Note that for a tree its diameter is bounded by twice of its height.

In fact the mass of tree is easy to control since for any $a>0$ and any $\kappa$ ,

[TABLE]

For the diameter we resort to Proposition 1.11.

We also need to bound $\mathrm{diam}(\mathcal{T}_{\gamma_{l}})$ and $\mathrm{mass}(\mathcal{T}_{\gamma_{l}})$ for $l$ large. Note that $\mathrm{mass}(\mathcal{T}_{\gamma_{l}})=|\gamma_{l}|$ and for any $a$ , let $j>1/a$ , then ${\mathbf{P}}\left(\sup\limits_{l>j}|\gamma_{l}|>a\right)=0$ .

For $\mathrm{diam}(\mathcal{T}_{\gamma_{l}}),~{}\mathrm{diam}(\mathcal{T}_{\gamma_{l}})\leq 2h(\mathcal{T}_{\gamma_{l}})=2\max(\gamma_{l})$ . For $0\leq s\leq 1$ , let

[TABLE]

and the excursion interval of $\gamma_{l}$ be $[g_{l},d_{l}]$ . Then

[TABLE]

and $d_{l}-g_{l}=|\gamma_{l}|\leq 1/l$ . So for any $j\geq 1/\epsilon$ ,

[TABLE]

since $F_{\lambda}$ is uniformly continuous. Hence we have the tail insignificance for diameter of $\mathcal{T}_{\gamma_{l}}$ and the claim is proved. ∎

To conclude this section, we sketch how our paper is organized. In Section 2 we investigate a special rotation mapping, which connects the collection of lattice bridges corresponding to certain degree sequence $\mathbf{s}$ and the set of first passage lattice bridges corresponding to $\mathbf{s}$ . This will be the key starting point of our work using depth-first walk process to code the structure of random forests with given degree sequences. The combinatorial argument in this section will be also useful for our later work on transferring results such as Proposition 1.13 to something similar which is applicable to random forests. This section will be purely combinatorial and only deal with fixed degree sequences. In Section 3, we collect some concentration results using martingale methods. These probability bounds will be useful for checking that the assumptions in Theorem 1.10 are satisfied for large trees of $\mathcal{F}^{\downarrow}_{\kappa}$ . The second part of this section proves the variance bound in Proposition 1.13. Again all results in this section is non-asymptotic and hence are presented with regards to a fixed degree sequence. In Section 4, we prove Theorem 1.9, the convergence of scaled exploration processes to some random process related to first passage bridge, using the rotation mapping in Section 2. We will then get Proposition 1.8 as a corollary from this weak convergence result. Finally, in Section 5 we finish the proof of Proposition 1.7 and Proposition 1.11 using results from Section 3 and Section 4.

2. An $n-$ to $-1$ map transforming lattice bridge to first passage lattice bridge

Given a degree sequence $\textbf{s}=(s^{(i)},i\geq 0)$ , let $d(\textbf{s})\in\mathbb{Z}_{\geq 0}^{n(\textbf{s})}$ be the vector whose entries are weakly increasing and with $s^{(i)}$ entries equal to $i$ , for each $i\geq 0$ . For example, if $\textbf{s}=(3,2,0,1,0,\cdots)$ with $s^{(i)}=0$ for $i\geq 4$ , then $d(\textbf{s})=(0,0,0,1,1,3)$ . Let $\mathrm{D}(\mathbf{s})$ be the collection of all possible child sequences corresponding to degree sequence s, i.e., all possible result as a permutation of $d(\textbf{s})$ .

A lattice bridge is a function $b:[0,k]\rightarrow\mathbb{R}$ with $b(0)=0$ and $b(i)\in\mathbb{Z},\ \forall i\in[k]$ , which is piecewise linear between integers. Here $k$ is an arbitrary positive integer. We let

[TABLE]

and call $\Lambda(\textbf{s})$ the set of lattice bridges corresponding to s. Note that if $b\in\Lambda(\textbf{s})$ , then $b(n(\textbf{s}))=-c(\textbf{s})$ . Furthermore, we have

[TABLE]

since to determine $b\in\Lambda(\textbf{s})$ , it suffices to choose the $s^{(0)}$ positions with step size $-1$ , $s^{(1)}$ positions with step size 0, $s^{(2)}$ positions with step size 1, etc.

We then let

[TABLE]

and call $F(\textbf{s})$ the collection of first passage lattice bridges corresponding to s.

For $s>0$ , let $\mathcal{C}_{0}(s)=\{x\in C([0,s],\mathbb{R}):x(0)=0\}$ . For $u\in[0,s]$ , let $\theta_{u,s}:\mathcal{C}_{0}(s)\rightarrow\mathcal{C}_{0}(s)$ denote the cyclic shift at $u$ , that is,

[TABLE]

For $x\in\mathcal{C}_{0}(s)$ and $y\in\mathbb{R}^{-}$ , let $t(y,x):=\inf\{t\in[0,s]:x(t)\leq y\}$ be the first time the graph of $x$ drops below $y$ . Sometimes we drop the argument $x$ for convenience and simply write $t(y)$ . If $y<\min\limits_{u\in[0,s]}x(u)$ we set $t(y,x)=0$ by convention, so $\theta_{t(y)}(x)=x$ .

In what follows, for $k\in\mathbb{N}$ we write $[k]-1=\{0,1,\cdots,k-1\}$ . And when the context is clear, we simply drop the subscript $s$ and write $\theta_{u}$ for $\theta_{u,s}$ .

Lemma 2.1.

For $b\in\Lambda(\textbf{s})$ , and for each $j\in[c(\textbf{s})]-1$ , we have $\theta_{t(\min(b)+j)}(b)\in F(\textbf{s})$ .

Proof.

Let $m\leq 0$ be the minimum of $b$ . Fix an integer $i$ such that $m\leq i\leq m+c(\textbf{s})-1$ and $u<n(\textbf{s})$ . We shall prove that $\theta_{t(i)}(b)(u)>-c(\textbf{s})$ , which proves the lemma. If $0\leq u\leq n(\textbf{s})-t(i)$ , then $\theta_{t(i)}(b)(u)=b(t(i)+u)-b(t(i))\geq m-i>-c(\textbf{s})$ . If $n(\textbf{s})-t(i)\leq u<n(\textbf{s})$ , then $\theta_{t(i)}(b)(u)=b(t(i)+u-n(\textbf{s}))+b(n(\textbf{s}))-b(t(i))=b(t(i)+u-n(\textbf{s}))-c(\textbf{s})-i$ . Since $u<n(\textbf{s})$ , $t(i)+u-n(\textbf{s})<t(i)$ and we must have $b(t(i)+u-n(\textbf{s}))>i$ by our definition of $t$ . Therefore in this case we also have $\theta_{t(i)}(b)(u)>-c(\textbf{s})$ . ∎

Next, define a function $f:\Lambda(\textbf{s})\times([c(\textbf{s})]-1)\rightarrow F(\textbf{s})$ by $f(b,j):=\theta_{t(\min(b)+j)}(b)$ .

Lemma 2.2.

$f$ * is an $n(\textbf{s})-$ to $-1$ map from $\Lambda(\textbf{s})\times([c(\textbf{s})]-1)$ to $F(\textbf{s})$ .*

Proof.

For $l\in F(\textbf{s})$ , if size of preimage of $l$ under $f$ is strictly large than $n(\textbf{s})$ , then we must have $b_{1},b_{2}\in\Lambda(\textbf{s}),j_{1},j_{2}\in[c(\textbf{s})]-1$ such that $f(b_{1},j_{1})=f(b_{2},j_{2})=l$ and $t(\min(b_{1})+j_{1})=t(\min(b_{2})+j_{2})$ , since $t$ can only take values in $[n(\textbf{s})]$ . By the definition of $f$ we must then have $b_{1}=b_{2}$ and hence $j_{1}=j_{2}$ . Therefore each element in $F(\textbf{s})$ can have at most $n(\textbf{s})$ preimages in $\Lambda(\textbf{s})\times([c(\textbf{s})]-1)$ . On the other hand, we have (see, e.g., [28], page 128)

[TABLE]

Hence $n(\textbf{s})\times|F(\textbf{s})|=c(\textbf{s})\times|\Lambda(\textbf{s})|=|\Lambda(\textbf{s})\times([c(\textbf{s})]-1)|$ , so it must in fact hold that each $l\in F(\textbf{s})$ has exactly $n(\textbf{s})$ preimages. ∎

Recall the concept of depth-first walk $S_{\mathrm{F}}$ of a plane forest $\mathrm{F}$ . For a sequence $\mathbf{c}=(c_{1},\cdots,c_{n})\in\mathbb{R}^{n}$ , we write $W_{\mathbf{c}}(j)=\sum\limits_{i=1}^{j}(c_{i}-1)$ for $j\in[n]$ . We let $W_{\mathbf{c}}(0)=0$ and make $W_{\mathbf{c}}$ a continuous function on $[0,n]$ by linear interpolation. Note that $S_{\mathrm{F}}$ is precisely $W_{\mathbf{c}}$ where $\mathbf{c}=(k_{\mathrm{F}}(u_{1}),\cdots,k_{\mathrm{F}}(u_{|\mathrm{F}|}))$ .

For $\mathbf{c}=(c_{1},\cdots,c_{n})\in\mathbb{R}^{n}$ and a permutation $\pi$ of $[n]$ , write $\pi(\mathbf{c})=(c_{\pi(1)},\cdots,c_{\pi(n)})$ . Also, recall from the beginning of this section that for a degree sequence s, $d(\textbf{s})$ is a vector with $s^{(i)}$ entries equal to $i$ for each $i\geq 0$ .

Corollary 2.3.

Let s be a degree sequence. Let $\pi$ be a uniformly random permutation of $[n(\textbf{s})]$ and let $\nu$ be independent of $\pi$ and drawn uniformly at random from $[c(\textbf{s})]-1$ . Then

[TABLE]

and both are uniformly random elements of $F(\textbf{s})$ .

Proof.

By definition, $(W_{\pi(d(\textbf{s}))},\nu)$ is uniformly at random in $\Lambda(\textbf{s})\times([c(\textbf{s})]-1)$ . By Lemma 2.2, it follows that $f(W_{\pi(d(\textbf{s}))},\nu)$ is uniformly random in $F(\textbf{s})$ . On the other hand, the map sending plane forest $\mathrm{F}$ to its Lukasiewicz path $S_{\mathrm{F}}$ restricts to an invertible map from $\mathrm{F}(\textbf{s})$ to $F(\textbf{s})$ . Thus, $S_{\mathbb{F}(\mathbf{s})}$ is also uniformly distributed in $F(\textbf{s})$ . ∎

First-passage bridges are naturally connected to plane forests. In a similar way, general lattice bridges are naturally connected to marked plane forests. This interpretation will be more convenient for some later proofs (Propositions 3.5, 3.9 and 3.10).

A marked forest is a pair $(F,v)$ where $F$ is a plane forest and $v\in v(F)$ . Sometimes we refer $v$ as the mark of $(F,v)$ . Recall that $\mathrm{F}(\textbf{s})$ denotes the collection of all plane forests with degree sequence s. Let $\mathrm{MF}(\textbf{s})$ be the collection of all marked forests with degree sequence s and for $1\leq i\leq c(\textbf{s})$ , let $\mathrm{MF}^{i}(\textbf{s})$ be the collection of marked forests $(F,v)\in\mathrm{MF}(\mathbf{s})$ such that the mark $v$ lies within the $i-$ th tree of $F$ . We define a map $g:\mathrm{MF}(\textbf{s})\rightarrow\mathrm{D}(\textbf{s})$ which lists the degrees of vertices of a marked forest starting from the mark in DFS order. Formally, for $(F,v)\in\mathrm{MF}(\textbf{s})$ , if the DFS ordering of $v(F)$ is $v_{1},\cdots,v_{n(\textbf{s})}$ and $v=v_{i}$ , then $g((F,v))=(k_{F}(v_{i}),\cdots,k_{F}(v_{n(\textbf{s})}),k_{F}(v_{1}),\cdots,k_{F}(v_{i-1}))$ . Next define a map $h:\mathrm{MF}(\textbf{s})\rightarrow\mathrm{F}(\textbf{s})$ by $h((F,v))=F$ . Then we have the following easy fact.

Lemma 2.4.

$g$ * is a $c(\textbf{s})-$ to $-1$ surjective map and for each $1\leq i\leq c(\mathbf{s}),~{}g^{i}:=g|_{\mathrm{MF}^{i}(\mathbf{s})}$ is a bijection between $\mathrm{MF}^{i}(\mathbf{s})$ and $\mathrm{D}(\mathbf{s})$ . Also, $h$ is a $n(\textbf{s})-$ to $-1$ surjective map.*

Proof.

For $d\in\mathrm{D}(\textbf{s}),|g^{-1}(\{d\})\cap\mathrm{MF}^{i}(\textbf{s})|=1$ for all $1\leq i\leq c(\textbf{s})$ . In fact, the element of each $g^{-1}(\{d\})\cap\mathrm{MF}^{i}(\textbf{s})$ can be obtained by cyclically permuting the tree components of the element of $g^{-1}(\{d\})\cap\mathrm{MF}^{1}(\textbf{s})$ . This shows that $g^{i}$ is a bijection. The other two claims are straightforward. ∎

The map $g$ being surjective immediately gives the following result.

Corollary 2.5.

*Let $\mathbb{MF}(\textbf{s})$ be a uniformly random element of $\mathrm{MF}(\textbf{s})$ , then $g(\mathbb{MF}(\textbf{s}))$ is a uniformly random element of $\mathrm{D}(\textbf{s})$ . *

3. Concentration results

In the first part of this section, we deal with a martingale concerning the proportion of a fixed degree of uniformly permuted degree sequence. This will be useful for proving Proposition 1.7 in Section 5 where we need to first show that the degree proportions in each large trees of $\mathcal{F}^{\downarrow}_{\kappa}$ are more or less in line with the degree proportion of the given degree sequences. The second part of this section deals with the variance bound of uniformly permuted child sequences, which leads to a key technical proposition on the height of tree components of $\mathbb{F}(\mathbf{s})$ . For both subsections we will use concentration results from [25].

Let $\mathbf{s}=(s^{(i)},i\geq 0)$ with $|\mathbf{s}|=n$ be a fixed degree sequence and let $\mathbf{C}=(C_{1},\cdots,C_{n})$ denote the uniformly permuted child sequence $\pi(d(\mathbf{s}))$ (recall the notation from Section 2), where $\pi$ is a uniform random permutation of $[n]$ . For each $i\geq 0$ , let $q^{(i)}=s^{(i)}/n$ be the degree proportion of degree $i$ of $\mathbf{s}$ .

3.1. Martingales of degree proportions of uniformly permuted degree sequence

In this subsection, we introduce some martingales concerning proportions of particular degree appeared at each step in a uniformly permuted degree sequence and use them and martingale concentration inequality from [25] as tools to prove Lemma 3.4 and Proposition 3.5, which are useful for eventually proving that the empirical degree distributions of large trees of $\mathbb{F}_{\kappa}$ behave well (Proposition 5.1). We first recall the following martingale bound in [25]. Let $\{X_{i}\}_{i=0}^{n}$ be a bounded martingale adapted to a filtration $\{\mathcal{F}_{i}\}_{i=0}^{n}$ . Let $V=\sum\limits_{i=0}^{n-1}var\{X_{i+1}~{}|~{}\mathcal{F}_{i}\},$ where

[TABLE]

Let

[TABLE]

Then we have the following bound.

Theorem 3.1 ([25], Theorem 3.15).

For any $t\geq 0$ ,

[TABLE]

For fixed $i$ , for $0\leq j\leq n-1$ , let $Y^{(i)}_{j}=|\{1\leq l\leq j:C_{l}=i\}|$ and let $X^{(i)}_{j}=s^{(i)}-Y^{(i)}_{j}$ . Note that for $j>0$

[TABLE]

Let $\mathcal{F}_{j}$ be the $\sigma-$ field generated by $C_{1},\cdots,C_{j}$ .

Lemma 3.2.

Let $M_{j}^{(i)}:=\frac{X_{j}^{(i)}}{n-j}-q^{(i)}$ , then

(a) $M_{j}^{(i)}$ is an $\mathcal{F}_{j}-$ martingale;

(b) The predictable quadratic variation of $M_{j+1}^{(i)}$ satisfies

$var\{M_{j+1}^{(i)}~{}|~{}\mathcal{F}_{j}\}:={\mathbf{E}}\left[{M_{j+1}^{(i)}}^{2}~{}|~{}\mathcal{F}_{j}\right]-{M_{j}^{(i)}}^{2}\leq\frac{1}{4}\frac{1}{(n-(j+1))^{2}}.$ **

Proof.

(a) Since $q^{(i)}$ is a constant, it suffices to show that $\frac{X_{j}^{(i)}}{n-j}$ is an $\mathcal{F}_{j}-$ martingale. In fact

[TABLE]

so

[TABLE]

Thus $\frac{X^{(i)}_{j}}{n-j}$ is an $\mathcal{F}_{j}-$ martingale.

(b) By definition, we have

[TABLE]

Now we substitute

[TABLE]

in the above result and obtain

[TABLE]

which gives the claim. ∎

Now we can apply Theorem 3.1.

Proposition 3.3.

For any $t>0$ and $0<s<n$ , we have

[TABLE]

Proof.

Fix $s<n$ , and consider the martingale $\{M^{(i)}_{j}\}_{j=0}^{n-s}$ . By Lemma 3.2(b), we know that

[TABLE]

Hence $v=\mbox{ess sup }V\leq\frac{1}{2s}$ . Also, for $j\leq n-s-1$ , if $X^{(i)}_{j+1}=X^{(i)}_{j}$ , then

[TABLE]

and if $X^{(i)}_{j+1}=X^{(i)}_{j}-1$ , then

[TABLE]

Applying Theorem 3.1 to both $\{M^{(i)}_{j}\}^{n-s}_{j=0}$ and $\{-M^{(i)}_{j}\}^{n-s}_{j=0}$ gives

[TABLE]

as claimed. ∎

Now we give a probability bound of proportion of certain degree $i$ deviates from $q^{(i)}$ by an error of at least $\epsilon$ .

Lemma 3.4.

For fixed $i\in\mathbb{N}$ and $\epsilon>0$ , let $B^{\epsilon,i}=\{\exists x\geq\log^{3}n:|Y^{(i)}_{x}-q^{(i)}x|\geq\epsilon x\}$ . Then for any $n$ large enough such that $\frac{\sqrt{5}}{\log n}<\epsilon<1,~{}{\mathbf{P}}\left(B^{\epsilon,i}\right)\leq n^{-3}.$

Proof.

By symmetry, the event $\{\exists j\geq\log^{3}n:|Y^{(i)}_{j}-q^{(i)}j|\geq\epsilon j\}$ has the same distribution as the event $\{\exists l\leq n-\log^{3}n:|X^{(i)}_{l}-q^{(i)}(n-l)|\geq\epsilon(n-l)\}$ . Hence we can write

[TABLE]

Taking $s=\log^{3}n,t=\epsilon$ in (3.1), the result follows. ∎

Now we consider how degrees distribute among the tree components of the random forest $\mathbb{F}(\mathbf{s})$ . Write $\mathbb{F}(\mathbf{s})^{\downarrow}=(\mathbb{T}_{l},~{}l\geq 1)$ . Let $\mathbf{s}_{l}=(s^{(i)}_{l},i\geq 0)$ denote the (empirical) degree sequence of the $l-$ th largest tree $\mathbb{T}_{l}$ , and let $\mathbf{n}_{l}=n(\mathbf{s}_{l})$ . Recall that $q^{(i)}=s^{(i)}/n$ and let $q_{l}^{(i)}=s^{(i)}_{l}/\mathbf{n}_{l}$ be the empirical proportion of degree $i$ vertices of $\mathbb{T}_{l}$ ; if $\mathbb{F}(\mathbf{s})$ has fewer than $l$ trees then $q^{(i)}_{l}=0$ . Note that $q^{(i)}$ is deterministic while $q_{l}^{(i)}$ is random.

Proposition 3.5.

For fixed $\epsilon>0$ and $i,l$ , let $B^{\epsilon,i}_{l}=\{|q_{l}^{(i)}-q^{(i)}|>\epsilon\}$ . Then for fixed $\epsilon>0,i\in\mathbb{N}$ , we have

[TABLE]

Proof.

Let $V$ be a uniformly random vertex of $\mathbb{F}(\mathbf{s})$ , then $(\mathbb{F}(\mathbf{s}),V)$ is uniformly distributed in $\mathrm{MF}(\mathbf{s})$ . List the nodes of $\mathbb{F}(\mathbf{s})$ in cyclic lexicographic order as $V=V_{1},V_{2},\cdots,V_{n}$ , and for $i\leq n$ let $C_{i}$ be the degree of $V_{i}$ . By Corollary 2.5, the sequence $(C_{1},\cdots,C_{n})=g(\mathbb{F}(\mathbf{s}),V)$ is uniformly distributed in $\mathrm{D}(\mathbf{s})$ ; in other words, it is distributed as a uniformly random permutation of $d(\mathbf{s})$ . For any $1\leq j\leq n$ , let $\tilde{B}^{\epsilon,i}_{j}$ be the event that there exists $m>n^{1/4}$ such that

[TABLE]

Since $(C_{1},\cdots,C_{n})$ is uniformly distributed in $\mathrm{D}(\mathbf{s})$ , it is immediate that ${\mathbf{P}}\left(\tilde{B}^{\epsilon,i}_{1}\right)=\cdots={\mathbf{P}}\left(\tilde{B}^{\epsilon,i}_{n}\right)$ . Suppose a tree $T\in\mathbb{F}(\mathbf{s})$ with $|T|>n^{1/4}$ has that

[TABLE]

If $V$ is not a node of $T$ , then there exists $m>n^{1/4},0<j\leq n-m$ such that

[TABLE]

If $V$ is a node of $T$ , then there exists $m>n^{1/4},j>n-m$ such that

[TABLE]

In either case we must have $\tilde{B}^{\epsilon,i}_{j}$ true for some $1\leq j\leq n$ . Therefore

[TABLE]

which gives the claim. ∎

3.2. Probability bound of trees of random forest having abnormally large height

In this subsection, we prove tail bounds on the heights of trees in $\mathbb{F}(\textbf{s})$ , by first proving tail bounds on the sums of squares of the child sequences. This will be used in proving Proposition 1.11 in Section 5. To be more specific, let $c=(c_{1},c_{2},\cdots,c_{n})\in\mathrm{D}(\mathbf{s})$ be a child sequence with $\sigma^{2}(\textbf{s}):=\sum\limits_{i=1}^{n}c_{i}^{2}=\sum\limits_{i}i^{2}s^{(i)}$ and write $M:=\sigma^{2}(\textbf{s})/n$ and $\Delta=\Delta(\mathbf{s}):=\max\limits_{i}c_{i}$ . Recall that $C_{1},C_{2},\cdots,C_{n}$ are the uniformly permuted child sequence and let $S_{j}:=\sum\limits_{i\leq j}C^{2}_{i}$ . We will use the following theorem from [25].

Theorem 3.6 (Theorem 2.7 in [25]).

Let random variables $X^{\ast}_{1},\cdots,X^{\ast}_{n}$ be independent, with $X^{\ast}_{k}-{\mathbf{E}}\left[X^{\ast}_{k}\right]\leq b$ for each $k$ . Let $S^{\ast}_{n}=\sum X^{\ast}_{k}$ , and let $S^{\ast}_{n}$ have expected value $\mu$ and variance $V$ (the sum of the variances of $X^{\ast}_{k}$ ). Then for any $t\geq 0$ , with $\epsilon=bt/V$ , we have

[TABLE]

Since $C_{1},C_{2},\cdots,C_{k}$ are sampled without replacement from the population $c_{1},c_{2},\cdots,c_{n}$ , we may not directly apply Theorem 3.6. We address this issue as follows.

Recall (or see, e.g., [5]) that given real random variables $U,V$ , we say $U$ is a dilation of $V$ if there exist random variables $\hat{U},\hat{V}$ such that

[TABLE]

Proposition 3.7 (Proposition 20.6 in [5]).

Suppose $X_{1},\cdots,X_{k}$ and $X^{\ast}_{1},\cdots,X^{\ast}_{k}$ are samples from the same finite population $x_{1},\cdots,x_{n}$ , without replacement and with replacement, respectively. Let $S_{k}=\sum\limits_{i=1}^{k}X_{i},S^{\ast}_{k}=\sum\limits_{i=1}^{k}X^{\ast}_{i}$ . Then $S^{\ast}_{k}$ is a dilation of $S_{k}$ . In particular, ${\mathbf{E}}\left[\phi(S^{\ast}_{k})\right]\geq{\mathbf{E}}\left[\phi(S_{k})\right]$ for all continuous convex function $\phi:\mathbb{R}\rightarrow\mathbb{R}$ .

The proof of Theorem 3.6, in [25], proceeds by bounding the quantity ${\mathbf{E}}\left[\exp(h(S^{\ast}_{n}-\mu))\right]$ , where $h$ is any real number. By Proposition 3.7, we have ${\mathbf{E}}\left[\exp(h(S_{n}-\mu))\right]\leq{\mathbf{E}}\left[\exp(h(S^{\ast}_{n}-\mu)\right]$ , which means that the proof applies mutatis mutandis in the setting of sampling without replacement.

Corollary 3.8.

Let $X_{1},\cdots,X_{k}$ be samples from finite population $x_{1},\cdots,x_{n}$ , without replacement, with $X_{1}-{\mathbf{E}}\left[X_{1}\right]\leq b$ . Let $S_{k}=\sum\limits_{i=1}^{k}X_{i},V=\sum\limits_{i=1}^{k}\mathrm{Var}(X_{i})$ and $\mu_{k}={\mathbf{E}}\left[S_{k}\right]$ . Then for any $t\geq 0$ , with $\epsilon=bt/V$ , we have

[TABLE]

Now we get our probability bound on the deviations of $(S_{k},k\leq n)$ .

Proof of Proposition 1.13.

We apply (3.3); we have $\mu_{k}={\mathbf{E}}\left[S_{k}\right]=\frac{k}{n}S_{n},b=\Delta^{2},$

[TABLE]

where $M=\sigma^{2}(c)/n$ . For $\lambda>1$ , taking $t=(\lambda-1)\frac{k}{n}\sigma^{2}(c)$ , we obtain

[TABLE]

Using the assumption $\lambda\geq 2$ twice, we have

[TABLE]

which finishes the proof. ∎

Using results from Section 2, we now have the following estimate on variance of tree components of $\mathbb{F}(\mathbf{s})$ . For a tree $T$ , we let $\sigma^{2}(T)=\sum\limits_{u\in T}k_{T}(u)^{2}$ .

Proposition 3.9.

Let $\textbf{s}=(s^{(i)},i\geq 0)$ be a degree sequence with $|\textbf{s}|=n$ and $M=\sigma^{2}(\textbf{s})/n$ . Then for $\lambda\geq 4,\alpha>\Delta^{2}(\textbf{s})/n$ ,

[TABLE]

Proof.

Let $V$ be a uniformly random vertex of $\mathbb{F}(\mathbf{s})$ , then $(\mathbb{F}(\mathbf{s}),V)$ is uniformly distributed in $\mathrm{MF}(\mathbf{s})$ . List the nodes of $\mathbb{F}(\mathbf{s})$ in cyclic lexicographic order as $V=V_{1},V_{2},\cdots,V_{n}$ , and for $i\leq n$ let $C_{i}$ be the degree of $V_{i}$ . By Corollary 2.5, the sequence $(C_{1},\cdots,C_{n})=g(\mathbb{F}(\mathbf{s}),V)$ is uniformly distributed in $\mathrm{D}(\mathbf{s})$ ; in other words, it is distributed as a uniformly random permutation of $d(\mathbf{s})$ . In what follows we omit some floor notations for readability. For $0\leq j\leq\lfloor\frac{1}{\alpha}\rfloor$ , let $B_{j}$ be the event that

[TABLE]

Since $C_{1},\cdots,C_{n}$ is distributed as a uniformly random permutation of $d(\mathbf{s})$ , we clearly have

[TABLE]

Suppose that a given tree $T\in\mathbb{F}(\mathbf{s})$ has $|T|\leq\alpha n$ and $\sigma^{2}(T)\geq\lambda\alpha\sigma^{2}(\mathbf{s})$ . Then there exist $0\leq l<n$ and $m\leq\alpha n$ such that $V(T)=\{V_{l+t~{}(\mathrm{mod}~{}n)}:1\leq t\leq m\}$ . Hence there exists $0\leq j\leq\lfloor\frac{1}{\alpha}\rfloor$ such that $V(T)\subset\{V_{i~{}(\mathrm{mod}~{}n)},~{}j\alpha n+1\leq i\leq(j+2)\alpha n\}$ . This implies that

[TABLE]

i.e. $B_{j}$ is true. Hence the probability in question is at most

[TABLE]

where we take $k=\lfloor 2\alpha n\rfloor$ in Proposition 1.13 and use $\alpha>\Delta^{2}(\textbf{s})/n$ at the last step. ∎

Now we finish this section by proving a key proposition on probability bound of $\mathbb{F}(\mathbf{s})$ containing trees with unusually large height.

Proposition 3.10.

$\forall~{}\epsilon,\rho\in(0,1),\exists n_{0}=n_{0}(\epsilon)\in\mathbb{N}$ * and $\beta_{0}>0$ such that the following is true. Let $\mathbf{s}$ be any degree sequence with $|\mathbf{s}|=n\geq n_{0}$ . Suppose that $\Delta(\mathbf{s})\leq n^{\frac{1-\epsilon}{2}},s^{(1)}\leq(1-\epsilon)|\mathbf{s}|$ and $\epsilon\leq\sigma^{2}(\mathbf{s})/n\leq 1/\epsilon$ , then for any $0<\beta<\beta_{0}$ ,*

[TABLE]

Proof.

Fix $\beta>0$ small, let $\delta=\beta^{1/8}$ , and consider the following four events.

•

$E_{1}$ is the event that there exists a tree $T$ (of $\mathbb{F}(\textbf{s})$ ) with $\Delta^{2}(\textbf{s})<|T|<\beta n$ and $\sigma^{2}(T)>(\frac{|T|}{n})^{1/2}\sigma^{2}(\textbf{s})$ .

•

$E_{2}$ is the event that there exists a tree $T$ with $|T|\leq n^{1-\epsilon}$ and $\sigma^{2}(T)>n^{1-\frac{\epsilon}{2}}$ .

•

$E_{3}$ is the event that there exists a tree $T$ with $\Delta^{2}(\textbf{s})<|T|<\beta n$ and $\sigma^{2}(T)\leq(\frac{|T|}{n})^{1/2}\sigma^{2}(\textbf{s})$ such that $h(T)>\delta n^{1/2}$ .

•

$E_{4}$ is the event that there exists a tree $T$ with $|T|\leq n^{1-\epsilon}$ and $\sigma^{2}(T)\leq n^{1-\frac{\epsilon}{2}}$ such that $h(T)>\delta n^{1/2}$ .

If there is $T\in\mathbb{F}(\textbf{s})$ with $|T|<\beta n$ , and $h(T)>\delta n^{1/2}$ , then one of $E_{1},E_{2},E_{3}$ or $E_{4}$ must occur, so it suffices to bound ${\mathbf{P}}\left(E_{1}\right)+{\mathbf{P}}\left(E_{2}\right)+{\mathbf{P}}\left(E_{3}\right)+{\mathbf{P}}\left(E_{4}\right)$ . For $E_{1}$ , we further decompose the interval $[\Delta^{2}(\mathbf{s}),\beta n]$ dyadically. In the next sum, we bound the $k-$ th summand by taking $\alpha=\frac{\beta}{2^{k}},\lambda=\frac{2^{\frac{k-1}{2}}}{\beta^{1/2}}\geq 4$ in Proposition 3.9.

[TABLE]

where we use that $\sigma^{2}(\mathbf{s})/n\geq\epsilon$ in the final line.

Next, note that ${\mathbf{P}}\left(E_{2}\right)\leq\sum\limits_{j=1}^{n^{1-\epsilon}}{\mathbf{P}}\left(\exists T\in\mathbb{F}(\mathbf{s}):|T|=j,\sigma^{2}(T)>n^{1-\epsilon/2}\right)$ . For any fixed $j$ , using Corollary 2.5, with similar argument as in proof of Proposition 3.9, we have

[TABLE]

For any $j\leq n^{1-\epsilon}$ , use Proposition 1.13 with $\lambda\frac{j}{n}\sigma^{2}(\mathbf{s})=n^{1-\epsilon/2}$ and $\Delta(\mathbf{s})\leq n^{\frac{1-\epsilon}{2}}$ , we have

[TABLE]

These give that

[TABLE]

We bound ${\mathbf{P}}\left(E_{3}\right)$ as follows. For $k\geq 0$ , let $E_{3,k}$ be the event that there exists $T\in\mathbb{F}(\mathbf{s})$ with $\frac{\beta n}{2^{k+1}}\leq|T|\leq\frac{\beta n}{2^{k}}$ and $\sigma^{2}(T)\leq(\frac{|T|}{n})^{1/2}\sigma^{2}(\textbf{s})$ such that height $h(T)>\delta n^{1/2}$ . Also, let $B$ be the event that there exists $T\in\mathbb{F}(\mathbf{s})$ with $|T|\geq n^{1/4}$ such that

[TABLE]

For $n$ large enough, we have $\frac{\sqrt{5}}{\log n}<\epsilon/2<1$ . Hence it is immediate from Lemma 3.4 and Proposition 3.5 that ${\mathbf{P}}\left(B\right)\leq n^{-2}$ for $n$ large. Also, for $n$ large, if $h(T)\geq\delta n^{1/2}$ then $|T|\geq h(T)\geq n^{1/4}$ , so

[TABLE]

Let $M$ be the number of trees $T\in\mathbb{F}(\mathbf{s})$ with $\frac{\beta n}{2^{k+1}}\leq|T|\leq\frac{\beta n}{2^{k}}$ and $\sigma^{2}(T)\leq(\frac{|T|}{n})^{1/2}\sigma^{2}(\textbf{s})$ , and list the random degree sequences of these trees as $\mathbf{R}_{1},\cdots,\mathbf{R}_{m}$ . Then for any degree sequences $\mathbf{r}_{1},\cdots,\mathbf{r}_{m}$ ,

[TABLE]

Moreover

[TABLE]

where $\mathbb{T}(\mathbf{r}_{i})$ is a uniformly random plane tree with degree sequence $\mathbf{r}_{i}$ . It follows from these identities that

[TABLE]

where the supremum is over vectors $(\mathbf{r}_{1},\cdots,\mathbf{r}_{m})$ of degree sequences such that

[TABLE]

The last condition implies that, for all $i\leq m$ ,

[TABLE]

and that

[TABLE]

Finally we must have $n(\mathbf{r}_{i})\geq\frac{\beta}{2^{k+1}}n$ for all $i\leq M$ , so $M\leq\frac{2^{k+1}}{\beta}$ . Now recall Theorem 1.12, which states that for a degree sequence $\mathbf{r}=(r^{(i)},i\geq 0)$ and for all $h\geq 1$ ,

[TABLE]

where $1_{\mathbf{r}}=\frac{|\mathbf{r}|-2}{|\mathbf{r}|-1-r^{(1)}}$ ; note that this is at most $4/\epsilon$ for all degree sequences under consideration (for $n$ large enough such that $n^{1/4}\geq 4/\epsilon$ ). Using a union bound in (3.8), and then applying Theorem 1.12, we obtain that

[TABLE]

where we use the assumption $\sigma^{2}(\mathbf{s})/n\leq 1/\epsilon$ . And summing over $k$ in (3.7) yields that

[TABLE]

if we take $\delta=\beta^{1/8}$ , where $C_{5}>0$ is some universal constant and $C_{6}>0$ is some constant depending on $\epsilon$ .

For ${\mathbf{P}}\left(E_{4}\right)$ , similar to the previous treatment of ${\mathbf{P}}\left(E_{3}\right)$ , for $n$ large, we have

[TABLE]

There are at most $n$ trees in total, so a reprise of the conditioning argument used to bound ${\mathbf{P}}\left(E_{3}\right)$ gives

[TABLE]

where the supremum is over degree sequences $\mathbf{r}$ with $n(\mathbf{r})\leq n^{1-\epsilon}$ , with $\sigma^{2}(\mathbf{r})\leq n^{1-\epsilon/2}$ , and with $r^{(1)}\leq(1-\epsilon/2)n(\mathbf{r})$ . By Theorem 1.12, we obtain that

[TABLE]

recall that we take $\delta=\beta^{1/8}$ . Of the bounds on ${\mathbf{P}}\left(E_{i}\right),1\leq i\leq 4$ in (3.5), (3.6), (3.9) and (3.10), the largest is for ${\mathbf{P}}\left(E_{3}\right)$ (provided $n$ is large enough). Hence by taking $\beta>0$ small enough, we can make the bound less than any prescribed number $\rho>0$ , which yields the result. ∎

4. Convergence of the Lukasiewicz walk of forest to first passage bridge

In this section, we aim to prove Theorem 1.9 and conclude Proposition 1.8 as a corollary of Theorem 1.9. Throughout the section, we fix a sequence $(\mathbf{s}_{\kappa},\kappa\in\mathbb{N})$ of degree sequences, and let $\mathbf{n}_{\kappa},\textbf{p}_{\kappa}$ be as in Section 1 and the function $d$ be as in Section 2. Write $\sigma_{\kappa}=\sigma(\textbf{p}_{\kappa}),d_{\kappa}=d(\mathbf{s}_{\kappa}),\sigma=\sigma(\textbf{p})$ . Recall from Section 1 that for $l\geq 0$ , we write $B^{br}_{l}$ for the Brownian bridge of duration 1 from 0 to $-l$ . Moreover, we simply write $B^{br}$ for the case $l=0$ .

Proposition 4.1.

Assume $(\mathbf{s}_{\kappa},\kappa\geq 0)$ satisfies the hypothesis of Theorem 1.5, and in particular that $c_{\kappa}=c(\mathbf{s}_{\kappa})=(1+o(1))\lambda\sigma_{\kappa}\mathbf{n}_{\kappa}^{1/2}$ as $\kappa\rightarrow\infty$ for some $\lambda>0$ and that $\sigma_{\kappa}\rightarrow\sigma$ . For each $\kappa\geq 0$ , fix a uniform random permutation $\pi_{\kappa}$ of $[\mathbf{n}_{\kappa}]$ , and define a $C[0,1]$ function $\widetilde{W}_{\kappa}$ by

[TABLE]

Then

[TABLE]

To prove this theorem, we make use of the following result, which is Corollary 20.10 (a) in [5].

Theorem 4.2.

Consider a triangular array $(Z_{q,i}:1\leq i\leq M_{q},1\leq q)$ of random variables satisfying

(a) For each $q$ , the sequence $(Z_{q,1},\cdots,Z_{q,M_{q}})$ is exchangeable;

(b) $\max\limits_{i}|Z_{q,i}|\overset{p}{\to}0$ as $q\to\infty$ .

Define $\mu_{q}=\sum\limits_{i}Z_{q,i},~{}\tau_{q}^{2}=\sum\limits_{i}(Z_{q,i}-\frac{\mu_{q}}{M_{q}})^{2}$ and $S^{q}(t)=\sum\limits_{i=1}^{\lfloor tM_{q}\rfloor}Z_{q,i}$ .

Let $X(t)=\tau B^{br}(t)+\mu t$ where $(\tau,\mu)$ is independent of $B^{br}$ . Then

[TABLE]

Proof of Proposition 4.1.

Let $d_{\kappa,i}:=\pi_{\kappa}(d_{\kappa})_{i}-1,$ for $1\leq i\leq\mathbf{n}_{\kappa}$ . Although $d_{\kappa,i}$ depends on $\kappa$ , we will write $d_{i}$ instead of $d_{\kappa,i}$ from here for readability. We apply the above theorem directly with $Z_{\kappa,i}=\frac{d_{i}}{\sigma_{\kappa}\mathbf{n}_{\kappa}^{1/2}}$ . Condition (a) is satisfied since $\pi_{\kappa}$ is a uniformly random permutation of $[\mathbf{n}_{\kappa}]$ . Condition $(b)$ is satisfied since $\Delta_{\kappa}=o({\mathbf{n}_{\kappa}}^{1/2})$ and $\sup\sigma_{\kappa}<\infty$ .

Next note that, since $\sum\limits_{i}d_{i}=\sum\limits_{i}(\pi_{\kappa}(d_{\kappa})_{i}-1)=-c_{\kappa}$ ,

[TABLE]

the final convergence holding by our assumption on $c_{\kappa}$ . We also have

[TABLE]

the last equation holding since $c_{\kappa}=O(\mathbf{n}_{\kappa}^{1/2})$ .

Next note that

[TABLE]

It follows that

[TABLE]

as $\kappa\rightarrow\infty$ by our assumption on $\mathbf{s}_{\kappa}$ .

Using equations (4.1) and (4.2), by Theorem 4.2 we conclude that

[TABLE]

For all $t$ ,

[TABLE]

by assumption, so we must also have $\left(\widetilde{W}_{\kappa}(t),0\leq t\leq 1\right)\overset{d}{\to}\left(B^{br}(t)-\lambda t,~{}0\leq t\leq 1\right)$ in $D[0,1]$ . Since the Skorohod topology relativized to $C[0,1]$ coincides with the uniform topology (see page 124 of [14]), the result follows. ∎

Let $f:\mathcal{C}_{0}(1)\times[0,\infty)\to\mathcal{C}_{0}(1)$ be defined by $f(b,v):=\theta_{u}(b)$ where $u=\inf\{t:b(t)\leq\min\limits_{0\leq s\leq 1}b(s)+v\}$ . Note that since $b$ is continuous, the minimum of $b$ exists. Also, for $v\leq-\min\limits_{0\leq s\leq 1}b(s)$ , we have $u=\inf\{t:b(t)=\min\limits_{0\leq s\leq 1}b(s)+v\}$ and for $v\geq-\min\limits_{0\leq s\leq 1}b(s)$ we have $u=0$ so $f(b,v)=\theta_{0}(b)=b$ .

Recall from Section 1 the first passage bridge (of unit length from 0 to $-\lambda$ ) $F^{br}_{\lambda}$ is

[TABLE]

where $T_{\lambda}:=\inf\{t:B(t)<-\lambda\}$ is the first passage time below level $-\lambda<0$ and $B$ is the standard Brownian motion. We are going to use the following result from [10].

Theorem 4.3 ([10], Theorem 7).

Let $\nu$ be uniformly distributed over $[0,\lambda]$ and independent of $B^{br}_{\lambda}$ . Define the r.v. $U=\inf\{t:B^{br}_{\lambda}(t)=\inf_{0\leq s\leq 1}B^{br}_{\lambda}(s)+\nu\}$ . Then the process $\theta_{U}(B^{br}_{\lambda})$ has the law of the first passage bridge $F^{br}_{\lambda}$ . Moreover, $U$ is uniformly distributed over $[0,1]$ and independent of $\theta_{U}(B^{br}_{\lambda})$ .

Remark 4.1.

Note that [10] considers first passage times above positive levels, whereas we consider first passage below negative levels. But the two cases are clearly equivalent.

As preparation we begin with showing the almost sure continuity of the map $f$ . We first show that for a fixed function $b$ , the closeness of the location where $b$ is cyclically shifted will guarantee the continuity of the map $f$ .

Lemma 4.4.

For any $b\in\mathcal{C}_{0}(1)$ , the function $g^{b}:[0,1]\rightarrow\mathcal{C}_{0}(1)$ with $g^{b}(u)=\theta_{u}(b)$ is uniformly continuous.

Proof.

We want to show that $\|\theta_{u}-\theta_{v}\|$ is small when $|u-v|$ is small. Since $\theta_{u}\circ\theta_{v}=\theta_{u+v\mod 1}$ , without loss of generality, we can assume that $v=0$ . In other words we just aim to bound $\|\theta_{u}(b)-b\|$ for small $u$ . Fix $\delta\in(0,1/2)$ and let $\epsilon=\epsilon(\delta)=\sup\limits_{|t-s|<\delta}|b(t)-b(s)|$ be the modulus of continuity of $b$ . Let $0<u<\delta$ . If $t\in[0,1-u]$ , then $|\theta_{u}(b)(t)-b(t)|=|b(t+u)-b(u)-b(t)|\leq|b(u)-b(0)|+|b(t+u)-b(t)|\leq 2\epsilon(u)$ . If $t\in[1-u,1]$ , then $|\theta_{u}(b)(t)-b(t)|=|b(t+u-1)+b(1)-b(u)-b(t)|\leq|b(t+u-1)-b(u)|+|b(1)-b(t)|\leq 2\epsilon(u)$ . Since $\epsilon(u)\rightarrow 0$ as $u\rightarrow 0$ , the result follows. ∎

Lemma 4.5.

Given $b\in\mathcal{C}_{0}(1)$ and $0\leq v\leq-\min(b)$ , if $f(b,v)=\theta_{t_{v+\min(b)}}(b)$ is not continuous at $v$ , then $b$ attains a local minimum at $t_{v+\min(b)}$ .

Proof.

By Lemma 4.4, if $f(b,v)$ is not continuous at $v$ , then $t_{v+\min(b)}$ is not continuous at $v$ . The continuity of $b$ clearly implies right-continuity of $t_{v+\min(b)}$ as a function of $v$ . Moreover, for all $0\leq v\leq-\min(b)$ , $b$ attains a left-local minimum at $t_{v+\min(b)}$ . Letting $t^{+}=\lim\limits_{v^{\prime}\uparrow v}t_{v^{\prime}+\min(b)}$ , then it follows that

[TABLE]

This implies that if $t_{v+\min(b)}$ is not continuous at $v$ , then $t^{+}>t_{v+\min(b)}$ , so $b$ also attains a right-local minimum at $t_{v+\min(b)}$ . This proves the lemma. ∎

For $\lambda>0$ , we next collect a few properties of Brownian bridge $B^{br}_{\lambda}$ and first passage bridge $F^{br}_{\lambda}$ :

Lemma 4.6.

Brownian bridge $B^{br}_{\lambda}$ satisfies the following properties:

(a) Let $\tau_{+}=\inf\{t>0:B^{br}_{\lambda}(t)>0\},~{}\tau_{-}=\inf\{t>0:B^{br}_{\lambda}(t)<0\}$ , then almost surely $\tau_{+}=\tau_{-}=0$ ;

(b) Given two nonoverlapping closed intervals (which may share one common endpoint) in $[0,1]$ , the minima of $B^{br}_{\lambda}$ on these two intervals are almost surely different;

(c) Almost surely, every local minimum of $B^{br}_{\lambda}$ is a strict local minimum;

(d) The set of times where local minima are attained is countable.

Moreover, these four properties also hold for first passage bridge $F^{br}_{\lambda}$ .

Proof.

First note that the four properties are satisfied by a standard Brownian motion $B$ (e.g. see Theorem 2.8 and Theorem 2.11 in [27]). Let $C_{n}$ be the set of functions $f\in C[0,1]$ such that all four properties in the lemma occur up to time $1-1/n$ (i.e. the restriction of $f$ on $[0,1-1/n]$ satisfies all four properties). Then ${\mathbf{P}}\left(B\in C_{n}\right)=1$ for all $n\in\mathbb{N}$ . By equation (1.1) and equation (1.2) we know that the law of $B^{br}_{\lambda}$ and the law of $F^{br}_{\lambda}$ are both absolutely continuous with respect to the law of $B$ up to time $1-1/n$ . Hence we must have ${\mathbf{P}}\left(B^{br}_{\lambda}\in C_{n}\right)={\mathbf{P}}\left(F^{br}_{\lambda}\in C_{n}\right)=1$ for any $n\in\mathbb{N}$ . This immediately implies that properties (a), (c) and (d) hold for $B^{br}_{\lambda}$ and $F^{br}_{\lambda}$ . It also implies (b), except for the case where one of the intervals has the form $[s,1]$ and the minimum on $[s,1]$ is reached at 1. For $F^{br}_{\lambda}$ , by definition the global minimum $-\lambda$ is uniquely achieved at 1, hence the minimum on $[s,1]$ will not be the same as the minimum on any nonoverlapping interval. For $B^{br}_{\lambda}$ , consider $\tilde{B}_{\lambda}(t)=-B^{br}_{\lambda}(1-t)-\lambda$ , then $\tilde{B}_{\lambda}\overset{d}{=}B^{br}_{\lambda}$ , so $\tilde{B}_{\lambda}$ almost surely takes positive values on any interval $[0,\epsilon]$ by property (a). It follows that $\min\limits_{t\in[s,1]}B^{br}_{\lambda}(t)$ is almost surely achieved at some $t\neq 1$ . This completes the proof. ∎

Lemma 4.7.

Let $\nu$ be $Unif[0,\lambda]-$ distributed and independent of $B^{br}_{\lambda}$ . Then the function $f:\mathcal{C}_{0}(1)\times[0,\infty)\to\mathcal{C}_{0}(1)$ satisfies ${\mathbf{P}}\left(f\mbox{ is continuous at }(B^{br}_{\lambda},\nu)\right)=1$ .

Proof.

By Lemma 4.5, we have

[TABLE]

Let $M=\{u\in[0,1]:B^{br}_{\lambda}\mbox{ attains local minimum at }u\}$ and let $\tilde{M}=\{B^{br}_{\lambda}(u):u\in M\}$ . By Lemma 4.6, $M$ is countable, hence $\tilde{M}$ is countable.

Next note that ${\mathbf{P}}\left(B^{br}_{\lambda}\mbox{ attains a local minimum at }t_{\nu+\min(B^{br}_{\lambda})}\right)\leq{\mathbf{P}}\left(\nu+\min(B^{br}_{\lambda})\in\tilde{M}\right)$ . Moreover, $\nu$ is a continuous random variable, independent of $B^{br}_{\lambda}$ , so the last probability equals zero. ∎

Now we are ready to give the proof of Theorem 1.9.

Proof of Theorem 1.9.

For each $\kappa\geq 1$ let $\nu_{\kappa}$ be a uniformly random element of $[c_{\kappa}]-1$ independent of $\pi_{\kappa}$ , and let $\nu$ be $Unif[0,\lambda]$ and independent of $B^{br}_{\lambda}$ . By Corollary 2.3,

[TABLE]

By Proposition 4.1, we have $\widetilde{W}_{\kappa}\overset{d}{\to}B^{br}_{\lambda}$ , and clearly we have ${\sigma_{\kappa}}^{-1}{\mathbf{n}_{\kappa}}^{-1/2}\nu_{\kappa}\overset{d}{\to}\nu$ . By independence we have $(\widetilde{W}_{\kappa},{\sigma_{\kappa}}^{-1}{\mathbf{n}_{\kappa}}^{-1/2}\nu_{\kappa})\overset{d}{\to}(B^{br}_{\lambda},\nu)$ . Since by Lemma 4.7 we have

[TABLE]

we can apply the mapping theorem (e.g. Theorem 2.7 in [14]) to conclude that

[TABLE]

By Theorem 4.3, $F^{br}_{\lambda}\overset{d}{=}f(B^{br}_{\lambda},\nu)$ , hence we conclude that

[TABLE]

as required. ∎

Now we begin with the preparation work to prove Proposition 1.8. We define the map $h:\mathcal{C}_{0}(1)\rightarrow l^{\downarrow}_{1}$ such that for $g\in\mathcal{C}_{0}(1),~{}h(g)$ equals to the decreasing ordering of excursion length of $g(s)-\min\limits_{0\leq s^{\prime}<s}g(s^{\prime})$ . (we append at most countably many zeros to make $h(g)$ an element of $l^{\downarrow}_{1}$ ). Define $h_{k}:\mathcal{C}_{0}(1)\rightarrow\mathbb{R}^{k}$ as $h_{k}=\pi_{k}\circ h$ where $\pi_{k}:l^{\downarrow}_{1}\rightarrow\mathbb{R}^{k}$ is the projection onto the subspace spanned by the first $k$ coordinates. To prove Proposition 1.8, we use the following result from [18].

Lemma 4.8.

[Lemma 3.8 and Corollary 3.10 in [18]] Suppose $\zeta:[0,1]\rightarrow\mathbb{R}$ is continuous. Let $E$ be the set of non-empty intervals of $I=(l,r)$ such that

[TABLE]

Suppose that for all intervals $(l_{1},r_{1}),(l_{2},r_{2})\in E$ with $l_{1}<l_{2}$ , we have

[TABLE]

Suppose also that the complement of $\cup_{I\in E}I$ has Lebesgue measure 0. Fix functions $(\zeta_{m},m\geq 1)$ such that $\zeta_{m}\rightarrow\zeta$ uniformly on $[0,1]$ , and real numbers $(t_{m,i},~{}m,i\geq 1)$ which satisfy the following:

(i) $0=t_{m,0}<t_{m,1}<\cdots<t_{m,k}=1$ ;

(ii) $\zeta_{m}(t_{m,i})=\min\limits_{u\leq t_{m,i}}\zeta_{m}(u)$ ;

(iii) $\lim_{m}\max_{i}(\zeta_{m}(t_{m,i})-\zeta_{m}(t_{m,i+1}))=0$ .

Then the vector consisting of decreasingly ranked elements of $\{t_{m,i}-t_{m,i-1}:1\leq i\leq k\}$ (attaching zeroes if necessary to make the vector an element in $\mathbb{R}^{|E|}$ ) converges componentwise and in $l_{1}$ to the vector consisting of decreasingly ranked elements of $\{r-l:(l,r)\in E\}$ .

Lemma 4.9.

Let $\mathcal{E}$ be the set of excursions $\gamma$ of $F^{br}_{\lambda}(s)-\min\limits_{0\leq s^{\prime}<s}F^{br}_{\lambda}(s^{\prime})$ . Then almost surely for all $\gamma_{1},\gamma_{2}\in\mathcal{E}$ with $l(\gamma_{1})<l(\gamma_{2})$ , we have $F^{br}_{\lambda}(l(\gamma_{1}))>F^{br}_{\lambda}(l(\gamma_{2}))$ .

Proof.

Suppose to the contrary that for some $\gamma_{1},\gamma_{2}\in\mathcal{E}$ with $l(\gamma_{1})<l(\gamma_{2})$ , we have $F^{br}_{\lambda}(l(\gamma_{1}))\leq F^{br}_{\lambda}(l(\gamma_{2}))$ , then since $\gamma_{1},\gamma_{2}$ are excursions of $F^{br}_{\lambda}(s)-\min\limits_{0\leq s^{\prime}<s}F^{br}_{\lambda}(s^{\prime})$ , we must in fact have $F^{br}_{\lambda}(l(\gamma_{1}))=F^{br}_{\lambda}(l(\gamma_{2}))$ . In this case then we can find $a,b,c\in\mathbb{Q}$ such that $a<l(\gamma_{1})<b<l(\gamma_{2})<c$ , and $F^{br}_{\lambda}$ achieves the same minima (at $l(\gamma_{1})$ and $l(\gamma_{2})$ respectively) on $[a,b]$ and $[b,c]$ . This has probability zero by Lemma 4.6 (b). ∎

To prove the next lemma, we introduce the following notation. Let $(S_{1/2}(\lambda),0\leq\lambda<\infty)$ denote a stable subordinator of index 1/2, which is the increasing process with stationary independent increments such that

[TABLE]

Lemma 4.10.

Almost surely, the coordinates of $h(F^{br}_{\lambda})$ sum to 1, and are all strictly positive.

Proof.

By Proposition 5 of [10], $h(F^{br}_{\lambda})$ has the law of the vector of ranked excursion lengths of $|B^{br}|$ conditioned to have total local time $\lambda$ at 0, which in turn has the same law as ranked excursion lengths of Brownian bridge conditioned to have total local time $\lambda$ at 0 (this vector has the same law as the random vector $Y(\lambda)$ in [9], see equation (36) there). The latter is distributed as the scaled ranked jump sizes of the stable subordinator $S_{1/2}(\cdot)$ conditioned to be $\frac{1}{\lambda^{2}}$ at time 1 (e.g. see Theorem 4 in [9]). By Lemma 10 in [9], the coordinates of $h(F^{br}_{\lambda})$ almost surely sum to 1. This immediately implies that the stable subordinator almost surely has infinitely many jumps, so almost surely all coordinates of $h(F^{br}_{\lambda})$ are strictly positive. Indeed, suppose to the contrary that the excursion intervals are $(l_{1},r_{1}),\cdots,(l_{k},r_{k})$ , where $r_{i}\leq l_{i+1},1\leq i\leq k-1$ . Then since $\sum\limits_{i=1}^{k}(r_{i}-l_{i})=1$ , we must in fact have $r_{i}=l_{i+1},\forall 1\leq i\leq k-1$ and $l_{1}=0,r_{k}=1$ . But this implies that $0=F^{br}_{\lambda}(l_{1})=F^{br}_{\lambda}(r_{1})=F^{br}_{\lambda}(l_{2})=\cdots=F^{br}_{\lambda}(l_{k})=F^{br}_{\lambda}(r_{k})=F^{br}_{\lambda}(1)$ , contradicting to the fact $F^{br}_{\lambda}(1)=-\lambda<0$ . ∎

Proof of Proposition 1.8.

We first prove that for any fixed $j\geq 1$ ,

[TABLE]

Let $\zeta_{\kappa}=\left(\frac{S_{\mathbb{F}_{\kappa}}(t\mathbf{n}_{\kappa})}{\sigma_{\kappa}{\mathbf{n}_{\kappa}}^{1/2}}\right)_{t\in[0,1]}$ and let $\zeta=\left(F^{br}_{\lambda}(t)\right)_{t\in[0,1]}$ . By (1.6) and by Skorokhod’s representation theorem, we may work in a probability space in which $\zeta_{\kappa}\overset{a.s.}{\rightarrow}\zeta$ . Let $E$ be the set of excursion intervals of $\zeta$ . Then Lemma 4.9 guarantees equation (4.3) in Lemma 4.8 is true and Lemma 4.10 guarantees that the complement of $\cup_{I\in E}I$ has Lebesgue measure 0, as required by Lemma 4.8. For each $\kappa$ let $t_{\kappa,0}=0$ and for $1\leq j\leq c_{\kappa}$ let $t_{\kappa,j}$ be such that $\mathbf{n}_{\kappa}t_{\kappa,j}$ is the time the depth-first walk $S_{\mathbb{F}_{\kappa}}$ finishes visiting the $j-$ th tree of $\mathbb{F}_{\kappa}$ . Then almost surely, condition (i) of Lemma 4.8 is clearly true and condition (iii) is also true since for each $1\leq j\leq c_{\kappa},~{}\zeta_{\kappa}(t_{\kappa,j})=\zeta_{\kappa}(t_{\kappa,j-1})-\frac{1}{\sigma_{\kappa}\mathbf{n}_{\kappa}^{1/2}}$ . The definition of Lukasiewicz walk guarantees that the times at which $\frac{S_{\mathbb{F}_{\kappa}}(t\mathbf{n}_{\kappa})}{\sigma_{\kappa}{\mathbf{n}_{\kappa}}^{1/2}}$ hits a new minimum coincide with the times at which the walk finishes exploring the trees of the forest. Hence almost surely condition (ii) of Lemma 4.8 is also satisfied. Also note that the vector consisting of decreasingly ranked elements of $\{t_{\kappa,j}-t_{\kappa,j-1},1\leq j\leq c_{\kappa}\}$ is simply the scaled decreasing ordering of tree component sizes $(|\mathbb{T}_{\kappa,l}|/\mathbf{n}_{\kappa})_{1\leq l\leq c_{\kappa}}$ . Hence by Lemma 4.8 we know that

[TABLE]

which immediately implies weak convergence. Lemma 4.10 guarantees that this is true for any positive integer $j$ . We also have $h_{j}(F^{br}_{\lambda})\overset{d}{=}(|\gamma_{l}|)_{1\leq l\leq j}$ by definition, and (4.4) follows.

To prove (1.5) from (4.4), we only need to prove that for any $\epsilon>0$ , there exists $I_{0}\in\mathbb{N}$ such that $\limsup\limits_{\kappa\to\infty}{\mathbf{P}}\left(\sum\limits_{l>I_{0}}\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}>\epsilon\right)<\epsilon$ . Since by Lemma 4.10 we have $\sum\limits_{l}|\gamma_{l}|=1$ almost surely, in particular, $\lim\limits_{I\rightarrow\infty}{\mathbf{P}}\left(\sum\limits_{l>I}|\gamma_{l}|>\epsilon\right)=0$ . So there exists $I_{0}$ such that ${\mathbf{P}}\left(\sum\limits_{l>I_{0}}|\gamma_{l}|>\epsilon\right)<\epsilon/2$ . Let $A_{\kappa}$ be the event that $\sum\limits_{l\leq I_{0}}\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}<1-\epsilon$ and $A$ be the event that $\sum\limits_{l\leq I_{0}}|\gamma_{l}|<1-\epsilon$ (which has probability less than $\epsilon/2$ by our choice of $I_{0}$ ). By (4.4), we have $|{\mathbf{P}}\left(A_{\kappa}\right)-{\mathbf{P}}\left(A\right)|<\epsilon/2$ for $\kappa$ large enough. Therefore

[TABLE]

as required. ∎

5. Proof of Proposition 1.7 and Proposition 1.11

We assume that we have the conditions of Theorem 1.5 hold. In particular, we have a probability distribution p on $\mathbb{N}$ . Recall that $\sigma=\sigma(\textbf{p}),\sigma_{\kappa}=\sigma(\textbf{p}_{\kappa})$ . Let $\mathbf{s}_{\kappa,l}=(s^{(i)}_{\kappa,l},i\geq 0)$ denote the degree sequence of $\mathbb{T}_{\kappa,l}$ and let $\mathbf{n}_{\kappa,l}=n(\mathbf{s}_{\kappa,l})$ . Recall that $p_{\kappa}^{(i)}=s_{\kappa}^{(i)}/\mathbf{n}_{\kappa}$ and let $p_{\kappa,l}^{(i)}=s^{(i)}_{\kappa,l}/\mathbf{n}_{\kappa,l}$ be the empirical proportion of degree $i$ among all vertices of the $l-$ th largest tree $\mathbb{T}_{\kappa,l}$ . Note that $p_{\kappa}^{(i)}$ is deterministic while $p_{\kappa,l}^{(i)}$ is random.

First, we are going to prove Proposition 1.7 by using Theorem 1.10. To do so, we will have to first show that the assumptions of Theorem 1.10 are satisfied in our setting.

Proposition 5.1.

Under the assumption of Theorem 1.5, for all $l\geq 1$ , as $\kappa\rightarrow\infty$ we have

(a) $\textbf{p}_{\kappa,l}\overset{p}{\rightarrow}\textbf{p}$ coordinatewise, that is, $p^{(i)}_{\kappa,l}\overset{p}{\rightarrow}p^{(i)}$ for all $i\geq 1$ .

(b) $\sigma(\textbf{p}_{\kappa,l})\overset{p}{\rightarrow}\sigma(\textbf{p})$ .

Proof.

For (a), we know that by Lemma 3.4 and Proposition 3.5, for fixed $\epsilon>0,i,l\in\mathbb{N}$ and $\kappa$ large enough, we have

[TABLE]

For any $\epsilon^{\prime}>0$ , there exists $\delta>0$ such that ${\mathbf{P}}\left(|\gamma_{l}|<\delta\right)<\epsilon^{\prime}/2$ and by (4.4) we can find $\kappa_{0}$ such that for all $\kappa\geq\kappa_{0}$ we have ${\mathbf{P}}\left(\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}<\delta\right)\leq{\mathbf{P}}\left(|\gamma_{l}|<\delta\right)+\epsilon^{\prime}/2$ and $\mathbf{n}_{\kappa}^{-3/4}<\delta$ . Hence ${\mathbf{P}}\left(|\mathbb{T}_{\kappa,l}|\leq\mathbf{n}_{\kappa}^{1/4}\right)={\mathbf{P}}\left(\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}\leq\mathbf{n}_{\kappa}^{-3/4}\right)\leq{\mathbf{P}}\left(\frac{|\mathbb{T}_{\kappa,l}|}{\mathbf{n}_{\kappa}}\leq\delta\right)<\epsilon^{\prime}$ . Hence ${\mathbf{P}}\left(|\mathbb{T}_{\kappa,l}|\leq\mathbf{n}_{\kappa}^{1/4}\right)=o(1)$ as $\kappa\rightarrow\infty$ . Therefore by (5.1) we know that $|p^{(i)}_{\kappa,l}-p^{(i)}_{\kappa}|\overset{p}{\rightarrow}0$ as $\kappa\rightarrow\infty$ , which implies (a) since by assumption of Theorem 1.5 we have $\textbf{p}_{\kappa}$ converges to p coordinatewise.

Now we proceed to prove (b). Fix $l\geq 1$ and $\delta>0$ , and let $\epsilon>0$ be small enough that

[TABLE]

Such $\epsilon$ exists by (4.4).

Then let $M$ be large enough that $\sigma^{2}_{\kappa,>M}:=\sum\limits_{i>M}i^{2}\frac{s_{\kappa}^{(i)}}{\mathbf{n}_{\kappa}}<\epsilon^{2}$ for all $\kappa$ (such $M$ exists since under the assumption of Theorem 1.5 $\sigma^{2}_{\kappa}$ converges). And let $\sigma^{2}_{\kappa,l,>M}=\sum\limits_{i>M}i^{2}\frac{s^{(i)}_{\kappa,l}}{\mathbf{n}_{\kappa,l}}$ similarly. Note that

[TABLE]

so if $\sigma^{2}_{\kappa,l,>M}>\epsilon$ then $|\mathbb{T}_{\kappa,l}|<\epsilon\mathbf{n}_{\kappa}$ . By the triangle inequality, we have

[TABLE]

Since $|p^{(i)}_{\kappa,l}-p^{(i)}_{\kappa}|\rightarrow 0$ in probability for all $i$ by part (a), and $\sum\limits_{i>M}i^{2}p^{(i)}_{\kappa}<\epsilon^{2}<\epsilon$ , and $\sigma(\textbf{p}_{\kappa})\rightarrow\sigma(\textbf{p})$ by assumption of Theorem 1.5, this yields that

[TABLE]

which proves part (b). ∎

Lemma 5.2.

Let $\Delta_{\kappa,l}$ be the largest degree of a vertex of $\mathbb{T}_{\kappa,l}$ . For any fixed $l$ , we have

[TABLE]

Proof.

For any $\delta>0$ , we need to prove $\lim\limits_{\kappa\to\infty}{\mathbf{P}}\left(\frac{\Delta_{\kappa,l}}{\sqrt{|\mathbb{T}_{\kappa,l}|}}>\delta\right)=0$ . For any $\epsilon>0$ , by Lemma 4.10 we can choose $\epsilon^{\prime}>0$ such that ${\mathbf{P}}\left(|\gamma_{l}|<\epsilon^{\prime}\right)\leq\epsilon/2$ . Then choose $\kappa_{0}$ such that when $\kappa\geq\kappa_{0}$ we have

[TABLE]

This is possible since $\Delta_{\kappa}=o(\mathbf{n}_{\kappa}^{1/2})$ by Remark 1.1 and $|\mathbb{T}_{\kappa,l}|/\mathbf{n}_{\kappa}\overset{d}{\rightarrow}|\gamma_{l}|$ by (4.4). Therefore

[TABLE]

hence the claim. ∎

With Proposition 5.1 and Lemma 5.2, we are now ready to give the proof of Proposition 1.7.

Proof of Proposition 1.7.

Let $\mathbf{s}_{\kappa,l}$ be the random degree sequence of the $l-$ th largest tree in the forest $\mathbb{F}_{\kappa}$ . Then by Proposition 1.8, we have

[TABLE]

By Proposition 5.1 and Lemma 5.2, we know we can apply Theorem 1.10 to $\mathcal{T}_{\kappa,l}$ to conclude that for each fixed $l\leq j$ ,

[TABLE]

where $(\mathbf{e}_{l})_{l\leq j}$ are independent copies of $\mathbf{e}$ . Since the trees $(\mathcal{T}_{\kappa,l},l\leq j)$ are conditionally independent given their degree sequences, it follows that

[TABLE]

The result follows by Brownian scaling. ∎

Finally, we give the proof of Proposition 1.11 based on Proposition 3.10, with the assumptions of Theorem 1.6.

Proof of Proposition 1.11.

By assumption we have $\sigma_{\kappa}\rightarrow\sigma\in(0,\infty)$ and $s^{(1)}_{\kappa}/|\mathbf{s}_{\kappa}|\rightarrow p^{(1)}<1$ . Fix $\rho>0$ and let $\epsilon>0$ be such that $2\epsilon<\sigma^{2}<\frac{1}{2\epsilon}$ . Then let $\beta_{0}=\beta_{0}(\rho,\epsilon)$ be as in Proposition 3.10, so that for all $n$ sufficiently large, if a degree sequence $\mathbf{s}$ satisfies $|\mathbf{s}|=n,\Delta(\mathbf{s})\leq n^{\frac{1-\epsilon}{2}},s^{(1)}\leq(1-\epsilon)|\mathbf{s}|$ and $\epsilon\leq\sigma^{2}(\mathbf{s})/n\leq 1/\epsilon$ , then for any $0<\beta<\beta_{0}$ ,

[TABLE]

For $\kappa$ sufficiently large, $\mathbf{s}_{\kappa}$ satisfies these conditions. Hence for any $0<\beta<\beta_{0}$ ,

[TABLE]

Finally, taking $\beta=(a/\sigma_{\kappa})^{8}$ in (5.2), since $\mathcal{T}_{\kappa,l}=\frac{\sigma_{\kappa}}{2\mathbf{n}_{\kappa}^{1/2}}\mathbb{T}_{\kappa,l}$ and for all $j>1/\beta$ we have $|\mathbb{T}_{\kappa,j}|<\beta\mathbf{n}_{\kappa}$ , it follows that for all $\kappa$ sufficiently large,

[TABLE]

Since $\mathrm{diam}(\mathcal{T}_{\kappa,l})\leq 2h(\mathcal{T}_{\kappa,l})$ , the result now follows easily. ∎

6. acknowledgements

I would like to thank Louigi Addario-Berry for suggesting this project and numerous helpful discussions thereafter. This work was partially supported by NSERC CGS and I thank the institution.

Appendix A proof of remark 1.1

Let’s restate remark 1.1 as the following lemma.

Lemma A.1.

Suppose distributions $\textbf{p}_{\kappa}$ converges to p coordinatewise and $\sigma(\textbf{p}_{\kappa})\rightarrow\sigma(\textbf{p})\in(0,\infty)$ and $\frac{c(\mathbf{s}_{\kappa})}{\mathbf{n}_{\kappa}^{1/2}}\rightarrow x\in(0,\infty)$ , then $\mu(\textbf{p}_{\kappa})\rightarrow\mu(\textbf{p})=1$ and $\Delta_{\kappa}/{\mathbf{n}_{\kappa}}^{1/2}\rightarrow 0$ as $\kappa\rightarrow\infty$ .

Proof.

First, since $0\leq\mu(\textbf{p})=\sum ip^{(i)}\leq\sum i^{2}p^{(i)}=\sigma^{2}(\textbf{p})<\infty$ , we have $\mu(\textbf{p})\in(0,\infty)$ . And we can compute the limit of $\mu(\textbf{p}_{\kappa})$ explicitly:

[TABLE]

by our assumption of the magnitude of $c_{\kappa}$ .

Next, since $\mathbf{p}_{\kappa}\rightarrow\mathbf{p}$ coordinatewise, for all $M\in\mathbb{N}$ we have

[TABLE]

It follows that

[TABLE]

where the final equality holds since $\sigma(\mathbf{p})<\infty$ and $\sigma(\mathbf{p}_{\kappa})\rightarrow\sigma(\mathbf{p})$ . Hence $\mu(\mathbf{p}_{\kappa})\rightarrow\mu(\mathbf{p})$ .

Since $\textbf{p}_{\kappa}\rightarrow\textbf{p}$ coordinatewise, it follows that for any integer $N$ ,

[TABLE]

Now let $\epsilon>0$ and let $N$ be large enough that $0<\sum\limits_{i\geq N}i^{2}p^{(i)}<\epsilon.$ Then for all $\kappa$ sufficiently large, $0<\sum\limits_{i\geq N}i^{2}p^{(i)}_{\kappa}<\epsilon$ . But $\sum\limits_{i\geq N}i^{2}p^{(i)}_{\kappa}\geq\epsilon\mathbbm{1}_{\Delta_{\kappa}\geq(\epsilon\mathbf{n}_{\kappa})^{1/2}}$ , so this implies that $\limsup\limits_{\kappa\rightarrow\infty}\frac{\Delta_{\kappa}}{\mathbf{n}_{\kappa}^{1/2}}\leq\epsilon^{1/2}$ . Since $\epsilon>0$ was arbitrary, the result follows. ∎

Appendix B proof of Remark 1.3

The following proposition will be useful for our justification of Remark 1.3 (see Lemma 2.4 in [23] for a version dealing with Gromov-Hausdorff distance instead of Gromov-Hausdorff-Prokhorov distance):

Proposition B.1 (Proposition 2.9 in [1]).

Let $f,g$ be two compactly supported non-negative continuous functions with $f(0)=g(0)=0$ . Then

[TABLE]

Now we prove the following result.

Proposition B.2.

The GH convergence in Theorem 1 in [16] can be strengthened to GHP convergence as in Theorem 1.10.

Proof.

Let $C_{\kappa}$ be the contour function of $\mathbb{T}_{\kappa}$ , define $\hat{C}_{\kappa}:[0,1]\rightarrow[0,\infty)$ by letting $\hat{C}_{\kappa}(t)=\frac{\sigma(\textbf{p}_{\kappa})}{2\mathbf{n}_{\kappa}^{1/2}}C_{\kappa}(2(\mathbf{n}_{\kappa}-1)t)$ , then it is shown in [16] (see Theorem 3 there) that $\hat{C}_{\kappa}\overset{d}{\rightarrow}\mathbf{e}$ in the space $C([0,1],\mathbb{R})$ , equipped with the supremum distance. By Proposition B.1 and Skorokhod’s representation theorem, it follows that $\mathcal{T}_{\hat{C}_{\kappa}}\overset{d}{\rightarrow}\mathcal{T}_{\mathbf{e}}$ in the GHP sense.

Next, metrically we may realize $\mathcal{T}_{\kappa}$ as the subspace of $\mathcal{T}_{\hat{C}_{\kappa}}$ consisting of the set $U$ of points whose distance from the root is an integer multiple of $\frac{\sigma(\textbf{p}_{\kappa})}{2\mathbf{n}_{\kappa}^{1/2}}$ . With this identification

[TABLE]

Moreover, the measure $\hat{\mu}_{\kappa}$ on $\mathcal{T}_{\hat{C}_{\kappa}}$ is the (normalized) length measure, and the measure $\mu_{\kappa}$ on $\mathcal{T}_{\kappa}$ is the uniform measure on its points. It follows that

[TABLE]

To see this, for each $u\in U$ which is not the root of $\mathcal{T}_{\kappa}$ , let $e_{u}$ be the parent edge of $u$ , which we view as a closed line segment of length $\epsilon=\frac{\sigma(\textbf{p}_{\kappa})}{2\mathbf{n}_{\kappa}^{1/2}}$ in $\mathcal{T}_{\hat{C}_{\kappa}}$ . For any non-empty set $S\subset U$ , we have $\mu_{\kappa}(S)=|S|/\mathbf{n}_{\kappa}.$ Hence

[TABLE]

where the first inequality is because for non-root $u\in S$ , we have $e_{u}\subset S^{\epsilon}$ . On the other hand, let $A$ be a closed set in $\mathcal{T}_{\hat{C}_{\kappa}}$ and let $l=|\{e\in E(\mathcal{T}_{\kappa}):A\cap e\neq\emptyset\}|$ . Then $A^{\epsilon}$ contains at least $l$ vertices of $\mathcal{T}_{\kappa}$ since no cycle exists, so

[TABLE]

Hence $d_{GHP}(\mathcal{T}_{\kappa},\mathcal{T}_{\hat{C}_{\kappa}})\overset{d}{\rightarrow}0$ . By the triangle inequality, it follows that $\mathcal{T}_{\kappa}\overset{d}{\rightarrow}\mathcal{T}_{\mathbf{e}}$ in the GHP sense. ∎

Bibliography28

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Abraham, R., Delmas, J.-F. and Hoscheit, P. (2014). Exit times for an increasing Lévy tree-valued process. Probab. Theory Related Fields 159 , 357–403.
2[2] Abraham, R., Delmas, J.-F. and Hoscheit, P. (2013). A note on the Gromov-Hausdorff-Prokhorov distance between (locally) compact metric measure spaces. Electron. J. Probab. 18 , 1–21.
3[3] Addario-Berry, L. (2012). Tail bounds for the height and width of a random tree with a given degree sequence. Random Structures and Algorithms 41 , 253–261.
4[4] Addario-Berry, L., Broutin, N. and Goldschmidt, C. (2010). Critical random graphs: limiting constructions and distributional properties, Electronic Journal of Probability , 15 , 741–775.
5[5] Aldous, D. (1985). Exchangeability and related topics, Ecole d’étë de probabilités de Saint-Flour, XIII. Lecture Notes in Mathematics, 1117 , Springer, Berlin, 1–198.
6[6] Aldous, D. (1991). The continuum random tree. I. Annals of Probability , 19 , 1–28.
7[7] Aldous, D. (1991). The continuum random tree. II. An Overview. Stochastic analysis (Durham, 1990) , London Math. Soc. Lecture Note Ser., 167 , Cambridge Univ. Press, Cambridge, 23–70.
8[8] Aldous, D. (1993). The continuum random tree. III. Annals of Probability , 21 , 248–289.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Scaling limit of random forests with prescribed degree sequences

Abstract.

2010 Mathematics Subject Classification:

1. Introduction

1.1. Definitions and Notation

Plane trees and forests

Definition 1.1**.**

Definition 1.2**.**

First passage bridge

Gromov-Hausdorff-Prokhorov distance

Theorem 1.3** (Theorem 2.5 in [2]).**

Real trees

Definition 1.4**.**

1.2. Statement of main theorems

Theorem 1.5**.**

Theorem 1.6**.**

Remark 1.1**.**

Remark 1.2**.**

1.3. Key ingredients of the paper

Proposition 1.7**.**

Proposition 1.8**.**

Theorem 1.9**.**

Theorem 1.10** (Theorem 1 in [16]).**

Remark 1.3**.**

Proposition 1.11**.**

Theorem 1.12** (Theorem 1 in [3]).**

Proposition 1.13**.**

Proof of Theorem 1.5 and Theorem 1.6.

2. An n−n-n−to−1-1−1 map transforming lattice bridge to first passage lattice bridge

Lemma 2.1**.**

Proof.

Lemma 2.2**.**

Proof.

Corollary 2.3**.**

Proof.

Lemma 2.4**.**

Proof.

Corollary 2.5**.**

3. Concentration results

3.1. Martingales of degree proportions of uniformly permuted degree sequence

Theorem 3.1** ([25], Theorem 3.15).**

Lemma 3.2**.**

Proof.

Proposition 3.3**.**

Proof.

Lemma 3.4**.**

Proof.

Proposition 3.5**.**

Proof.

3.2. Probability bound of trees of random forest having abnormally large height

Theorem 3.6** (Theorem 2.7 in [25]).**

Proposition 3.7** (Proposition 20.6 in [5]).**

Corollary 3.8**.**

Proof of Proposition 1.13.

Proposition 3.9**.**

Proof.

Proposition 3.10**.**

Proof.

4. Convergence of the Lukasiewicz walk of forest to first passage bridge

Proposition 4.1**.**

Theorem 4.2**.**

Proof of Proposition 4.1.

Theorem 4.3** ([10], Theorem 7).**

Remark 4.1**.**

Lemma 4.4**.**

Proof.

Lemma 4.5**.**

Proof.

Lemma 4.6**.**

Proof.

Lemma 4.7**.**

Proof.

Proof of Theorem 1.9.

Lemma 4.8**.**

Definition 1.1.

Definition 1.2.

Theorem 1.3 (Theorem 2.5 in [2]).

Definition 1.4.

Theorem 1.5.

Theorem 1.6.

Remark 1.1.

Remark 1.2.

Proposition 1.7.

Proposition 1.8.

Theorem 1.9.

Theorem 1.10 (Theorem 1 in [16]).

Remark 1.3.

Proposition 1.11.

Theorem 1.12 (Theorem 1 in [3]).

Proposition 1.13.

2. An $n-$ to $-1$ map transforming lattice bridge to first passage lattice bridge

Lemma 2.1.

Lemma 2.2.

Corollary 2.3.

Lemma 2.4.

Corollary 2.5.

Theorem 3.1 ([25], Theorem 3.15).

Lemma 3.2.

Proposition 3.3.

Lemma 3.4.

Proposition 3.5.

Theorem 3.6 (Theorem 2.7 in [25]).

Proposition 3.7 (Proposition 20.6 in [5]).

Corollary 3.8.

Proposition 3.9.

Proposition 3.10.

Proposition 4.1.

Theorem 4.2.

Theorem 4.3 ([10], Theorem 7).

Remark 4.1.

Lemma 4.4.

Lemma 4.5.

Lemma 4.6.

Lemma 4.7.

Lemma 4.8.

Lemma 4.9.

Lemma 4.10.

Proposition 5.1.

Lemma 5.2.

Lemma A.1.

Proposition B.1 (Proposition 2.9 in [1]).

Proposition B.2.