Geometry of weighted recursive and affine preferential attachment trees

Delphin Sénizergues

arXiv:1904.07115·math.PR·June 4, 2020


TL;DR

This paper analyzes the geometry of weighted recursive and affine preferential attachment trees, establishing their equivalence and deriving asymptotic properties of various tree statistics.

Contribution

It shows that affine preferential attachment trees can be represented as weighted recursive trees with random weights and proves convergence results for key tree metrics.

Findings

01

Affine preferential attachment trees are distributionally equivalent to weighted recursive trees with random weights.

02

Established almost sure scaling limits for degree sequences, height, and profile of the trees.

03

Proved weak convergence of measures associated with the tree structure.

Abstract

We study two models of growing recursive trees. For both models, initially the tree only contains one vertex $u_{1}$ and at each time $n \geq 2$ a new vertex $u_{n}$ is added to the tree and its parent is chosen randomly according to some rule. In the weighted recursive tree, we choose the parent $u_{k}$ of $u_{n}$ among $\{u_{1}, u_{2}, \dots, u_{n - 1}\}$ with probability proportional to $w_{k}$, where $(w_{n})_{n \geq 1}$ is some deterministic sequence that we fix beforehand. In the affine preferential attachment tree with fitnesses, the probability of choosing any $u_{k}$ is proportional to $a_{k} + \deg^{+} (u_{k})$, where $\deg^{+} (u_{k})$ denotes its current number of children, and the sequence of fitnesses $(a_{n})_{n \geq 1}$ is deterministic and chosen as a parameter of the model. We show that for any sequence $(a_{n})_{n \geq 1}$, the corresponding preferential attachment tree…

Equations

$$\forall k\in\{1,\dots,n\},\quad \mathbb{P}\left(K_{n+1}=k \mid \mathtt{T}_{1},\mathtt{T}_{2},\dots,\mathtt{T}_{n}\right)\propto w_{k}.$$

$$\forall k\in\{1,\dots,n\},\quad \mathbb{P}\left(J_{n+1}=k \mid \mathtt{P}_{1},\mathtt{P}_{2},\dots,\mathtt{P}_{n}\right)\propto \deg^{+}_{\mathtt{P}_{n}}(u_{k})+a_{k},$$

$$\mathsf{w}^{\mathbf{a}}_{1}=\mathsf{W}^{\mathbf{a}}_{1}=1\quad\text{and}\quad\forall n\geq 2,\ \mathsf{W}^{\mathbf{a}}_{n}=\prod_{k=1}^{n-1}\beta_{k}^{-1},$$

$$\mathsf{w}^{\mathbf{a}}_{k}=\lim_{n\rightarrow\infty}\frac{\deg^{+}_{\mathtt{P}_{n}}(u_{k})}{\deg^{+}_{\mathtt{P}_{n}}(u_{1})}\quad\text{almost surely.}$$

$$x_{n}\underset{n\rightarrow\infty}{\bowtie}y_{n}\quad\text{if and only if}\quad\exists\epsilon>0,\ x_{n}\underset{n\rightarrow\infty}{=}y_{n}\cdot\left(1+O\left(n^{-\epsilon}\right)\right).$$

$$A_{n}\underset{n\rightarrow\infty}{\bowtie}c\cdot n.$$

$$W_{n}\underset{n\rightarrow\infty}{\bowtie}\operatorname{cst}\cdot n^{\gamma},$$

$$\gamma=\frac{c}{c+1}.$$

$$W_{n}\underset{n\rightarrow\infty}{\sim}C\cdot n^{\gamma},$$

$$\deg^{+}_{\mathtt{T}_{n}}(u_{k})\underset{n\rightarrow\infty}{\sim}\frac{w_{k}}{C(1-\gamma)}\cdot n^{1-\gamma}.$$

$$n^{-(1-\gamma)}\cdot\left(\deg^{+}_{\mathtt{T}_{n}}(u_{1}),\deg^{+}_{\mathtt{T}_{n}}(u_{2}),\dots\right)\underset{n\rightarrow\infty}{\longrightarrow}\frac{1}{C(1-\gamma)}\cdot(w_{1},w_{2},\dots)$$

$$(\mathsf{m}^{\mathbf{a}}_{n})_{n\geq 1}:=\frac{1}{Z(1-\gamma)}\cdot(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}=\frac{c+1}{Z}\cdot(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}\quad\text{a.s.}$$

$$n^{-\frac{1}{c+1}}\cdot\left(\deg^{+}_{\mathtt{P}_{n}}(u_{1}),\deg^{+}_{\mathtt{P}_{n}}(u_{2}),\dots\right)\underset{n\rightarrow\infty}{\longrightarrow}(\mathsf{m}^{\mathbf{a}}_{1},\mathsf{m}^{\mathbf{a}}_{2},\dots).$$

$$\mathsf{M}^{\mathbf{a}}_{n}=\frac{c+1}{Z}\cdot\prod_{k=1}^{n-1}\beta_{k}^{-1},$$

$$\mathsf{M}^{\mathbf{a}}_{n}=\beta_{n}\cdot\mathsf{M}^{\mathbf{a}}_{n+1},$$

$$\mathbf{a}=a,b,b,b,\dots\quad\text{with }a>-1\text{ and }b>0,$$

$$\mathbf{a}=a,b_{1},b_{2},\dots,b_{\ell},b_{1},b_{2},\dots,b_{\ell},b_{1},b_{2}\dots\quad\text{with }a>-1\text{ and }b_{1},b_{2},\dots,b_{\ell}\in\mathbb{N}.$$

$$\mathbb{L}_{n}(k):=\#\left\{1\leq i\leq n \mid \operatorname{ht}(u_{i})=k\right\}$$

$$f_{\gamma}:z\mapsto f_{\gamma}(z):=1+\gamma\left(e^{z}-1-ze^{z}\right).$$

$$z_{+}:=\sup\left\{z\in\mathbb{R} \mid f_{\gamma}(z)>0\right\}\quad\text{and}\quad z_{-}:=\begin{cases}-\infty&\text{if }\gamma\leq 1,\\ \log\left(\frac{\gamma-1}{\gamma}\right)&\text{if }\gamma>1.\end{cases}$$

$$W_{n}\underset{n\rightarrow\infty}{\bowtie}\operatorname{cst}\cdot n^{\gamma}\quad\text{and}\quad\sum_{i=n}^{2n}w_{i}^{p}\leq n^{1+(\gamma-1)p+o(1)}.$$

$$\mathbb{L}_{n}(k)\underset{n\rightarrow\infty}{=}\frac{n}{\sqrt{2\pi\gamma\log n}}\exp\left\{-\frac{1}{2}\cdot\left(\frac{k-\gamma\log n}{\sqrt{\gamma\log n}}\right)^{2}\right\}+O\left(\frac{n}{\log n}\right),$$

$$\mathbb{L}_{n}\left(\lfloor\gamma e^{z}\log n\rfloor\right)=n^{f_{\gamma}(z)-\frac{1}{2}\frac{\log\log n}{\log n}+O\left(\frac{1}{\log n}\right)},$$

$$\frac{\operatorname{ht}(\mathtt{T}_{n})}{\log n}\underset{n\rightarrow\infty}{\longrightarrow}\gamma\cdot e^{z_{+}}.$$

$$\lim_{n\rightarrow\infty}\frac{\operatorname{ht}(\mathtt{T}_{n})}{\sum_{i=2}^{n}\frac{w_{i}}{W_{i}}}=\lim_{n\rightarrow\infty}\frac{\operatorname{ht}(u_{n})}{\sum_{i=2}^{n}\frac{w_{i}}{W_{i}}}=1,$$

$$d^{\mathrm{exp}}(u,v)=\begin{cases}0&\text{if }u=v,\\ \exp\left(-\operatorname{ht}(u\wedge v)\right)&\text{otherwise,}\end{cases}$$

$$\mu_{n}(\{u_{k}\})=\frac{w_{k}}{W_{n}}.$$

$$\left(\frac{\mu(T(ui))}{\mu(T(u))}\right)_{i\geq 1},\quad\text{for }u\in\mathbb{U},$$

$$W_{k}\underset{k\rightarrow\infty}{\sim}C\cdot k^{\gamma}.$$

$$\left(\deg^{+}_{\mathtt{T}_{n}}(u_{k})\right)_{n\geq 1}\overset{(d)}{=}\left(\sum_{i=k}^{n-1}\mathbf{1}_{\left\{U_{i}\leq\frac{w_{k}}{W_{i}}\right\}}\right)_{n\geq 1},$$


Full text


Abstract

We study two models of growing recursive trees. For both models, initially the tree only contains one vertex $u_{1}$ and at each time $n\geq 2$ a new vertex $u_{n}$ is added to the tree and its parent is chosen randomly according to some rule. In the weighted recursive tree, we choose the parent $u_{k}$ of $u_{n}$ among $\{u_{1},u_{2},\dots,u_{n-1}\}$ with probability proportional to $w_{k}$ , where $(w_{n})_{n\geq 1}$ is some deterministic sequence that we fix beforehand. In the affine preferential attachment tree with fitnesses, the probability of choosing any $u_{k}$ is proportional to $a_{k}+\deg^{+}(u_{k})$ , where $\deg^{+}(u_{k})$ denotes its current number of children, and the sequence of fitnesses $(a_{n})_{n\geq 1}$ is deterministic and chosen as a parameter of the model.

We show that for any sequence $(a_{n})_{n\geq 1}$ , the corresponding preferential attachment tree has the same distribution as some weighted recursive tree with a random sequence of weights (with some explicit distribution). We then prove almost sure scaling limit convergences for some statistics associated with weighted recursive trees as time goes to infinity, such as degree sequence, height, profile and also the weak convergence of some measures carried on the tree. Thanks to the connection between the two models, these results also apply to affine preferential attachment trees.

1 Introduction

The uniform recursive tree was introduced in the 1970s as an example of a random graph constructed by successive addition of vertices: starting from a tree with a single vertex, the vertices arrive one by one and the $n$-th vertex picks its parent uniformly at random from the $n-1$ vertices already present. Many properties of this tree were then investigated thanks to its particularly simple dynamics: number of leaves, profile, height, degrees, size of subtrees and others. We refer to the survey [48] and the more recent book [16, Section 6] for an overview of the results obtained for this model.

We consider a generalisation of the uniform recursive tree called the weighted recursive tree (WRT), which was introduced in [9] in 2006. In this model, each vertex is assigned a non-negative weight, constant in time. When a newcomer randomly picks its parent, it does so with probability proportional to those weights. Although more general than the uniform recursive tree, WRT’s have attracted far fewer contributions, see e.g. [31, 24]. In [31] those trees are studied because of their connection to a model of random walk with preferential relocation (a.k.a. "monkey walk"). The authors prove some limiting results for the distribution of the weight of vertices at different heights in the tree, for different assumptions on the weight sequence which cover a wide range of behaviours.

In this paper, we prove asymptotic results for this model about the degree sequence, the height, the profile and the convergence of some probability measures carried on the tree, mainly under some assumptions that ensure that the sequence $(w_{n})_{n\geq 1}$ describing the weights of the vertices in order of creation behaves roughly as a power of $n$. Our deepest result is the one that concerns the asymptotic behaviour of the profile of the tree, which is the function that maps each integer $k$ to the number of vertices in the tree at height $k$. Both the statement and the proof of this result are inspired by the work carried out in the last 20 years for different models of logarithmic trees, see [10, 11, 49, 46, 28]. They rely on the study of the Laplace transform of the profile using tools that ultimately date back to Biggins [6] in the context of the branching random walk, together with a Fourier inversion argument, which in our case is handled by a very precise theorem of [28]. The rest of our results and proofs on WRT’s are less involved and mostly rely on more elementary arguments, as well as a connection with Pemantle’s time-dependent Pólya urns, introduced in [40].

We will also consider another model of trees which we call the affine preferential attachment tree (PAT) with fitnesses. In this one, every vertex has a fixed fitness, and the probability of picking any vertex to be the parent of a newcomer is proportional to its fitness plus its current number of children.

The term "preferential attachment", coined by Barabási and Albert in [3], refers to the property that a vertex in the graph that has a high degree tends to increase its degree even more over time, also referred to as a "rich-get-richer" effect. Many different preferential attachment mechanisms have since been studied over the last two decades because the degree distribution that emerges from this type of construction shares some quantitative properties with real-world networks, see [50, 34] for good overviews of the vast literature on this subject.

In our case, one of our motivations for studying those trees arises from the analysis of some growing random graphs, developed in the companion paper [47]. The class of models that we study there is designed to encompass Rémy’s algorithm, described in [43], which creates a sequence of binary trees, and many of its natural generalizations, studied in [20, 32, 12, 22, 23, 44]. In particular, we show that the sequences of graphs obtained using these constructions, considered as metric spaces, almost surely converge in the scaling limit, for the so-called Gromov–Hausdorff–Prokhorov topology, towards a limiting random continuous metric space. This proof relies on a decomposition of our graphs along the structure of a tree, whose evolution is that of an affine preferential attachment tree with fitnesses. Notably, a crucial result that is needed in this argument is a uniform control over the degrees of all the vertices of the tree, which we prove in this paper.

Let us note that only a few contributions in the literature concern this particular model, where the fitness can depend on the vertex. In the case where the fitnesses are i.i.d., the model is considered for the first time in [18] and the first rigorous mathematical result can be found in [5]. Very recently, still in the case of i.i.d. fitnesses, it has been studied in more detail in [30] along with some other similar models. The authors study the asymptotic degree distribution and maximum degree in the tree and show that these can exhibit different behaviours according to the tail of the fitness distribution, which the authors classify as weak, strong and extreme disorder. Let us also mention two models that do not fall in our setting but are somewhat related, studied in [14] and [8], in which the reinforcement is affine in the degree of the vertices but there is some inhomogeneity between vertices. Instead of coming from different fitnesses associated to vertices like in our model, this inhomogeneity is introduced using a random initial degree, or respectively a random time of creation.

Our approach for studying this model relies on the connection between the PAT and the WRT models (this was already known in the field in the case of constant fitnesses but stated in a slightly different form, see [8, 4]). Indeed we shall see that using a de Finetti-type argument, a PAT can be seen as a WRT with a random sequence of weights that almost surely decays like a power of $n$ . This enables us to translate all of the results obtained for WRT’s to corresponding results for PAT’s, and hence prove asymptotics for degrees, height and profile of the tree. In particular, we prove the almost sure scaling limit convergence of the sequence of degrees of the vertices in the tree in an $\ell^{p}$ norm. For some regular sequences of fitnesses, we can explicitly describe the distribution of the limiting sequence using Beta, Gamma and Mittag-Leffler distributions. This relates in various ways to other results that can be found in the literature associated to preferential attachment trees or to urn models, contained in [33, 26, 37, 25, 38, 36, 35, 2].

1.1 Two related models of growing trees

Definitions.

For any sequence of non-negative real numbers $(w_{n})_{n\geq 1}$ with $w_{1}>0$, we define the distribution $\operatorname{WRT}((w_{n})_{n\geq 1})$ on sequences of growing rooted labelled trees (in fact, in the rest of the paper we will see them as plane trees, see Section 1.2.2), which is called the weighted recursive tree with weights $(w_{n})_{n\geq 1}$. We construct a sequence of rooted trees $(\mathtt{T}_{n})_{n\geq 1}$ starting from $\mathtt{T}_{1}$ containing only one root-vertex $u_{1}$ with label $1$ and let it evolve in the following manner: the tree $\mathtt{T}_{n+1}$ is obtained from $\mathtt{T}_{n}$ by adding a vertex $u_{n+1}$ with label $n+1$. The parent of this new vertex is chosen to be the vertex with label $K_{n+1}$, picked with probability proportional to its weight, that is

$$\forall k\in\{1,\dots,n\},\quad \mathbb{P}\left(K_{n+1}=k \mid \mathtt{T}_{1},\mathtt{T}_{2},\dots,\mathtt{T}_{n}\right)\propto w_{k}.$$

Remark that this conditional distribution does not depend on the evolution $\mathtt{T}_{1},\mathtt{T}_{2},\dots\mathtt{T}_{n}$ up to time $n$ , which ensures in particular that the random variables $K_{2},K_{3},\dots$ are independent. In this definition, we also allow sequences of weights $(\mathsf{w}_{n})_{n\geq 1}$ that are random and in this case the distribution $\operatorname{WRT}((\mathsf{w}_{n})_{n\geq 1})$ denotes the law of the random tree obtained by the above process conditionally on $(\mathsf{w}_{n})_{n\geq 1}$ , so that the obtained distribution on growing trees is a mixture of WRT with deterministic sequences of weights.
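The growth rule above can be sketched in a few lines of Python. This is only an illustrative simulation, not code from the paper: the function name `sample_wrt_parents` and the list-based encoding of the tree by its parent labels are assumptions made here, and the quadratic running time is accepted for the sake of readability.

```python
import random

def sample_wrt_parents(weights, rng=None):
    """Sample the parent labels of a weighted recursive tree (illustrative sketch).

    weights[k - 1] plays the role of w_k. The returned list satisfies
    parents[n] = label of the parent of u_n for n >= 2; index 0 is unused
    and parents[1] is None because the root u_1 has no parent.
    """
    rng = rng or random.Random()
    if not weights or weights[0] <= 0:
        raise ValueError("w_1 must be positive")
    parents = [None, None]
    for n in range(1, len(weights)):
        # K_{n+1} is drawn from {1, ..., n} with probability w_k / W_n,
        # independently of the past, as in the remark above.
        k = rng.choices(range(1, n + 1), weights=weights[:n])[0]
        parents.append(k)
    return parents
```

With all weights equal to $1$ this reduces to the uniform recursive tree; a vertex with weight $0$ is never chosen as a parent.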

Similarly, for any sequence $(a_{n})_{n\geq 1}$ of real numbers, with $a_{1}>-1$ and $a_{n}\geq 0$ for $n\geq 2$ , we define another model of growing tree. The construction goes on as before: $\mathtt{P}_{1}$ contains only one root-vertex $u_{1}$ with label $1$ and $\mathtt{P}_{n+1}$ is obtained from $\mathtt{P}_{n}$ by adding a vertex $u_{n+1}$ with label $n+1$ and the parent of the newcomer is chosen to be the vertex with label $J_{n+1}$ , where now

$$\forall k\in\{1,\dots,n\},\quad \mathbb{P}\left(J_{n+1}=k \mid \mathtt{P}_{1},\mathtt{P}_{2},\dots,\mathtt{P}_{n}\right)\propto \deg^{+}_{\mathtt{P}_{n}}(u_{k})+a_{k},$$

where $\deg^{+}_{\mathtt{P}_{n}}(\cdot)$ denotes the number of children in the tree $\mathtt{P}_{n}$. In the particular case where $n=1$, the second vertex $u_{2}$ is always defined as a child of $u_{1}$, even in the case $-1<a_{1}\leq 0$ for which the last display does not make sense. We call this sequence of trees an affine preferential attachment tree with fitnesses $(a_{n})_{n\geq 1}$ and its law is denoted by $\operatorname{PAT}((a_{n})_{n\geq 1})$.
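The preferential attachment rule can be simulated in the same spirit. The sketch below is an illustration under assumptions made here (the name `sample_pat_parents`, the parent-list encoding); it treats the first step separately, as in the definition, since the attachment scores may not be positive when $-1<a_{1}\leq 0$.

```python
import random

def sample_pat_parents(fitnesses, rng=None):
    """Sample the parent labels of an affine preferential attachment tree (sketch).

    fitnesses[k - 1] plays the role of a_k, with a_1 > -1 and a_n >= 0 for n >= 2.
    parents[n] is the label of the parent of u_n; parents[1] is None.
    """
    rng = rng or random.Random()
    if not fitnesses or fitnesses[0] <= -1 or any(a < 0 for a in fitnesses[1:]):
        raise ValueError("need a_1 > -1 and a_n >= 0 for n >= 2")
    n_vertices = len(fitnesses)
    parents = [None, None]
    children = [0] * (n_vertices + 1)  # children[k] = current deg^+(u_k)
    for n in range(1, n_vertices):
        if n == 1:
            k = 1  # u_2 is always a child of u_1
        else:
            # Score of u_j is deg^+(u_j) + a_j; u_1 has at least one child
            # by now, so its score is positive and the total is positive.
            scores = [children[j] + fitnesses[j - 1] for j in range(1, n + 1)]
            k = rng.choices(range(1, n + 1), weights=scores)[0]
        parents.append(k)
        children[k] += 1
    return parents
```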

Notation.

Here and in the rest of the paper, whenever we have any sequence of real numbers $(x_{n})_{n\geq 1}$, we write $\boldsymbol{x}=(x_{n})_{n\geq 1}$ in a bold font as a shorthand for the sequence itself, and $(X_{n})_{n\geq 1}$ with a capital letter to denote the sequence of partial sums defined for all $n\geq 1$ as $X_{n}:=\sum_{i=1}^{n}x_{i}$. In particular, we do so for sequences of fitnesses $(a_{n})_{n\geq 1}$, for deterministic sequences of weights $(w_{n})_{n\geq 1}$ and for random sequences of weights $(\mathsf{w}_{n})_{n\geq 1}$.

Representation result.

The following result gives a connection between these two models of growing trees. It is an analogue of the so-called "Pólya urn-representation" result described in [4, Theorem 2.1] or [8, Section 1.2] for related models, which already cover the case of constant sequences $\mathbf{a}$ .

For $a,b>0$ the distribution $\mathrm{Beta}(a,b)$ has density $\frac{\Gamma(a+b)}{\Gamma(a)\Gamma(b)}\cdot x^{a-1}(1-x)^{b-1}\cdot\mathbf{1}_{\left\{0\leq x\leq 1\right\}}$ with respect to Lebesgue measure. If $b=0$ and $a>0$, we use the convention that the distribution $\mathrm{Beta}(a,b)$ is a Dirac mass at $1$.

Theorem (WRT-representation of PAT’s). For any sequence $\mathbf{a}$ of fitnesses, we define the associated random sequence $\boldsymbol{\mathsf{w}}^{\mathbf{a}}=(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}$ as

$$\mathsf{w}^{\mathbf{a}}_{1}=\mathsf{W}^{\mathbf{a}}_{1}=1\quad\text{and}\quad\forall n\geq 2,\ \mathsf{W}^{\mathbf{a}}_{n}=\prod_{k=1}^{n-1}\beta_{k}^{-1},$$

where the $(\beta_{k})_{k\geq 1}$ are independent with respective distribution $\mathrm{Beta}(A_{k}+k,a_{k+1})$. Then, the distributions $\operatorname{PAT}(\mathbf{a})$ and $\operatorname{WRT}(\boldsymbol{\mathsf{w}}^{\mathbf{a}})$ coincide.
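To make the definition of the random weights concrete, here is a minimal Python sketch that samples $(\mathsf{w}^{\mathbf{a}}_{1},\dots,\mathsf{w}^{\mathbf{a}}_{N})$ from a finite list of fitnesses, using $\mathsf{W}^{\mathbf{a}}_{n}=\prod_{k<n}\beta_{k}^{-1}$ and $\mathsf{w}^{\mathbf{a}}_{n}=\mathsf{W}^{\mathbf{a}}_{n}-\mathsf{W}^{\mathbf{a}}_{n-1}$. The function name is an assumption made for this illustration, and the sketch relies on the standard library's `random.betavariate` without guarding against the (very unlikely) event that it returns exactly $0$.

```python
import random

def sample_wrt_weights_from_fitnesses(fitnesses, rng=None):
    """Sample the random weights w_1, ..., w_N attached to fitnesses a_1, ..., a_N.

    Uses W_1 = 1 and W_{k+1} = W_k / beta_k with independent
    beta_k ~ Beta(A_k + k, a_{k+1}), where A_k = a_1 + ... + a_k.
    """
    rng = rng or random.Random()
    weights = [1.0]    # w_1 = W_1 = 1
    cumulative = 1.0   # current value of W_k
    partial_sum = 0.0  # current value of A_k
    for k in range(1, len(fitnesses)):
        partial_sum += fitnesses[k - 1]
        b = fitnesses[k]  # this is a_{k+1}
        # Convention from the text: Beta(a, 0) is a Dirac mass at 1.
        beta_k = 1.0 if b == 0 else rng.betavariate(partial_sum + k, b)
        new_cumulative = cumulative / beta_k
        weights.append(new_cumulative - cumulative)  # w_{k+1} = W_{k+1} - W_k
        cumulative = new_cumulative
    return weights
```

Feeding such a sampled sequence into any simulator of the weighted recursive tree gives, by the theorem, a tree with the law $\operatorname{PAT}(\mathbf{a})$ restricted to the first $N$ vertices.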

The result of the theorem is obtained by studying the evolution of the degrees in the preferential attachment model $(\mathtt{P}_{n})_{n\geq 1}$. The key argument lies in the fact that we can describe the whole process $(\mathtt{P}_{n})_{n\geq 1}$ using a sequence of Pólya urns, related to the degrees of the vertices. The connection of the evolution of the degrees to Pólya urns in the context of preferential attachment models is well-known and was observed for the first time in [33]. It explains why Beta-distributed random variables appear in the limit. In our case, the theorem relies on applying the de Finetti theorem to this sequence of urns and on proving that those urns are jointly independent.

In fact, the result stated in the theorem can be made a bit more precise than an equality in distribution as soon as the sequence $\mathbf{a}$ is chosen in such a way that almost surely the degree of the first vertex $\deg^{+}_{\mathtt{P}_{n}}(u_{1})$ tends to infinity as $n\rightarrow\infty$. For example, it is easy to check that the condition $A_{n}=O(n)$ is sufficient to ensure this behaviour, and in this case we can state the following corollary.

Corollary.

For a sequence $\mathbf{a}$ such that $A_{n}=O(n)$, we can construct the sequence $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ from $(\mathtt{P}_{n})_{n\geq 1}$ in such a way that for all $k\geq 1$:

$$\mathsf{w}^{\mathbf{a}}_{k}=\lim_{n\rightarrow\infty}\frac{\deg^{+}_{\mathtt{P}_{n}}(u_{k})}{\deg^{+}_{\mathtt{P}_{n}}(u_{1})}\quad\text{almost surely.}\tag{4}$$

The obtained sequence has the distribution described in Theorem 1.1 and, conditionally on this sequence, $(\mathtt{P}_{n})_{n\geq 1}$ has distribution $\operatorname{WRT}(\boldsymbol{\mathsf{w}}^{\mathbf{a}})$.

In fact, and this is the content of Proposition 1.1 below, if $A_{n}$ grows linearly as some $c\cdot n$ with some $c>0$ then the sequence $(\mathsf{W}^{\mathbf{a}}_{n})_{n\geq 1}$ almost surely grows as some power of $n$ which depends on $c$. This is done using moment computations under the explicit definition of $(\mathsf{W}^{\mathbf{a}}_{n})_{n\geq 1}$ given by the theorem. In the rest of the paper, we investigate several properties of the WRT under this type of assumption for the sequence of weights, such as convergence of height, profile and measures carried on the tree. Thanks to this connection, our results will then also hold for the PAT under the assumption that $A_{n}$ grows linearly.

Assumptions on the sequences.

For two sequences $(x_{n})$ and $(y_{n})$ we say that

$$x_{n}\underset{n\rightarrow\infty}{\bowtie}y_{n}\quad\text{if and only if}\quad\exists\epsilon>0,\ x_{n}\underset{n\rightarrow\infty}{=}y_{n}\cdot\left(1+O\left(n^{-\epsilon}\right)\right).$$

Our main assumption for sequences $\mathbf{a}=(a_{n})_{n\geq 1}$ of fitnesses is the following ($H_{c}$), which is parametrised by some $c>0$ and ensures that the fitness of vertices is $c$ on average:

$$A_{n}\underset{n\rightarrow\infty}{\bowtie}c\cdot n.\tag{$H_{c}$}$$

For sequences of weights $\boldsymbol{w}=(w_{n})_{n\geq 1}$, we introduce the following hypothesis, which depends on a parameter $\gamma>0$:

$$W_{n}\underset{n\rightarrow\infty}{\bowtie}\operatorname{cst}\cdot n^{\gamma},\tag{$\square_{\gamma}$}$$

where $\operatorname{cst}$ denotes a positive constant. The following proposition ensures in particular that our assumption on sequences of fitnesses $\mathbf{a}$ translates to a power-law behaviour for the random sequence of cumulated weights $(\mathsf{W}^{\mathbf{a}}_{n})_{n\geq 1}$ defined in Theorem 1.1.

Proposition.

Suppose that there exists $c>0$ such that $\mathbf{a}$ satisfies ($H_{c}$). Then the random sequence $(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}$ defined in Theorem 1.1 almost surely satisfies ($\square_{\gamma}$) with

$$\gamma=\frac{c}{c+1}.$$

If furthermore $\mathbf{a}$ is such that $a_{n}\leq(n+1)^{c^{\prime}+o(1)}$ for some $c^{\prime}\in[0,1)$, then almost surely $\mathsf{w}^{\mathbf{a}}_{n}\leq(n+1)^{c^{\prime}-\frac{1}{c+1}+o_{\omega}(1)}$, where $o_{\omega}(1)$ is a random function of $n$ which tends to $0$ when $n\rightarrow\infty$.
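The exponent $\gamma=\frac{c}{c+1}$ can be guessed from a first-moment computation. The following is only a heuristic sketch, carried out under the simplifying assumption (made here, not in the text) that $a_{n}=c$ for every $n\geq 1$, so that $A_{k}+k=(c+1)k$. It uses the standard identity $\mathbb{E}[\beta^{-1}]=\frac{\alpha+b-1}{\alpha-1}$ for $\beta\sim\mathrm{Beta}(\alpha,b)$ with $\alpha>1$, together with the independence of the $\beta_{k}$.

```latex
\[
\mathbb{E}\left[\beta_{k}^{-1}\right]
  = \frac{(c+1)k + c - 1}{(c+1)k - 1}
  = 1 + \frac{c}{(c+1)k - 1},
\]
\[
\mathbb{E}\left[\mathsf{W}^{\mathbf{a}}_{n}\right]
  = \prod_{k=1}^{n-1}\left(1 + \frac{c}{(c+1)k - 1}\right)
  = \exp\left(\sum_{k=1}^{n-1}\frac{c}{(c+1)k} + O(1)\right)
  = \exp\left(\frac{c}{c+1}\log n + O(1)\right).
\]
```

So the mean of $\mathsf{W}^{\mathbf{a}}_{n}$ is of order $n^{c/(c+1)}$ in this special case. The proposition is much stronger, since it gives an almost sure statement with a polynomial error term under the sole assumption ($H_{c}$).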

Convergence of degrees using the WRT representation.

In the WRT with a deterministic sequence of weights $\boldsymbol{w}$ that satisfies

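$$W_{n}\underset{n\rightarrow\infty}{\sim}C\cdot n^{\gamma},$$

for some $\gamma\in(0,1)$, the degree of a fixed vertex evolves as a sum of independent Bernoulli random variables and it is possible to handle it with elementary methods and obtain

$$\deg^{+}_{\mathtt{T}_{n}}(u_{k})\underset{n\rightarrow\infty}{\sim}\frac{w_{k}}{C(1-\gamma)}\cdot n^{1-\gamma}.$$

Further calculations allow us to improve this statement to an almost sure convergence

$$n^{-(1-\gamma)}\cdot\left(\deg^{+}_{\mathtt{T}_{n}}(u_{1}),\deg^{+}_{\mathtt{T}_{n}}(u_{2}),\dots\right)\underset{n\rightarrow\infty}{\longrightarrow}\frac{1}{C(1-\gamma)}\cdot(w_{1},w_{2},\dots)$$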

in the space $\ell^{p}$ of $p$ -th power summable sequences, for weight sequences $\boldsymbol{w}$ that satisfy some additional control. A precise version of this statement is given in Proposition 2.1.
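The constant $\frac{w_{k}}{C(1-\gamma)}$ in the degree asymptotics can be read off from the expectation alone. As a sketch of the elementary computation (the almost sure statement needs a separate concentration argument, not reproduced here): the vertex $u_{k}$ gains a child at step $i+1$ with probability $w_{k}/W_{i}$, independently for $i=k,\dots,n-1$, so that for $\gamma\in(0,1)$ and $W_{i}\sim C\cdot i^{\gamma}$,

```latex
\[
\mathbb{E}\left[\deg^{+}_{\mathtt{T}_{n}}(u_{k})\right]
  = \sum_{i=k}^{n-1}\frac{w_{k}}{W_{i}}
  \underset{n\rightarrow\infty}{\sim} \frac{w_{k}}{C}\sum_{i=k}^{n-1} i^{-\gamma}
  \underset{n\rightarrow\infty}{\sim} \frac{w_{k}}{C}\cdot\frac{n^{1-\gamma}}{1-\gamma}.
\]
```

The first equivalence uses that the series diverges, so that only the large indices $i$ matter.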

Suppose that $\mathbf{a}$ satisfies ($H_{c}$) and consider $(\mathtt{P}_{n})_{n\geq 1}$ which has distribution $\operatorname{PAT}(\mathbf{a})$. Then, according to Theorem 1.1 and Corollary 1.1, we know that conditionally on the sequence $(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}$ obtained as in (4), the sequence $(\mathtt{P}_{n})_{n\geq 1}$ has distribution $\operatorname{WRT}((\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1})$. Also, thanks to Proposition 1.1, we know that there exists some random variable $Z$ such that $\mathsf{W}^{\mathbf{a}}_{n}\sim Z\cdot n^{\gamma}$ almost surely as $n\rightarrow\infty$, with $\gamma=\frac{c}{c+1}$. So let us introduce

$$(\mathsf{m}^{\mathbf{a}}_{n})_{n\geq 1}:=\frac{1}{Z(1-\gamma)}\cdot(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}=\frac{c+1}{Z}\cdot(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}\quad\text{a.s.}\tag{9}$$

Applying the convergence (8) conditionally on the sequence $(\mathsf{w}_{n}^{\mathbf{a}})_{n\geq 1}$ (or equivalently conditionally on $(\mathsf{m}_{n}^{\mathbf{a}})_{n\geq 1}$) yields an almost sure convergence in the product topology on sequences, which can be improved to an $\ell^{p}$ convergence if $\mathbf{a}$ satisfies some additional control, thanks to Proposition 2.1. This is stated below as a theorem.

Theorem. Suppose that $\mathbf{a}$ satisfies ($H_{c}$). Then for a sequence $(\mathtt{P}_{n})_{n\geq 1}\sim\operatorname{PAT}(\mathbf{a})$ we obtain the following almost sure convergence in the product topology

$$n^{-\frac{1}{c+1}}\cdot\left(\deg^{+}_{\mathtt{P}_{n}}(u_{1}),\deg^{+}_{\mathtt{P}_{n}}(u_{2}),\dots\right)\underset{n\rightarrow\infty}{\longrightarrow}(\mathsf{m}^{\mathbf{a}}_{1},\mathsf{m}^{\mathbf{a}}_{2},\dots).$$

Furthermore, if $a_{n}\leq(n+1)^{c^{\prime}+o(1)}$ for some $0\leq c^{\prime}<\frac{1}{c+1}$, the previous convergence also takes place in the space $\ell^{p}$ of $p$-th power summable sequences, for all $p>\frac{c+1}{1-(c+1)c^{\prime}}$.

Let us emphasize that the function $\mathrm{max}:\ell^{p}\rightarrow\mathbb{R}$ that outputs the maximum of a sequence is a continuous function, so that the scaling limit of the maximal degree in the tree $\mathtt{P}_{n}$ is ensured by the theorem whenever the appropriate condition on the sequence $\mathbf{a}$ is satisfied. Convergence of the rescaled degree of fixed vertices in preferential attachment trees is a well-known phenomenon in the case of preferential attachment trees with constant fitnesses, as is the convergence of the maximum of that sequence, see [33]. However, to the best of the author’s knowledge, Theorem 1.1 is the first result that ensures an almost sure convergence of the rescaled degrees as a sequence in such a topology. This improves the $\ell^{p}$ convergence proved in distribution in [36] for a related model, which we treat in Proposition 5.3.

The distribution of the limiting sequence $(\mathsf{m}_{n}^{\mathbf{a}})_{n\geq 1}$ can be characterized, and even has a reasonable description for certain regular sequences of fitnesses $\mathbf{a}$, as explained in the following paragraph. This result is actually related to the study of some urn models like the Pólya urns with immigration of [38] or the periodic Pólya urns of [2] and allows us to provide some alternative proofs and complete some of the known results about those processes. This is developed in Section 5.2.

Distribution of the limiting chain.

Let us say a word on the properties of the non-decreasing sequence $(\mathsf{M}^{\mathbf{a}}_{n})_{n\geq 1}$ that corresponds using our notation to the sequence $(\mathsf{m}_{n}^{\mathbf{a}})_{n\geq 1}$ defined in (9). Of course, using the random variables $(\beta_{n})_{n\geq 1}$ defined in Theorem 1.1, we can write for any $n\geq 1$ ,

$$\mathsf{M}^{\mathbf{a}}_{n}=\frac{c+1}{Z}\cdot\prod_{k=1}^{n-1}\beta_{k}^{-1},$$

but then because the random variable $Z$ depends on the whole sequence $(\beta_{n})_{n\geq 1}$, the sequence $(\mathsf{M}^{\mathbf{a}}_{n})_{n\geq 1}$ is not just an iterated product of independent random variables, as was the case for $(\mathsf{W}_{n}^{\mathbf{a}})_{n\geq 1}$. Nevertheless, the sequence still has the nice property of being a time-inhomogeneous Markov chain with a simple backward transition, characterised by the equality

$$\mathsf{M}^{\mathbf{a}}_{n}=\beta_{n}\cdot\mathsf{M}^{\mathbf{a}}_{n+1},$$

where $\beta_{n}$ is independent of $\mathsf{M}^{\mathbf{a}}_{n+1}$ and has distribution $\mathrm{Beta}(A_{n}+n,a_{n+1})$ . This is the content of Proposition 4.3.

For some specific choices of sequences $\mathbf{a}$ , the distribution of the chain $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ is explicit. Whenever $\mathbf{a}$ is of the form

$$\mathbf{a}=a,b,b,b,\dots\quad\text{with }a>-1\text{ and }b>0,$$

we retrieve Goldschmidt and Haas’ Mittag-Leffler Markov chain family, introduced in [21] and also studied by James [25].

The other case where the chain is explicit is when $\mathbf{a}$ is periodic starting from the second term, of the form

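$$\mathbf{a}=a,b_{1},b_{2},\dots,b_{\ell},b_{1},b_{2},\dots,b_{\ell},b_{1},b_{2}\dots\quad\text{with }a>-1\text{ and }b_{1},b_{2},\dots,b_{\ell}\in\mathbb{N}.$$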

Then the sequence $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ has an explicit distribution defined using products of Gamma-distributed random variables. We define it in Section 5.1.2.

1.2 Other geometric properties of weighted random trees

Let us now state the convergence for other statistics of weighted random trees, namely profile, height and probability measures. Here we let $(\mathtt{T}_{n})_{n\geq 1}$ be a sequence of trees evolving according to the distribution $\operatorname{WRT}(\boldsymbol{w})$ for some deterministic sequence $\boldsymbol{w}$ and state our results in this setting. Our results also apply to random sequences of weights $\boldsymbol{\mathsf{w}}$ that satisfy the assumptions of the theorems almost surely; hence they apply to PAT with appropriate sequences of fitnesses, thanks to Theorem 1.1 and Proposition 1.1.

1.2.1 Height and profile of WRT

Let

$$\mathbb{L}_{n}(k):=\#\left\{1\leq i\leq n \mid \operatorname{ht}(u_{i})=k\right\}$$

be the number of vertices of $\mathtt{T}_{n}$ at height $k$ . The function $k\mapsto\mathbb{L}_{n}(k)$ is called the profile of the tree $\mathtt{T}_{n}$ . The height of the tree is the maximal distance of a vertex to the root, which we can also express as $\operatorname{ht}(\mathtt{T}_{n}):=\max\left\{k\geq 0\mathrel{}\middle|\mathrel{}\mathbb{L}_{n}(k)>0\right\}$ . We are interested in the asymptotic behaviour of $\mathbb{L}_{n}$ and $\operatorname{ht}(\mathtt{T}_{n})$ as $n\rightarrow\infty$ .
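For a tree stored as a list of parent labels, the profile and the height defined above can be computed in one pass, because every vertex is created after its parent. The sketch below assumes this parent-list encoding (index $0$ unused, `parents[1]` set to `None` for the root); the function names are chosen here for illustration.

```python
from collections import Counter

def heights_from_parents(parents):
    """Return heights[n] = ht(u_n) for n = 1..N, given parents[n] < n."""
    heights = [None, 0]  # index 0 unused; the root u_1 has height 0
    for n in range(2, len(parents)):
        heights.append(heights[parents[n]] + 1)
    return heights

def profile(parents):
    """Return [L_n(0), L_n(1), ..., L_n(ht(T_n))]."""
    counts = Counter(heights_from_parents(parents)[1:])
    return [counts[k] for k in range(max(counts) + 1)]

def tree_height(parents):
    """Return ht(T_n), the largest k with L_n(k) > 0."""
    return len(profile(parents)) - 1
```

For instance, the tree in which $u_{2}$ and $u_{3}$ are children of $u_{1}$, $u_{4}$ is a child of $u_{2}$ and $u_{5}$ is a child of $u_{4}$ is encoded as `[None, None, 1, 1, 2, 4]`.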

In order to express our results, we need to introduce some quantities. For $\gamma>0$ , we define the function $f_{\gamma}:\mathbb{R}\rightarrow\mathbb{R}$ as

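$$f_{\gamma}:z\mapsto f_{\gamma}(z):=1+\gamma\left(e^{z}-1-ze^{z}\right).$$

This function is increasing on $(-\infty,0]$ and decreasing on $[0,\infty)$ with $f_{\gamma}(-\infty)=1-\gamma$ and $f_{\gamma}(0)=1$ and $f_{\gamma}(\infty)=-\infty$. We define $z_{+}$ and $z_{-}$ as

$$z_{+}:=\sup\left\{z\in\mathbb{R} \mid f_{\gamma}(z)>0\right\}\quad\text{and}\quad z_{-}:=\begin{cases}-\infty&\text{if }\gamma\leq 1,\\ \log\left(\frac{\gamma-1}{\gamma}\right)&\text{if }\gamma>1.\end{cases}$$

We are going to assume that we work with a sequence $\boldsymbol{w}$ which satisfies the following assumption ($\square_{\gamma}^{p}$) for some $\gamma>0$ and $p\in(1,2]$,

$$W_{n}\underset{n\rightarrow\infty}{\bowtie}\operatorname{cst}\cdot n^{\gamma}\quad\text{and}\quad\sum_{i=n}^{2n}w_{i}^{p}\leq n^{1+(\gamma-1)p+o(1)}.\tag{$\square_{\gamma}^{p}$}$$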

Thanks to Proposition 1.1, this property is almost surely satisfied for $\gamma=\frac{c}{c+1}$ by the random sequence $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ for any sequence $\mathbf{a}$ of fitnesses satisfying $A_{n}\underset{n\rightarrow\infty}{\bowtie}c\cdot n$ and $a_{n}\leq(n+1)^{o(1)}$.

Theorem. Suppose that there exist $\gamma>0$ and $p\in(1,2]$ such that the sequence $\boldsymbol{w}$ satisfies ($\square_{\gamma}^{p}$). Then, for a sequence of random trees $(\mathtt{T}_{n})_{n\geq 1}\sim\operatorname{WRT}(\boldsymbol{w})$, we have the almost sure asymptotics for the profile

$$\mathbb{L}_{n}(k)\underset{n\rightarrow\infty}{=}\frac{n}{\sqrt{2\pi\gamma\log n}}\exp\left\{-\frac{1}{2}\cdot\left(\frac{k-\gamma\log n}{\sqrt{\gamma\log n}}\right)^{2}\right\}+O\left(\frac{n}{\log n}\right),\tag{13}$$

where the error term is uniform in $k\geq 0$. Also for any compact $K\subset(z_{-},z_{+})$ we have almost surely for all $z\in K$

$$\mathbb{L}_{n}\left(\lfloor\gamma e^{z}\log n\rfloor\right)=n^{f_{\gamma}(z)-\frac{1}{2}\frac{\log\log n}{\log n}+O\left(\frac{1}{\log n}\right)},\tag{14}$$

where the error term is uniform in $z\in K$. Moreover, we have the almost sure convergence

$$\frac{\operatorname{ht}(\mathtt{T}_{n})}{\log n}\underset{n\rightarrow\infty}{\longrightarrow}\gamma\cdot e^{z_{+}}.\tag{15}$$

The proof of this result follows the path used for many similar results for trees with logarithmic growth (see [10, 11, 29]): we study the Laplace transform of the profile $z\mapsto\sum_{k=0}^{n}e^{zk}\mathbb{L}_{n}(k)$ on a domain of the complex plane and prove its convergence to some random analytic function when appropriately rescaled. Then, we apply [28, Theorem 2.1], which consists of a fine Fourier inversion argument and hence allows us to obtain precise asymptotics for $\mathbb{L}_{n}$. The application of the theorem in its full generality proves a so-called Edgeworth expansion for $\mathbb{L}_{n}$, which we express here in a weaker form by equations (13) and (14). The convergence (13) expresses that the profile is asymptotically close to a Gaussian shape centred around $\gamma\log n$ and with variance $\gamma\log n$, so that a majority of vertices have a height of order $\gamma\log n$. The second equation (14) provides the behaviour of the number of vertices at a given height, for heights that are not necessarily close to $\gamma\log n$ (for which the preceding result ensures that there are of order $\frac{n}{\sqrt{\log n}}$ vertices per level). According to this result, at height $\lfloor\gamma e^{z}\log n\rfloor$ for any $z\in(z_{-},z_{+})$ there are of order $\frac{n^{f_{\gamma}(z)}}{\sqrt{\log n}}$ vertices.
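The passage from the exponent in (14) to the order of magnitude $\frac{n^{f_{\gamma}(z)}}{\sqrt{\log n}}$ is a one-line computation, spelled out here as a reading aid; it only uses $n^{x}=\exp(x\log n)$.

```latex
\[
n^{\,f_{\gamma}(z)-\frac{1}{2}\frac{\log\log n}{\log n}+O\left(\frac{1}{\log n}\right)}
  = n^{f_{\gamma}(z)}\cdot\exp\left(-\tfrac{1}{2}\log\log n\right)\cdot e^{O(1)}
  = \frac{n^{f_{\gamma}(z)}}{\sqrt{\log n}}\cdot e^{O(1)} .
\]
```

Taking $z=0$, where $f_{\gamma}(0)=1$ and the level is $\lfloor\gamma\log n\rfloor$, recovers the order $\frac{n}{\sqrt{\log n}}$ given by (13) at the centre of the profile, so the two statements agree there.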

Note that the exponent $f_{\gamma}(z)$ is continuous in $z$ and tends to $0$ when $z\rightarrow z_{+}$ . Although this does not directly prove the convergence (15), it already provides a lower-bound for $\operatorname{ht}(\mathtt{T}_{n})$ since it ensures that asymptotically there always exist vertices at height $\lfloor\gamma e^{(z_{+}-\epsilon)}\log n\rfloor$ , for any small $\epsilon>0$ . The convergence of the height (15) can then be obtained by proving a corresponding upper-bound, which can be done using quite rough estimates.

This result includes the well-known asymptotics $\operatorname{ht}(\mathtt{T}_{n})\sim e\log n$ as $n\rightarrow\infty$ for the uniform random tree, proved for example in [15, 42]. Using the connection of preferential attachment trees to weighted recursive trees given by Theorem 1.1, it also includes the case of preferential attachment trees with constant fitnesses, for which similar results were proved, in [42] for the height and in [29, 28] for the asymptotic behaviour of the profile (13).

Remark.

One can also notice that, in the case $\gamma>1$ , the function $f_{\gamma}$ tends to a negative value $1-\gamma$ as $z\rightarrow-\infty$ so that it is understandable that the asymptotics (14) could not be valid for values of $z$ below a certain threshold. Nevertheless, one can check that $f_{\gamma}(z_{-})>0$ and wonder what happens for values of $z$ that are slightly below $z_{-}$ . In fact, in the proof, we first study the weighted profile of the tree, which corresponds to the total weight of vertices at every height instead of their number. In this case, the study of the corresponding Laplace transform is easier and would lead to a statement similar to Theorem 1.2.1 for the asymptotics of the weighted profile that would hold for any $z$ such that $f_{\gamma}(z)>0$ . A subsequent part of the proof then consists in transferring this result to the Laplace transform of the "true" profile of the tree, and this is the part that breaks down if $z$ is chosen smaller than $z_{-}$ .

As a complement to this result, let us mention that there is another case where we can compute the asymptotic height of the tree, which corresponds to sequences $\boldsymbol{w}$ that grow fast to infinity. For any sequence of weights $\boldsymbol{w}$ , a quantity of interest is $\sum_{i=2}^{n}\frac{w_{i}}{W_{i}}$ , which is the expected height of a vertex taken with probability proportional to its weight in $\mathtt{T}_{n}$ . When this quantity grows faster than logarithmically, we have the almost sure convergence (see Proposition 3.3 in Section 3.3)

[TABLE]

which indicates that all the action takes place at the very tip of the tree.
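The identification of $\sum_{i=2}^{n}\frac{w_{i}}{W_{i}}$ with the expected height of a weight-biased vertex can be checked by hand on small examples. The sketch below is only an illustration, with hypothetical function names, and assumes $w_{1}>0$. It computes the sum directly, and separately computes the expected height from the recursion $\mathbb{E}[\operatorname{ht}(u_{k})]=1+h_{k-1}$, where $h_{k-1}$ is the expected height of a vertex sampled proportionally to its weight in $\mathtt{T}_{k-1}$.

```python
from fractions import Fraction

def weighted_height_sum(w):
    """Return sum_{i=2}^{n} w_i / W_i for w = [w_1, ..., w_n], in exact arithmetic."""
    total, W = Fraction(0), Fraction(w[0])
    for wi in w[1:]:
        W += wi
        total += Fraction(wi) / W
    return total

def expected_weighted_height(w):
    """Expected height of a weight-biased vertex of T_n, computed vertex by vertex."""
    W = Fraction(w[0])
    h = Fraction(0)               # h_1 = 0: the root has height 0
    heights = [Fraction(0)]       # heights[k-1] = E[ht(u_k)]
    for wk in w[1:]:
        # The parent of the new vertex is a weight-biased vertex of the current tree.
        heights.append(1 + h)
        W += wk
        # zip stops at len(heights), i.e. at the vertices present so far.
        h = sum(Fraction(wj) * ej for wj, ej in zip(w, heights)) / W
    return h
```

For instance, with weights `[1, 2, 3, 4]` both functions return `Fraction(47, 30)`, that is $\frac{2}{3}+\frac{3}{6}+\frac{4}{10}$.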

1.2.2 Convergence of the weight measure

We also study the convergence of some natural probability measures defined on the trees $(\mathtt{T}_{n})_{n\geq 1}$ . This will prove useful for the applications developed in the companion paper [47].

Plane-tree framework.

For this result it will be easier to work with plane trees. We introduce the Ulam-Harris tree $\mathbb{U}=\bigcup_{n=0}^{\infty}\mathbb{N}^{n}$ , where $\mathbb{N}:=\{1,2,\dots\}$ , with the convention that $\mathbb{N}^{0}=\{\emptyset\}$ . Classically, a plane tree $\tau$ is defined as a non-empty subset of $\mathbb{U}$ such that

(i) if $v\in\tau$ and $v=ui$ for some $i\in\mathbb{N}$ , then $u\in\tau$ ,

(ii) for all $u\in\tau$ , there exists $\deg^{+}_{\tau}(u)\in\mathbb{N}\cup\{0\}$ such that for all $i\in\mathbb{N}$ , $ui\in\tau$ iff $i\leq\deg^{+}_{\tau}(u)$ .
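For finite sets, conditions (i) and (ii) are easy to test mechanically. The following Python sketch encodes words of $\mathbb{U}$ as tuples of positive integers, with the empty tuple standing for $\emptyset$. The encoding and the name `is_plane_tree` are choices made for this illustration. For a finite set, condition (ii) amounts to requiring that whenever $ui\in\tau$ with $i>1$, the left sibling $u(i-1)$ also belongs to $\tau$.

```python
def is_plane_tree(tau) -> bool:
    """Check conditions (i) and (ii) for a finite collection of words (tuples)."""
    tau = set(tau)
    if not tau:
        return False                  # a plane tree is non-empty
    for v in tau:
        if any(not isinstance(i, int) or i < 1 for i in v):
            return False              # letters must lie in {1, 2, ...}
        if v:
            parent, last = v[:-1], v[-1]
            if parent not in tau:
                return False          # condition (i): the parent is present
            if last > 1 and parent + (last - 1,) not in tau:
                return False          # condition (ii): no gap among the children
    return True
```

For example, `{(), (1,), (2,), (1, 1)}` is a plane tree, whereas `{(), (2,)}` is not, since the root would have a second child without a first one.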

We choose to construct our sequence $(\mathtt{T}_{n})_{n\geq 1}$ of weighted recursive trees as plane trees by considering that each time a vertex is added, it becomes the right-most child of its parent. In this way the vertices $(u_{1},u_{2}\dots)$ of the trees $(\mathtt{T}_{n})_{n\geq 1}$ , listed in order of arrival, form a sequence of elements of $\mathbb{U}$ . In fact, from now on, we will always assume that we use this particular embedded construction, both for the WRT and the PAT. Note that with this representation as unlabelled subsets of $\mathbb{U}$ , the tree $\mathtt{T}_{n}$ itself, for any $n\geq 1$ , does not contain information relative to the labelling (hence the weight) of its vertices, but this piece of information can be read from the sequence $(\mathtt{T}_{1},\mathtt{T}_{2},\dots,\mathtt{T}_{n})$ .
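The embedded construction can be sketched as follows. This is a minimal simulation, assuming $w_{1}>0$ and a finite list of weights, and the function name `sample_wrt_plane` is hypothetical. It returns the words $(u_{1},\dots,u_{n})$ in order of arrival, each new vertex being appended as the right-most child of its parent.

```python
import random

def sample_wrt_plane(w, rng):
    """Return the Ulam-Harris words (u_1, ..., u_n) of one weighted recursive tree.

    w is the list [w_1, ..., w_n]; rng is a random.Random instance.
    """
    words = [()]            # u_1 is the root, encoded by the empty word
    n_children = [0]        # current number of children of each vertex
    for n in range(1, len(w)):
        # Choose the parent among u_1..u_n with probability proportional to its weight.
        k = rng.choices(range(n), weights=w[:n])[0]
        n_children[k] += 1
        words.append(words[k] + (n_children[k],))   # new right-most child of u_{k+1}
        n_children.append(0)
    return words

rng = random.Random(0)
words = sample_wrt_plane([1.0] * 50, rng)
```

As noted in the text, the set `set(words)` alone forgets the labelling, whereas the list `words` retains the order of arrival and hence the weight attached to each vertex.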

We also denote $\partial\mathbb{U}=\mathbb{N}^{\mathbb{N}}$ , which can be interpreted as the set of infinite paths from the root to infinity, and write $\overline{\mathbb{U}}=\mathbb{U}\cup\partial\mathbb{U}$ . We classically endow this set with the distance

[TABLE]

where $u\wedge v$ denotes the most recent common ancestor of $u$ and $v$ , and the height $\operatorname{ht}(u)$ of a vertex $u\in\mathbb{U}$ is defined as the only $n$ such that $u\in\mathbb{N}^{n}$ . Note that even when $u,v\in\partial\mathbb{U}$ , their most recent common ancestor $u\wedge v$ belongs to $\mathbb{U}$ , as long as $u\neq v$ . Endowed with this distance, $\overline{\mathbb{U}}$ is then a complete separable metric space.

In the paper, except when proving results related to the weak convergence of measures, for which we use the topology generated by $\operatorname{d}^{\mathrm{exp}}$ , we consider $\mathbb{U}$ as a graph and compute distances between vertices using the corresponding graph distance, which we denote $\operatorname{d}$ . In particular, the height $\operatorname{ht}(u)$ of a vertex $u$ is always its graph distance $\operatorname{d}(\emptyset,u)$ to the root $\emptyset$ .

Convergence of measures.

For every $n\geq 1$ , we define the measure $\mu_{n}$ on $\mathbb{U}$ , which only charges the set $\{u_{1},\dots,u_{n}\}$ of vertices of $\mathtt{T}_{n}$ , with for any $1\leq k\leq n$ ,

$$\mu_{n}(\{u_{k}\})=\frac{w_{k}}{W_{n}}.$$

We refer to $\mu_{n}$ as the natural weight measure on $\mathtt{T}_{n}$ . The following theorem classifies the possible behaviours of $(\mu_{n})$ for any weight sequence.

Theorem 1.2.2.

The sequence $(\mu_{n})_{n\geq 1}$ converges almost surely weakly towards a limiting probability measure $\mu$ on $\overline{\mathbb{U}}$ . There are three possible behaviours for $\mu$ :

(i) If $\sum_{i=1}^{\infty}w_{i}<\infty$ , then $\mu$ is carried on $\mathbb{U}$ .

(ii) If $\sum_{i=1}^{\infty}w_{i}=\infty$ and $\sum_{i=1}^{\infty}\left(\frac{w_{i}}{W_{i}}\right)^{2}<\infty$ , then $\mu$ is diffuse and supported on $\partial\mathbb{U}$ .

(iii) If $\sum_{i=1}^{\infty}\left(\frac{w_{i}}{W_{i}}\right)^{2}=\infty$ then $\mu$ is concentrated on one point of $\partial\mathbb{U}$ .

This convergence can be extended to other natural measures on the tree, such as the uniform measure on $\mathtt{T}_{n}$ , or some "preferential attachment measure" which charges each vertex proportionally to some affine function of its degree. This is the content of Proposition 2.2.2. Note that in our case of interest, when the sequence $\boldsymbol{w}$ satisfies the assumption ( $\square_{\gamma}$ ) for some $\gamma>0$ , we are in case (ii) of the theorem.
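The three regimes are governed by the two series $\sum w_{i}$ and $\sum (w_{i}/W_{i})^{2}$. As a rough illustration, the Python sketch below computes their partial sums for three example sequences chosen here for convenience (they are not taken from the paper): $w_{i}=i^{-2}$, $w_{i}=1$ and $w_{i}=2^{i}$. Finite partial sums cannot decide convergence, so the numbers only suggest which case applies; the function name `regime_sums` is hypothetical and $w_{1}>0$ is assumed.

```python
def regime_sums(w):
    """Return (W_n, sum_{i<=n} (w_i / W_i)^2) for w = [w_1, ..., w_n]."""
    W, squares = 0.0, 0.0
    for wi in w:
        W += wi
        squares += (wi / W) ** 2
    return W, squares

n = 50
summable = regime_sums([i ** -2.0 for i in range(1, n + 1)])    # W_n stays bounded
constant = regime_sums([1.0] * n)                               # W_n = n, ratios 1/i
geometric = regime_sums([2.0 ** i for i in range(1, n + 1)])    # ratios stay above 1/2
```

For $w_{i}=1$ the ratios are $1/i$ and the squared series is bounded by $\pi^{2}/6$, which is consistent with case (ii). For $w_{i}=2^{i}$ each ratio exceeds $1/2$, so the partial sums of squares grow at least like $n/4$, which is consistent with case (iii).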

In the specific case of $\operatorname{WRT}$ (resp. $\operatorname{PAT}$ ) with a sequence of weights (resp. of fitnesses) that is constant starting from the second term, the measure $\mu$ has an explicit description: for any $u\in\mathbb{U}$ , writing $T(u):=\left\{uv\mathrel{}\middle|\mathrel{}v\in\overline{\mathbb{U}}\right\}$ for the sub-tree descending from $u$ , the sequences

[TABLE]

are independent and have an explicit $\mathrm{GEM}$ distribution (see [41] for a definition). Furthermore, the corresponding sequence of trees, conditionally on $\mu$ , can be described as a split tree. This result, along with other properties of these families of growing trees, can be found in [27].

1.3 Organisation of the paper

The paper is organised as follows.

We first investigate some properties of weighted random trees $(\mathtt{T}_{n})_{n\geq 1}$ with deterministic weight sequence $\boldsymbol{w}$ . In Section 2.1 we first prove Proposition 2.1 which states the convergence of the degree sequence using elementary methods. Then in Section 2.2, we prove the weak convergence of the weight measure $\mu_{n}$ to some limit $\mu$ and describe three regimes for its behaviour. We also study other natural measures related to the sequence of trees $(\mathtt{T}_{n})$ and prove that they also converge towards $\mu$ . For all these measures, our main tool consists in introducing martingales related to the mass of a subtree descending from a fixed vertex. This is the content of Theorem 1.2.2 and Proposition 2.2.2. In Section 3, we prove Theorem 1.2.1 about the convergence of the height and the profile of WRT. This is achieved by first proving the uniform convergence of a rescaled version of the Laplace transform of the profile on a complex domain, which is the content of Proposition 3. This ensures that we can use [28, Theorem 2.1] for the convergence of the profile. This convergence provides a lower-bound for the height of the tree; we then prove a matching upper-bound to obtain asymptotics for the height. We also prove Proposition 3.3, which identifies the asymptotic behaviour of the height of the tree in the case where the weights increase very fast.

Then we switch to studying a sequence $(\mathtt{P}_{n})_{n\geq 1}$ of preferential attachment trees with sequence of fitnesses $\mathbf{a}$ . In Section 4, we present a proof of Theorem 1.1 and Corollary 1.1 using a coupling between the preferential attachment process and a sequence of Pólya urn processes; this establishes that $(\mathtt{P}_{n})_{n\geq 1}$ can also be described as having distribution $\operatorname{WRT}(\boldsymbol{\mathsf{w}}^{\mathbf{a}})$ for a random sequence $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ . We then prove Proposition 1.1, which relates the properties of $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ to the ones of $\mathbf{a}$ . We finish the section by stating and proving Proposition 4.3, which shows that the sequence $(\mathsf{M}^{\mathbf{a}}_{n})$ defined above as some random multiple of $(\mathsf{W}^{\mathbf{a}}_{n})$ is a Markov chain. In Section 5, we identify in Proposition 5.1 the distribution of the chain $(\mathsf{M}^{\mathbf{a}}_{n})$ for particular sequences $\mathbf{a}$ using moment identifications. We then present two applications of this result, one concerning a model of Pólya urn with immigration and the other concerning another model of preferential attachment graphs, in Proposition 5.3.

Some technical results can be found in Appendix A.

Acknowledgements

The author would like to thank the anonymous referees for their numerous comments and suggestions that helped improve the presentation of this paper. He would also like to thank Philippe Marchal whose remarks led to an improvement in the generality of one of the results.

2 Measures and degrees in weighted random trees

In this section, we work with a sequence of trees $(\mathtt{T}_{n})_{n\geq 1}$ that has distribution $\operatorname{WRT}\left(\boldsymbol{w}\right)$ for a deterministic sequence $\boldsymbol{w}$ . We start with two statistics of the tree that are quite easy to analyse, namely the sequence of degrees of the vertices of the tree and also some natural measures defined on the tree.

2.1 Convergence of the degree sequence

We start the section by proving convergence for the sequence of degrees of the vertices in their order of creation under the $\operatorname{WRT}$ model. We suppose here that the sequence of weights $\boldsymbol{w}$ is such that there exist constants $C>0$ and $0<\gamma<1$ for which

[TABLE]

We write $\deg^{+}_{\mathtt{T}_{n}}(u_{k})$ for the out-degree of the vertex $u_{k}$ in $\mathtt{T}_{n}$ . For a fixed $k\geq 1$ remark that, as a sequence of random variables indexed by $n\geq 1$ , we have the equality in distribution

[TABLE]

with $(U_{i})_{i\geq 1}$ a sequence of independent uniform variables in $\mathopen{(}0\mathclose{}\mathpunct{},1\mathclose{)}$ . With this description of the distribution of the degrees of fixed vertices, only using some law of large numbers for the convergence and Chernoff bounds for the fluctuations we obtain the following result.

Proposition 2.1.

For a sequence of weights $\boldsymbol{w}$ satisfying (18), the following holds.

(i) We have the almost sure pointwise convergence

[TABLE]

(ii) If the sequence furthermore satisfies $w_{k}\leq(k+1)^{\gamma-1+c^{\prime}+o\mathopen{}\left(1\right)}$ for some constant $0\leq c^{\prime}<1-\gamma$ , then there exists a function of $k$ which goes to $0$ as $k\rightarrow\infty$ , also denoted $o(1)$ , such that for all $n$ large enough, we have for all $k\geq 1$

[TABLE]

and the convergence (20) holds almost surely in the space $\ell^{p}$ for all $p>\frac{1}{1-\gamma-c^{\prime}}$ .

Proof.

To prove (i), first remark that for any $k\geq 1$ such that $w_{k}\neq 0$ , thanks to (18), we have

[TABLE]

Using a version of the Borel-Cantelli lemma (see Lemma A in the appendix), we get that almost surely

[TABLE]

and hence $n^{-(1-\gamma)}\cdot\deg^{+}_{\mathtt{T}_{n}}(u_{k})\rightarrow\frac{w_{k}}{(1-\gamma)C}$ . For the indices $k$ for which $w_{k}=0$ , we of course have $\deg^{+}_{\mathtt{T}_{n}}(u_{k})=0$ almost surely for all $n\geq 1$ , and so the convergence also holds. This finishes the proof of (i).

For the second part of the statement, let us first compute

[TABLE]

where we have used the inequality $1+x\leq e^{x}$ . Now let $C^{\prime}$ be a constant such that for all $n\geq 1$ , we have $\sum_{i=1}^{n-1}\frac{1}{W_{i}}\leq C^{\prime}\cdot n^{1-\gamma}$ (such a constant exists because of the assumption (18)). For all $k\geq 1$ , we introduce the following

[TABLE]

where the real number $a>0$ is chosen in such a way that the function $x\mapsto x^{\gamma-1}\log(x+a)$ is decreasing on $\mathopen{(}0\mathclose{}\mathpunct{},\infty\mathclose{)}$ . Using Markov’s inequality, we get for any integers $k$ and $n$ such that $n\geq k$

[TABLE]

Using a union bound, the fact that $\deg^{+}_{\mathtt{T}_{n}}(u_{k})=0$ for any $k>n$ , and the definition of $\xi_{k}$ , we get that for all $n\geq 1$

[TABLE]

The last display is summable over all $n\geq 1$ and hence using the Borel-Cantelli lemma, we almost surely have for $n$ large enough

[TABLE]

We can conclude by noting that under our assumptions we have $\xi_{k}\leq(k+1)^{\gamma-1+c^{\prime}+o\mathopen{}\left(1\right)}$ . The convergence in $\ell^{p}$ for $p>\frac{1}{1-\gamma-c^{\prime}}$ is just obtained by dominated convergence using the componentwise convergence (20) and the $\ell^{p}$ domination (21). ∎
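The description of the degrees used in this section can be illustrated numerically. The sketch below reads (19) as a sum of independent Bernoulli indicators, where for each $i\in\{k,\dots,n-1\}$ the vertex $u_{i+1}$ attaches to $u_{k}$ with probability $\frac{w_{k}}{W_{i}}$; this reading follows from the attachment rule but is stated here as an assumption, since the display itself is not reproduced. The function names are hypothetical, and the code assumes $W_{k}>0$.

```python
from fractions import Fraction
import random

def expected_out_degree(w, k, n):
    """E[deg^+_{T_n}(u_k)] = w_k * sum_{i=k}^{n-1} 1 / W_i (k and n are 1-based)."""
    W = [Fraction(0)]
    for wi in w[:n]:
        W.append(W[-1] + wi)              # W[i] = w_1 + ... + w_i
    return sum(Fraction(w[k - 1]) / W[i] for i in range(k, n))

def sample_out_degree(w, k, n, rng):
    """One sample of the Bernoulli sum describing deg^+_{T_n}(u_k)."""
    W, deg = float(sum(w[:k])), 0         # W = W_k before the first draw
    for i in range(k, n):
        if rng.random() < w[k - 1] / W:   # u_{i+1} attaches to u_k
            deg += 1
        W += w[i]                         # W becomes W_{i+1}
    return deg
```

For constant weights and $k=1$, $n=4$, the expected degree is $1+\frac{1}{2}+\frac{1}{3}=\frac{11}{6}$. Under (18), the sum $\sum_{i=k}^{n-1}W_{i}^{-1}$ grows like a constant times $n^{1-\gamma}$, which is the normalisation appearing in the proposition.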

2.2 Convergence of measures

The goal of this section is to prove Theorem 1.2.2, which concerns the convergence of the sequence of weight measures $(\mu_{n})$ seen as measures on $\overline{\mathbb{U}}$ . One of the key arguments is the fact that the weight of the subtree descending from a fixed vertex can be described using a generalised Pólya urn scheme, as studied by Pemantle [40]. We also prove Proposition 2.2.2, which states the weak convergence of other measures.

Time-dependent Pólya urn scheme.

Let us start by describing an urn process, following Pemantle [40]. Let $a,b$ be two non-negative real numbers, with $a+b>0$ , and $k\geq 1$ be an integer and $(s_{n})_{n\geq k+1}$ be a sequence of non-negative real numbers. We refer to the following process as a time-dependent Pólya urn starting at time $k$ with $a$ red balls and $b$ black balls and weight sequence $(s_{n})_{n\geq k+1}$ :

•

At time $k$ , the urn contains $a$ red balls and $b$ black balls (these numbers of balls are not required to be integers).

•

Then at every time $n\geq k+1$ , a ball is drawn at random and replaced in the urn, along with $s_{n}$ additional balls of the same colour.

For any $n\geq k$ we call $R_{n}$ the proportion of red balls in the urn at time $n$ . We can easily check that $(R_{n})_{n\geq k}$ is a martingale in its own filtration, with values in $\mathopen{[}0\mathclose{}\mathpunct{},1\mathclose{]}$ . As a result, it converges as $n\rightarrow\infty$ a.s. and in $L^{1}$ towards some random variable $R_{\infty}$ .
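The martingale property can be verified in one line of algebra, and the following Python sketch checks it in exact arithmetic for one step of the urn, together with a small simulation of the proportion of red balls. The function names are hypothetical and the code is only an illustration of the urn described above.

```python
from fractions import Fraction
import random

def expected_next_proportion(red, black, s):
    """E[R_{n+1} | urn content] when s balls of the drawn colour are added."""
    red, black, s = Fraction(red), Fraction(black), Fraction(s)
    total = red + black
    p_red = red / total
    # Draw red with probability p_red, otherwise draw black.
    return p_red * (red + s) / (total + s) + (1 - p_red) * red / (total + s)

def simulate_urn(red, black, weights, rng):
    """Return the proportions of red balls R_k, R_{k+1}, ... along one run."""
    red, black = float(red), float(black)
    proportions = [red / (red + black)]
    for s in weights:
        if rng.random() < red / (red + black):
            red += s
        else:
            black += s
        proportions.append(red / (red + black))
    return proportions
```

For instance, starting from $2$ red and $3$ black balls and adding $5$ balls, the expected next proportion is $\frac{2}{5}\cdot\frac{7}{10}+\frac{3}{5}\cdot\frac{2}{10}=\frac{2}{5}$, the current proportion.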

Characterization of the convergence of probability measures over $\overline{\mathbb{U}}$ .

Recall from the introduction the definition of the Ulam-Harris tree $\mathbb{U}=\bigcup_{n=0}^{\infty}\mathbb{N}^{n}$ and its completed version $\overline{\mathbb{U}}=\mathbb{U}\cup\partial\mathbb{U}$ , which is endowed with the distance $\operatorname{d}^{\mathrm{exp}}$ defined in (16). Recall that $(\overline{\mathbb{U}},\operatorname{d}^{\mathrm{exp}})$ is a separable and complete metric space.

For any $u\in\mathbb{U}$ , we write $T(u):=\left\{uv\mathrel{}\middle|\mathrel{}v\in\overline{\mathbb{U}}\right\}$ for the subtree descending from $u$ . In $\overline{\mathbb{U}}$ there is an easy characterisation of the weak convergence of sequences of probability measures defined on the Borel- $\sigma$ -field associated to $\operatorname{d}^{\mathrm{exp}}$ , which is a direct consequence of the Portmanteau theorem (see e.g. [7, Theorem 2.1]):

Lemma 2.2.

Let $(\pi_{n})_{n\geq 1}$ be a sequence of Borel probability measures on $\overline{\mathbb{U}}$ . Then $(\pi_{n})_{n\geq 1}$ converges weakly to a probability measure $\pi$ if and only if for any $u\in\mathbb{U}$ ,

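$$\pi_{n}(\{u\})\underset{n\rightarrow\infty}{\rightarrow}\pi(\{u\})\quad\text{and}\quad\pi_{n}(T(u))\underset{n\rightarrow\infty}{\rightarrow}\pi(T(u)).$$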

Let us provide a proof of this lemma for completeness.

Proof.

We can check that the sets of the form $\{u\}$ for $u\in\mathbb{U}$ , or $T(u)$ for $u\in\mathbb{U}$ , are clopen for the topology generated by $\operatorname{d}^{\mathrm{exp}}$ , so by the Portmanteau theorem this already proves the "only if" part of the lemma. Conversely, suppose that the condition on $(\pi_{n})_{n\geq 1}$ is satisfied. We can check that every open set $\mathcal{O}$ can be written as a countable disjoint union of these clopen sets (see for example [45, Lemma 1.2] for a similar statement for the topology of $\partial\mathbb{U}$ ), which we write $\mathcal{O}=\bigsqcup_{k\geq 1}\mathcal{O}_{k}$ . Then, using Fatou’s lemma and the $\sigma$ -additivity of measures we get

[TABLE]

We conclude using the Portmanteau theorem again. ∎

2.2.1 Proof of Theorem 1.2.2.

We are going to apply this criterion to our sequence $(\mu_{n})_{n\geq 1}$ , which, we recall, is defined in such a way that for all $n\geq 1$ , the measure $\mu_{n}$ charges only the vertices $\{u_{1},u_{2},\dots,u_{n}\}$ of the tree $\mathtt{T}_{n}$ , and such that for any $1\leq k\leq n$ ,

$$\mu_{n}(\{u_{k}\})=\frac{w_{k}}{W_{n}}.$$

Proof of Theorem 1.2.2.

We can already see that if $(W_{n})_{n\geq 1}$ is bounded and hence converges to some $W_{\infty}$ we have $\mu_{n}(\{u_{k}\})\rightarrow\frac{w_{k}}{W_{\infty}}$ as $n\rightarrow\infty$ . In this case it is easy to verify that $(\mu_{n})$ weakly converges to the measure $\mu$ which is such that $\mu(\{u_{k}\})=\frac{w_{k}}{W_{\infty}}$ . In this case $\mu(\mathbb{U})=1$ and so $\mu$ is carried on $\mathbb{U}$ .

Let us now assume that $W_{n}\rightarrow\infty$ as $n\rightarrow\infty$ and show that in this case, $\mu_{n}$ converges weakly to some limit $\mu$ that is carried on $\partial\mathbb{U}$ . In this case we have $\mu_{n}(\{u_{k}\})=\frac{w_{k}}{W_{n}}\rightarrow 0$ as $n\rightarrow\infty$ . Let us denote for all integers $n,k\geq 1$ ,

$$M_{n}^{(k)}:=\mu_{n}(T(u_{k})),$$

the proportion of the total mass above vertex $u_{k}$ at time $n$ . Remark that this quantity evolves as the proportion of red balls in a time-dependent Pólya urn scheme with weights $(w_{i})_{i\geq k+1}$ , starting at time $k$ with $W_{k-1}$ black balls and $w_{k}$ red balls. Hence for all $k\geq 1$ , the sequence $(M_{n}^{(k)})_{n\geq k}$ almost surely converges to a limit $M_{\infty}^{(k)}$ . Also, for any $u\in\mathbb{U}$ that does not receive a label in the process, the sequence $(\mu_{n}(T(u)))_{n\geq 1}$ (and also $(\mu_{n}(\{u\}))_{n\geq 1}$ ) is identically equal to zero. Hence we have convergence of $(\mu_{n}(\{u\}))_{n\geq 1}$ and $(\mu_{n}(T(u)))_{n\geq 1}$ for all $u\in\mathbb{U}$ .
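For a given realisation of the tree, the proportion $\mu_{n}(T(u_{k}))$ of the total mass above $u_{k}$ is straightforward to compute from the list of parents. The sketch below uses a hypothetical encoding in which `parents[i]` is the 1-based index of the parent of $u_{i+1}$ (with `parents[0]` set to `None` for the root); it is only meant to make the urn interpretation tangible.

```python
def subtree_mass(parents, w, k, n):
    """Return mu_n(T(u_k)): weight of u_k and its descendants in T_n, over W_n.

    parents[i] is the 1-based index of the parent of u_{i+1}; parents[0] is None.
    """
    in_subtree = [False] * (n + 1)        # index 0 is unused
    in_subtree[k] = True
    for j in range(k + 1, n + 1):
        # Parents arrive before their children, so in_subtree[parent] is final.
        in_subtree[j] = in_subtree[parents[j - 1]]
    W_n = sum(w[:n])
    return sum(w[j - 1] for j in range(1, n + 1) if in_subtree[j]) / W_n

parents = [None, 1, 1, 2, 3]
w = [1, 2, 3, 4, 5]
```

With this example, the subtree of $u_{2}$ in $\mathtt{T}_{5}$ consists of $u_{2}$ and $u_{4}$, so `subtree_mass(parents, w, 2, 5)` is $\frac{2+4}{15}$. Each new vertex adds its weight to the red side of the urn exactly when its parent already lies in $T(u_{k})$.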

The last step in order to prove the weak convergence of $(\mu_{n})_{n\geq 1}$ is to prove that the quantities that we obtain in the limit indeed define a probability measure on $\overline{\mathbb{U}}$ . If for all $u\in\mathbb{U}$ we have

[TABLE]

then it entails that $\mu_{n}\underset{n\rightarrow\infty}{\rightarrow}\mu$ , where $\mu$ is the unique probability measure on $\overline{\mathbb{U}}$ such that for all $u\in\mathbb{U}$ ,

[TABLE]

The existence of such a measure $\mu$ is ensured by the Kolmogorov extension theorem on the product space $\partial\mathbb{U}=\mathbb{N}^{\mathbb{N}}$ .

For any $u\notin\{u_{1},u_{2},\dots\}$ , the equality (23) is immediate, so let us prove it for all $u_{k}$ for $k\geq 1$ . For any $n,k,i\geq 1$ , let

[TABLE]

Using what we just proved, we know that for any $k,i$ , the quantity $M_{n}^{(k,i)}$ almost surely converges as $n\rightarrow\infty$ to some limit $M_{\infty}^{(k,i)}$ . Proving (23) reduces to proving that for any $k\geq 1$ , we almost surely have $M_{\infty}^{(k,i)}\underset{i\rightarrow\infty}{\rightarrow}0$ . By construction, the sequence $(M_{\infty}^{(k,i)})_{i\geq 1}$ is non-negative and non-increasing, hence it converges almost surely, so it suffices to prove that its almost sure limit is $0$ .

We define $\tau^{(k,i)}:=\inf\left\{n\geq 1\mathrel{}\middle|\mathrel{}u_{n}=u_{k}i\right\}$ , the time when the vertex $u_{k}$ receives its $i$ -th child in the growth procedure. Conditionally on the event $\{\tau^{(k,i)}=t\}$ , the process $(M_{n}^{(k,i)})_{n\geq t}$ evolves as the proportion of red balls in a time-dependent Pólya urn scheme, starting at time $t$ with $w_{k}$ red balls (that correspond to the weight of $u_{k}$ ) and $(W_{t}-w_{k})$ black balls (that correspond to the total weight of other vertices in the tree), and weights $(w_{m})_{m\geq t+1}$ . Hence we have

[TABLE]

On the event $\{\tau^{(k,i)}=\infty\}$ , we have $M_{n}^{(k,i)}=0$ for $n<k$ and $M_{n}^{(k,i)}=\mu_{n}(\{u_{k}\})$ for $n\geq k$ , which decreases almost surely to $0$ , so $M_{\infty}^{(k,i)}=0$ a.s. on that event.

Using the crude bound $\tau^{(k,i)}\geq i$ , which entails that $W_{\tau^{(k,i)}}\geq W_{i}$ almost surely, we get

[TABLE]

hence $M_{\infty}^{(k,i)}\underset{i\rightarrow\infty}{\rightarrow}0$ in $L^{1}$ , so its almost sure limit is also $0$ . In the end, by Lemma 2.2, the sequence of measures $(\mu_{n})$ almost surely converges weakly to a limit $\mu$ , and this measure only charges the set $\partial\mathbb{U}$ .

We just finished proving that, for any sequence of weights $\boldsymbol{w}$ , the sequence $(\mu_{n})_{n\geq 1}$ almost surely converges weakly to a probability measure $\mu$ . Furthermore, we proved that $\mu$ is carried on $\mathbb{U}$ when $(W_{n})$ is bounded and carried on $\partial\mathbb{U}$ when $W_{n}\rightarrow\infty$ . The proof is then finished by applying the lemma stated below. ∎

Lemma.

Suppose that $\sum_{n=1}^{\infty}w_{n}=\infty$ so that $\mu$ is carried on $\partial\mathbb{U}$ . Then either $\sum_{n=1}^{\infty}\left(\frac{w_{n}}{W_{n}}\right)^{2}<\infty$ and then $\mu$ is almost surely diffuse, or $\sum_{n=1}^{\infty}\left(\frac{w_{n}}{W_{n}}\right)^{2}=\infty$ and then $\mu$ is carried on one point of $\partial\mathbb{U}$ .

Proof.

For any $k\geq 1$ the process $(\mu_{n}(T(u_{k})))_{n\geq k}$ evolves as the proportion of red balls in a time-dependent Pólya urn started at time $k$ with $w_{k}$ red balls, $W_{k-1}$ black balls and a weight sequence $(w_{n})_{n\geq k+1}$ . By the work of Pemantle in [39], if we assume $\sum_{n=1}^{\infty}\left(\frac{w_{n}}{W_{n}}\right)^{2}=\infty$ then the limiting proportion $\mu(T(u_{k}))$ almost surely belongs to the set $\{0,1\}$ . This translates into the fact that $\mu(T(u))\in\{0,1\}$ almost surely for any $u\in\mathbb{U}$ , which entails that $\mu$ is almost surely carried on one leaf of $\partial\mathbb{U}$ .

Conversely, let us suppose that $\sum_{n=1}^{\infty}\left(\frac{w_{n}}{W_{n}}\right)^{2}<\infty$ and prove that this entails that the limiting measure $\mu$ is diffuse almost surely. Consider the function $(\cdot\wedge\cdot):\overline{\mathbb{U}}\times\overline{\mathbb{U}}\rightarrow\overline{\mathbb{U}}$ which associates to each pair $(u,v)$ their most recent common ancestor $u\wedge v$ in the completed tree $\overline{\mathbb{U}}$ . This function is continuous with respect to the distance $\operatorname{d}^{\mathrm{exp}}$ . Then, since $\mu_{n}\rightarrow\mu$ almost surely weakly, we also get the following almost sure weak convergence of the push-forward of the product measure $\mu_{n}\otimes\mu_{n}$ on $\mathbb{U}\times\mathbb{U}$ by the function $(\cdot\wedge\cdot)$ :

[TABLE]

Let us fix $n\geq 1$ and conditionally on $(\mathtt{T}_{1},\mathtt{T}_{2},\dots,\mathtt{T}_{n})$ , let $D_{n}$ and $D_{n}^{\prime}$ be two independent vertices taken under $\mu_{n}$ . Then, an argument taken from the proof of [13, Lemma 3.8] in a slightly different setting ensures that

[TABLE]

The argument goes as follows:

•

with probability $\left(\frac{w_{n}}{W_{n}}\right)^{2}$ , we have $D_{n}=D_{n}^{\prime}=D_{n}\wedge D_{n}^{\prime}=u_{n}$ ,

•

with probability $1-\left(\frac{w_{n}}{W_{n}}\right)^{2}$ , it is not the case, and we can check that conditionally on this event, the vertices $\left[D_{n}\right]_{n-1}$ and $\left[D_{n}^{\prime}\right]_{n-1}$ defined as the most recent ancestor in $\mathtt{T}_{n-1}$ of respectively $D_{n}$ and $D_{n}^{\prime}$ , are independent and taken under the measure $\mu_{n-1}$ , and that $D_{n}\wedge D_{n}^{\prime}=\left[D_{n}\right]_{n-1}\wedge\left[D_{n}^{\prime}\right]_{n-1}$ .

It then suffices to apply this in cascade to get the last display.
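Unrolling the two-case argument above gives, for each $1\leq k\leq n$, the probability $\left(\frac{w_{k}}{W_{k}}\right)^{2}\prod_{i=k+1}^{n}\left(1-\left(\frac{w_{i}}{W_{i}}\right)^{2}\right)$ that the most recent common ancestor is $u_{k}$. This formula is reconstructed here from the cascade description and should be read as an assumption about the precise form of the display. The Python sketch below, with the hypothetical name `mrca_distribution`, evaluates it in exact arithmetic and allows one to check that the values sum to $1$.

```python
from fractions import Fraction

def mrca_distribution(w):
    """Return [P(D_n ^ D_n' = u_k) for k = 1..n] by unrolling the cascade."""
    ratios, W = [], Fraction(0)
    for wi in w:
        W += wi
        ratios.append((Fraction(wi) / W) ** 2)    # (w_i / W_i)^2
    n = len(w)
    probs = []
    for k in range(n):
        p = ratios[k]                 # both projected points equal u_{k+1} at that step
        for i in range(k + 1, n):
            p *= 1 - ratios[i]        # steps k+2..n do not merge the two points
        probs.append(p)
    return probs
```

For two vertices of equal weight, the result is `[Fraction(3, 4), Fraction(1, 4)]`: the two points coincide at $u_{2}$ with probability $\frac{1}{4}$, and otherwise their common ancestor is the root.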

Note that thanks to the summability condition, the infinite product $\prod_{i=2}^{\infty}\left(1-\left(\frac{w_{i}}{W_{i}}\right)^{2}\right)$ is non-zero, and this suffices to ensure that the obtained sequence $(p_{k})_{k\geq 1}$ is a probability distribution. Thanks to the weak convergence (24), it corresponds to the (annealed) distribution $p_{k}=\mathbb{P}\left(D_{\infty}\wedge D_{\infty}^{\prime}=u_{k}\right)$ , where $D_{\infty}$ and $D_{\infty}^{\prime}$ are two independent points taken under the measure $\mu$ , conditionally on $(\mathtt{T}_{n})_{n\geq 1}$ . Now we can write

[TABLE]

where the inequality is due to the fact that the vertices $u_{1},u_{2},\dots,u_{k}$ have a height smaller than $k$ . Hence $\mathbb{P}\left(\operatorname{d}^{\mathrm{exp}}(D_{\infty},D_{\infty}^{\prime})=0\right)\leq\lim_{k\rightarrow\infty}\mathbb{P}\left(\operatorname{d}^{\mathrm{exp}}(D_{\infty},D_{\infty}^{\prime})\leq e^{-k}\right)=0$ . So, almost surely, two points taken independently under $\mu$ are different, and this ensures that $\mu$ is diffuse. ∎

2.2.2 Convergence of other sequences of measures.

We also study two other sequences of measures $(\eta_{n})$ and $(\nu_{n})$ carried on the Ulam tree $\mathbb{U}$ . For every $n\geq 2$ , these measures only charge the vertices $\{u_{1},u_{2},\dots,u_{n}\}$ in such a way that for any $1\leq k\leq n$ ,

[TABLE]

where $(b_{n})_{n\geq 1}$ is a sequence of real numbers such that $b_{1}>-1$ and $b_{n}\geq 0$ for all $n\geq 2$ . We write $B_{n}:=\sum_{k=1}^{n}b_{k}$ . We suppose that $B_{n}=O\mathopen{}\left(n\right)$ and that there exists $\epsilon>0$ such that $b_{n}=O\mathopen{}\left(n^{1-\epsilon}\right)$ . The assumptions on the sequence $(b_{n})_{n\geq 1}$ are chosen such that they are satisfied by a sequence $(a_{n})_{n\geq 1}$ of fitnesses that satisfies ( $H_{c}$ ) for some $c>0$ .

Proposition 2.2.2.

Under the assumptions $\sum_{n=1}^{\infty}w_{n}=\infty$ and $\sum_{n=1}^{\infty}\left(\frac{w_{n}}{W_{n}}\right)^{2}<\infty$ , the sequences $(\eta_{n})_{n\geq 1}$ and $(\nu_{n})_{n\geq 1}$ converge almost surely weakly towards the limiting measure $\mu$ on $\partial\mathbb{U}$ that is defined in Theorem 1.2.2.

The rest of this section is devoted to proving Proposition 2.2.2. We treat the two sequences of measures separately.

The degree measure.

Consider the sequence $(\eta_{n})_{n\geq 1}$ on $\overline{\mathbb{U}}$ . Since the sequence $(W_{n})_{n\geq 1}$ tends to infinity, we have $\eta_{n}(\{u\})\rightarrow 0$ for every $u\in\mathbb{U}$ . Indeed, using the equality in distribution (19) and Lemma A in the appendix, it is easy to see that either $\sum_{i=1}^{\infty}W_{i}^{-1}<\infty$ in which case the degrees $\deg^{+}_{\mathtt{T}_{n}}(u_{k})$ are eventually constant as $n\rightarrow\infty$ ; or $\sum_{i=1}^{\infty}W_{i}^{-1}=\infty$ , in which case we have the almost sure asymptotic behaviour $\deg^{+}_{\mathtt{T}_{n}}(u_{k})\sim w_{k}\cdot\sum_{i=k}^{n}W_{i}^{-1}$ . In both cases, for all $k\geq 1$ , we have $n^{-1}\deg^{+}_{\mathtt{T}_{n}}(u_{k})\rightarrow 0$ almost surely as $n\rightarrow\infty$ .

For all $n\geq k$ , we keep the notation $M_{n}^{(k)}=\mu_{n}(T(u_{k}))$ introduced in the proof of Theorem 1.2.2 and let

[TABLE]

We can check that

[TABLE]

Now, using that $\mathbb{P}\left(u_{n+1}\in T(u_{k})\mathrel{}\middle|\mathrel{}\mathcal{F}_{n}\right)=M_{n}^{(k)}$ and that $\mathbb{E}\left[M_{n+1}^{(k)}\mathrel{}\middle|\mathrel{}\mathcal{F}_{n}\right]=M_{n}^{(k)}$ , we get

[TABLE]

Hence, if we denote $X_{n}^{(k)}:=(B_{n}+n-1)\cdot\left(N_{n}^{(k)}-M_{n}^{(k)}\right)$ , then the last computation shows that $\left(X_{n}^{(k)}\right)_{n\geq k}$ is a martingale for the filtration generated by $(\mathtt{T}_{n})_{n\geq 1}$ . More precisely we can write

[TABLE]

hence we have

[TABLE]

Then, using [19, Chapter VII.9, Theorem 3], we get that if

[TABLE]

then $\frac{X_{n}^{(k)}}{n}\rightarrow 0$ a.s. as $n\rightarrow\infty$ , which would prove that $N_{n}^{(k)}\longrightarrow M_{\infty}^{(k)}$ as $n\rightarrow\infty$ . In our case, we can verify that (25) holds. Indeed, using the fact that we assumed that $B_{n}=O\mathopen{}\left(n\right)$ and $b_{n+1}=O\mathopen{}\left(n^{1-\epsilon}\right)$ , we have

[TABLE]

which is summable under our assumptions. In the end, using Lemma 2.2, we have the almost sure convergence

[TABLE]

The uniform measure on the vertices of $\mathtt{T}_{n}$ .

Consider the sequence $(\nu_{n})$ on $\overline{\mathbb{U}}$ . Fix $k\geq 1$ . For any $n\geq k$ we can write $\nu_{n}(T(u_{k}))=\frac{1}{n}\sum_{i=k}^{n}\mathbf{1}_{\left\{u_{i}\in T(u_{k})\right\}}$ . For any $i\geq k+1$ , we have $\mathsf{p}_{i}:=\mathbb{P}\left(u_{i}\in T(u_{k})\mathrel{}\middle|\mathrel{}\mathcal{F}_{i-1}\right)=\mu_{i-1}(T(u_{k}))$ , which tends a.s. to some limit $\mu(T(u_{k}))$ as $i\rightarrow\infty$ . Using Lemma A in the appendix, we have

[TABLE]

and also

[TABLE]

In both cases we get $\nu_{n}(T(u_{k}))\underset{n\rightarrow\infty}{\rightarrow}\lim_{i\rightarrow\infty}\mathsf{p}_{i}=\mu(T(u_{k}))$ almost surely. We also have for any $k\geq 1$ ,

[TABLE]

so we can conclude using Lemma 2.2 that almost surely $\nu_{n}\underset{n\rightarrow\infty}{\rightarrow}\mu$ weakly.

3 Height and profile of WRT

The main goal of this section is to prove Theorem 1.2.1 which gives asymptotics for the profile and height of the tree. Recall that we denote

[TABLE]

the number of vertices at height $k$ in the tree $\mathtt{T}_{n}$ . In order to get information on the sequence of functions $(k\mapsto\mathbb{L}_{n}(k))_{n\geq 1}$ we study their Laplace transform

[TABLE]

where the last expression is given using an integral against the probability measure $\nu_{n}$ defined in Section 2.2 as the uniform measure on the vertices of $\mathtt{T}_{n}$ . The key result in our approach is to prove the convergence of this sequence of analytic functions when appropriately rescaled, uniformly in $z$ on an open neighbourhood of $0$ in the complex plane. It then allows us to use [28, Theorem 2.1] and hence derive a convergence result for the profile. We actually start in Section 3.1 by studying the convergence of the similarly defined sequence of functions

[TABLE]

where we integrate with respect to the weight measure $\mu_{n}$ instead of the uniform measure $\nu_{n}$ as before. This one is easier to study because for every fixed $z\in\mathbb{C}$ , it defines a martingale as $n$ grows, up to some deterministic scaling. Then in Section 3.2, we make use of this first convergence and show that up to some deterministic multiplicative constant, the two sequences of integrals appearing in (26) and (27) are almost surely equivalent when $n$ tends to infinity.

Let us fix some $\gamma>0$ for this whole section. Throughout this section, we always work under the assumption that ( $\square_{\gamma}$ ) holds for the sequence $\boldsymbol{w}$ . For some results, we will assume that there exists $p\in\mathopen{(}1\mathclose{}\mathpunct{},2\mathclose{]}$ such that the stronger condition ( $\square_{\gamma}^{p}$ ) holds, i.e.

[TABLE]

We let $\phi:z\mapsto\gamma(e^{z}-1)$ be a function of a complex parameter $z$ and let $z\mapsto N_{n}(z)$ be the following rescaled version of the Laplace transform of the profile

[TABLE]

The proposition below ensures that the sequence $(z\mapsto N_{n}(z))_{n\geq 1}$ converges uniformly on all compact subsets of some domain $\mathscr{D}\subset\mathbb{C}$ to some limiting function $z\mapsto N_{\infty}(z)$ which does not vanish anywhere on the set $\mathscr{D}\cap\mathbb{R}$ , along with some more technical statements.

Proposition \thetheorem.

Suppose that the weight sequence $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ) for some $\gamma>0$ and some $p\in\mathopen{(}1\mathclose{}\mathpunct{},2\mathclose{]}$ . Then there exists a domain $\mathscr{D}\subset\mathbb{C}$ such that $\mathscr{D}\cap\mathbb{R}=\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ where $z_{-}<0$ and $z_{+}>0$ are defined as in (12), such that the following properties are satisfied.

(i)

With probability $1$ , the sequence of random analytic functions $(z\mapsto N_{n}(z))_{n\geq 1}$ converges uniformly on all compact subsets of $\mathscr{D}$ , as $n\rightarrow\infty$ , to some random analytic function $z\mapsto N_{\infty}(z)$ which satisfies $\mathbb{P}\left(N_{\infty}(z)\neq 0\text{ for all }z\in(z_{-},z_{+})\right)=1$ .

(ii)

For every compact subset $K\subset\mathscr{D}$ and $r\in\mathbb{N}$ , we can find an a.s. finite random variable $C_{K,r}$ such that for all $n\in\mathbb{N}$ ,

[TABLE]

(iii)

For every compact subset $K\subset\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ , every $0<a<\pi$ and $r\in\mathbb{N}$ ,

[TABLE]

Given the results of Proposition 3, we can apply [28, Theorem 2.1], whose conclusions for the sequence $(k\mapsto\mathbb{L}_{n}(k))_{n\geq 1}$ are the following. For any $k\geq 0,\ n\geq 1$ and $z\in\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ , we denote

[TABLE]

Then, for every integer $r\geq 0$ and every compact subset $K\subset\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ , we have the convergence

[TABLE]

where for all $j\geq 0$ , the (random) functions $G_{j}(x,z)$ are polynomials of degree at most $3$ in $x$ and are entirely determined from $\phi$ and $N_{\infty}$ , with $G_{0}=1$ , see [28, Equation (16)] for their complete definition. The asymptotics (13) and (14) stated in Theorem 1.2.1 follow from the last display. Indeed, (13) is obtained by letting $r=0$ and $z=0$ and using the fact that $N_{\infty}(0)=1$ almost surely. For (14), we let $r=0$ , and use $k=\left\lfloor\gamma e^{z}\log n\right\rfloor$ .

In Section 3.3, we complete the proof of Theorem 1.2.1 by computing the asymptotic behaviour of the height of the tree. Since the convergence of the profile already ensures that there almost surely are vertices at height $\gamma e^{(z_{+}-\epsilon)}\log n$ for $\epsilon>0$ small enough and all $n$ large enough, it suffices to prove a corresponding upper-bound in order to finish proving the convergence (15) in Theorem 1.2.1.

3.1 Study of the Laplace transform of the weighted profile

We study the sequence $\left(z\mapsto\sum_{i=1}^{n}\frac{w_{i}}{W_{n}}e^{z\operatorname{ht}(u_{i})}\right)_{n\geq 1}$ . The following lemma is the starting point of our analysis. In this section, we will use the notation $\mathcal{F}_{n}=\sigma(\mathtt{T}_{1},\mathtt{T}_{2},\dots,\mathtt{T}_{n})$ .

Lemma \thetheorem.

For all $z\in\mathbb{C}$ and all $n\geq 1$ , we have

[TABLE]

Proof.

Recall that conditionally on $\mathcal{F}_{n}$ , the $n+1$ -st vertex $u_{n+1}$ of $\mathtt{T}_{n+1}$ is a child of the vertex $u_{K_{n+1}}$ , where $\mathbb{P}\left(K_{n+1}=k\mathrel{}\middle|\mathrel{}\mathcal{F}_{n}\right)=\frac{w_{k}}{W_{n}}$ . We compute

[TABLE]

Taking conditional expectation with respect to $\mathcal{F}_{n}$ yields:

[TABLE]

This concludes the proof. ∎
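The one-step computation in this proof can be checked by brute force. The sketch below assumes that the lemma takes the form $\mathbb{E}\left[S_{n+1}(z)\mid\mathcal{F}_{n}\right]=\left(1+(e^{z}-1)\frac{w_{n+1}}{W_{n+1}}\right)S_{n}(z)$ with $S_{n}(z)=\sum_{i=1}^{n}\frac{w_{i}}{W_{n}}e^{z\operatorname{ht}(u_{i})}$ . The notation $S_{n}$ and this exact form are assumptions of the example, inferred from the non-vanishing condition on $1+(e^{z}-1)\frac{w_{i}}{W_{i}}$ used right after the proof. The code averages over all possible parents of the new vertex and compares with the predicted factor.

```python
import cmath


def weighted_transform(weights, heights, z):
    """S_n(z): sum of (w_i / W_n) * exp(z * ht(u_i))."""
    total = sum(weights)
    return sum(w * cmath.exp(z * h) for w, h in zip(weights, heights)) / total


def one_step_conditional_expectation(weights, heights, w_new, z):
    """Exact conditional expectation of S_{n+1}(z), enumerating the parent choice."""
    total = sum(weights)
    expectation = 0
    for k, w_k in enumerate(weights):
        # With probability w_k / W_n the new vertex is a child of vertex k.
        new_heights = heights + [heights[k] + 1]
        expectation += (w_k / total) * weighted_transform(weights + [w_new], new_heights, z)
    return expectation


def predicted_value(weights, heights, w_new, z):
    """Assumed closed form: (1 + (e^z - 1) * w_{n+1} / W_{n+1}) * S_n(z)."""
    total_next = sum(weights) + w_new
    factor = 1 + (cmath.exp(z) - 1) * w_new / total_next
    return factor * weighted_transform(weights, heights, z)


if __name__ == "__main__":
    weights, heights = [1.0, 2.0, 0.5], [0, 1, 1]
    z = 0.3 + 1.2j
    print(one_step_conditional_expectation(weights, heights, 3.0, z))
    print(predicted_value(weights, heights, 3.0, z))
```

Both printed values should agree up to floating-point error, for any finite tree, any positive weights and any complex $z$ .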

Let $J$ be an integer that we are going to fix later on. The last result ensures that if $z\in\mathbb{C}$ is such that $\forall i\geq J,\ 1+(e^{z}-1)\frac{w_{i}}{W_{i}}\neq 0$ , then we can define for all $n\geq J$

[TABLE]

and the sequence $(M_{n}(z))_{n\geq J}$ is a martingale. We want to prove results about the asymptotic behaviour of $(z\mapsto M_{n}(z))_{n\geq J}$ , uniformly in $z$ on an appropriate set. If $J$ is fixed, then there exist parameters $z$ with $\imaginary(z)\equiv\pi\pmod{2\pi}$ for which the sequence $(C_{n}(z))_{n\geq J}$ takes the value $0$ . Under the assumption ( $\square_{\gamma}$ ) on the sequence $\boldsymbol{w}$ , we know that $\frac{w_{n}}{W_{n}}\rightarrow 0$ as $n\rightarrow\infty$ . If we restrict ourselves to a set of the form $\left\{z\in\mathbb{C}\mathrel{}\middle|\mathrel{}\real(z)<x\right\}$ for some $x>0$ , then

[TABLE]

hence it suffices to take $J$ large enough in order for the sequence $(C_{n}(z))_{n\geq J}$ to only take non-zero values for all $z\in\left\{\xi\in\mathbb{C}\mathrel{}\middle|\mathrel{}\real(\xi)<x\right\}$ and all $n\geq J$ . In what follows we work on the set

[TABLE]

where $z_{+}$ is as defined in Proposition 3. For technical reasons, we also sometimes consider the larger set

[TABLE]

Using the preceding discussion, we fix $J\geq 1$ such that the sequence $z\mapsto(C_{n}(z))_{n\geq J}$ does not have any zero on $\mathscr{E}^{\prime}$ , so that $z\mapsto(M_{n}(z))_{n\geq J}$ is well-defined for all $z\in\mathscr{E}^{\prime}$ .

We introduce the following notation. Let $F(z,n)$ and $G(z,n)$ be two functions of a complex parameter $z$ and an integer $n\in\mathbb{N}$ . For $D\subset\mathbb{C}$ a set of the complex plane we write

[TABLE]

to express the fact that $F(z,n)$ is a big-$O$ (resp. small-$o$) of $G(z,n)$ as $n\rightarrow\infty$ , uniformly on every compact $K\subset D$ . Note that later in the paper, we will also use this notation for random functions of $z$ and $n$ when such a comparison holds almost surely.

Now, let us derive some information on the asymptotic behaviour of $C_{n}(z)$ .

Lemma \thetheorem.

Suppose that $\boldsymbol{w}$ satisfies ( $\square_{\gamma}$ ). Then there exists $\epsilon>0$ and an analytic function $z\mapsto c(z)$ on $\mathscr{E}^{\prime}$ such that

[TABLE]

Remark that the lemma implies that for any $z\in\mathscr{E}^{\prime}$ , we have

[TABLE]

as $n\rightarrow\infty$ . It is also immediate that $\mathbb{E}\left[\sum_{k=1}^{n}\frac{w_{k}}{W_{n}}e^{z\operatorname{ht}(u_{k})}\right]=\mathbb{E}\left[M_{J}(z)\right]\cdot C_{n}(z)$ satisfies the same asymptotics up to a constant, as soon as $z$ is such that $\mathbb{E}\left[M_{J}(z)\right]\neq 0$ .

Before proving the lemma, we state the following result which follows from elementary calculus. Its proof can be found in the appendix.

Lemma \thetheorem.

Suppose that $\boldsymbol{w}$ satisfies ( $\square_{\gamma}$ ). Then there exists $\epsilon>0$ such that

[TABLE]

Proof of Lemma 3.1.

We write $\operatorname{Log}$ for the principal value of the complex logarithm. For $z\in\mathbb{C}$ such that $\absolutevalue{z}<1$ we have $\operatorname{Log}(1+z)=\sum_{m=1}^{\infty}\frac{(-1)^{m-1}}{m}z^{m}$ . If for every $i\geq J$ and $z\in\mathscr{E}^{\prime}$ , we let

[TABLE]

which is well-defined thanks to our choice of $J$ , then $\absolutevalue{h(i,z)}=O_{\mathscr{E}^{\prime}}\mathopen{}\left(\left(\frac{w_{i}}{W_{i}}\right)^{2}\right)$ is summable in $i$ and the rest of the series is

[TABLE]

for some $\epsilon>0$ , thanks to Lemma 3.1. Then we write

[TABLE]

which yields using (31) and Lemma 3.1

[TABLE]

and $c(z)$ is an analytic function of $z$ , which finishes the proof. ∎
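The polynomial growth of $C_{n}(z)$ can be observed numerically. The sketch below makes two assumptions that are not taken from the paper: it uses the normalisation $C_{n}(z)=\prod_{i=J+1}^{n}\left(1+(e^{z}-1)\frac{w_{i}}{W_{i}}\right)$ , and it uses the example weights $w_{i}=i^{\gamma-1}$ , for which $W_{n}$ grows like $n^{\gamma}$ . It estimates the local growth exponent $\log_{2}\absolutevalue{C_{2n}(z)/C_{n}(z)}$ , which should be close to $\real\phi(z)$ with $\phi(z)=\gamma(e^{z}-1)$ .

```python
import cmath
import math


def phi(z, gamma):
    """phi(z) = gamma * (e^z - 1), as defined in the text."""
    return gamma * (cmath.exp(z) - 1)


def log_abs_C(weights, J, n, z):
    """log|C_n(z)| for the assumed product over i = J+1, ..., n (1-based indices)."""
    a = cmath.exp(z) - 1
    running_total = sum(weights[:J])  # W_J
    result = 0.0
    for i in range(J + 1, n + 1):
        running_total += weights[i - 1]  # now equal to W_i
        result += math.log(abs(1 + a * weights[i - 1] / running_total))
    return result


def local_exponent(weights, J, n, z):
    """Estimate the exponent alpha such that |C_n(z)| grows like n^alpha."""
    return (log_abs_C(weights, J, 2 * n, z) - log_abs_C(weights, J, n, z)) / math.log(2)


if __name__ == "__main__":
    gamma = 2.0
    weights = [float(i) ** (gamma - 1) for i in range(1, 40001)]
    z = 0.5
    print(local_exponent(weights, 1, 20000, z), phi(z, gamma).real)
```

Using the ratio between times $n$ and $2n$ removes the multiplicative constant $e^{c(z)}$ , so the estimate converges much faster than $\log\absolutevalue{C_{n}(z)}/\log n$ .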

Convergence of the martingales $(M_{n}(z))_{n\geq 1}$ .

When the parameter $z$ is a positive real number, the sequence $(M_{n}(z))_{n\geq 1}$ is a positive martingale and so it converges almost surely to some limit. We want to prove that these martingales converge almost surely and in $L^{1}$ for the largest possible range of parameters $z$ . For the rest of Section 3.1 and also in the subsequent Section 3.2, we assume that the weight sequence $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ) for some fixed parameters $\gamma>0$ and $p\in\mathopen{(}1\mathclose{}\mathpunct{},2\mathclose{]}$ .

We align our notation with the one used in [11, Theorem 2.2] which states something similar to our forthcoming Proposition 3.1 for another model, the binary search tree.

For any $z\in\mathscr{E}$ and $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ , we let

[TABLE]

For any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ , let $\mathscr{V}_{q}=\left\{z\in\mathscr{E}\mathrel{}\middle|\mathrel{}g(z,q)<0\right\}$ , and denote

[TABLE]

Lemma \thetheorem.

The set $\mathscr{V}$ is open and contains the open interval of real numbers $I_{\gamma}:=\left\{x\in\mathbb{R}\mathrel{}\middle|\mathrel{}\gamma(xe^{x}-e^{x}+1)-1<0\right\}$ which contains $0$ .

Proof.

Of course $\mathscr{V}$ is open as a union of open sets. For any real $x$ we have $g(x,1)=0$ . So, if $\frac{\partial g}{\partial q}(x,1)<0$ then there exists $q>1$ for which $g(x,q)<0$ . Since $\frac{\partial g}{\partial q}(x,1)=\gamma(xe^{x}-e^{x}+1)-1$ , the set $\mathscr{V}$ contains the interval $I_{\gamma}$ defined above. Since $\frac{\partial g}{\partial q}(0,1)=-1<0$ , we have $0\in I_{\gamma}$ . ∎
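The right endpoint of $I_{\gamma}$ is the positive root of $\gamma(xe^{x}-e^{x}+1)-1=0$ , and it is easy to compute. The following Python sketch uses plain bisection; the function names and the tolerance are choices made for this example. Bisection applies because the function equals $-1$ at $x=0$ and has derivative $\gamma xe^{x}>0$ on $(0,\infty)$ , so the positive root is unique.

```python
import math


def boundary_function(x, gamma):
    """gamma * (x e^x - e^x + 1) - 1, negative exactly on the interval I_gamma."""
    return gamma * (x * math.exp(x) - math.exp(x) + 1.0) - 1.0


def z_plus(gamma, tol=1e-12):
    """Positive root of boundary_function(., gamma), found by bisection."""
    lo, hi = 0.0, 1.0
    # Enlarge the bracket until the function is positive at hi.
    while boundary_function(hi, gamma) <= 0.0:
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if boundary_function(mid, gamma) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    for gamma in (0.5, 1.0, 3.0):
        root = z_plus(gamma)
        print(gamma, root, gamma * math.exp(root))
```

For $\gamma=1$ the root is $1$ , so $\gamma e^{z_{+}}=e$ , matching the classical $e\log n$ height of the uniform recursive tree.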

Proposition \thetheorem.

The sequence of functions $(z\mapsto M_{n}(z))_{n\geq J}$ converges uniformly almost surely and in $L^{1}$ towards an analytic function $z\mapsto M_{\infty}(z)$ on every compact subset of $\mathscr{V}$ . Furthermore, for any compact subset $K\subset\mathscr{V}$ , there exists a real $\epsilon(K)>0$ such that almost surely

[TABLE]

The proof of the proposition will follow from the next lemma, together with Lemma A, stated in the appendix.

Lemma \thetheorem.

For any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ and $z\in\mathscr{E}$ we have

[TABLE]

and also

[TABLE]

Proof.

For any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ and $n\geq J$ , we write

[TABLE]

Taking the $q$ -th power of the modulus on both sides and using the inequality $\absolutevalue{a+b}^{q}\leq 2^{q}\cdot(\absolutevalue{a}^{q}+\absolutevalue{b}^{q})$ , we get

[TABLE]

Using Lemma A in the appendix, we have for any $n\geq J$ ,

[TABLE]

Using the last display and equation (3.1), we get a recurrence inequality of the form

[TABLE]

where

[TABLE]

Applying (37) in cascade we get

[TABLE]

Now notice that from our assumption on the sequence $(w_{n})_{n\geq 1}$ we have

[TABLE]

On the other hand, since $z\in\mathscr{E}$ then $q\real z\in\mathscr{E}^{\prime}$ , so we can use Lemma 3.1 to get

[TABLE]

We conclude using the following lemma which is an application of Hölder’s inequality using the assumption ( $\square_{\gamma}^{p}$ ).

Lemma \thetheorem.

For any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ we have $\displaystyle\sum_{i=n}^{2n}\left(\frac{w_{i}}{W_{i}}\right)^{q}\leq n^{1-q+o\mathopen{}\left(1\right)}$ .

Together with (39), this proves that $(a_{n}(z))_{n\geq 1}$ is summable and so $\prod_{i=J}^{\infty}(1+a_{i}(z))=O_{\mathscr{E}}\mathopen{}\left(1\right)$ . Also

[TABLE]

and so $\sum_{i=J}^{n}b_{i}(z)=O_{\mathscr{E}}\mathopen{}\left(n^{0\vee g(z,q)+o_{\mathscr{E}}\mathopen{}\left(1\right)}\right)$ . Substituting this into (38) completes the proof of (34). In order to prove (35), we use Lemma A again and write

[TABLE]

Using Lemma 3.1 we get $\mathbb{E}\left[\absolutevalue{M_{2n}(z)-M_{n}(z)}^{q}\right]=O_{\mathscr{E}}\mathopen{}\left(n^{(1-q)\vee g(z,q)+o_{\mathscr{E}}\mathopen{}\left(1\right)}\right)$ which finishes the proof of the lemma. ∎

Proof of Proposition 3.1.

Any compact subset $K\subset\mathscr{V}$ can be covered by a finite number of sets $\mathscr{V}_{q}$ . The convergence result is then an application of Lemma A on each set $\mathscr{V}_{q}$ , with $\alpha(z)=0$ and, say, $\delta(z)=-\frac{1}{2}g(z,q)>0$ . The limiting function is analytic as a uniform limit of analytic functions. ∎

Zeros of the limit.

Now that we have proved that there exists a limiting function $z\mapsto M_{\infty}(z)$ defined on the set $\mathscr{V}$ , we are interested in the possible location of the zeros of this random function. In fact, the function $z\mapsto M_{\infty}(z)$ is related to the function $z\mapsto N_{\infty}(z)$ of Proposition 3, for which we aim to prove that it has almost surely no zero on some real interval $\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ which contains $0$ . We will prove a similar result for $z\mapsto M_{\infty}(z)$ in Lemma 3.1, and we start by proving the following weaker statement. Recall the definition of the interval $I_{\gamma}$ in Lemma 3.1.

Lemma \thetheorem.

For all $z\in I_{\gamma}$ , we have almost surely $M_{\infty}(z)>0$ . As a consequence, the number of zeros of the map $(z\mapsto M_{\infty}(z))$ on the interval $I_{\gamma}$ is almost surely at most countable.

Let us recall from (1) the definition of the sequence of independent random variables $(K_{2},K_{3},\dots)$ that is used to construct the trees $(\mathtt{T}_{n})_{n\geq 1}$ .

Proof of Lemma 3.1.

This follows from an application of Kolmogorov’s $0$-$1$ law. Indeed, fix $N\geq J$ and $z\in I_{\gamma}$ and for all $n\geq N$ , let

[TABLE]

where $\operatorname{d}$ denotes the graph distance in $\mathbb{U}$ , and the distance between a vertex and a subset of vertices is defined the usual way. The idea behind $(M_{n}^{(N)}(z))_{n\geq N}$ is that, up to a positive multiplicative constant (i.e. a deterministic constant that depends on $N$ and $z$ but not on $n$ ), it has the same distribution as the sequence $M_{n}(z)$ associated to the growth of the tree that one informally obtains by contracting all the vertices $\{u_{1},u_{2},\dots,u_{N}\}$ into one. Note that the growth of such a tree can be described as that of a weighted recursive tree with weight sequence $(W_{N},w_{N+1},w_{N+2},\dots)$ , which shares the same asymptotic property ( $\square_{\gamma}^{p}$ ) as the original sequence $(w_{n})_{n\geq 1}$ .

We claim the following:

(i)

Due to the above remarks, $(M_{n}^{(N)}(z))_{n\geq N}$ is a positive martingale which satisfies the same assumptions as $M_{n}(z)$ so it converges a.s. and in $L^{1}$ towards a non-negative limit, $M_{\infty}^{(N)}(z)$ , thanks to Proposition 3.1, which we can apply here because $z\in I_{\gamma}\subset\mathscr{V}$ .

(ii)

We have $(1\wedge e^{z})^{N}M_{n}^{(N)}(z)\leq M_{n}(z)\leq(1\vee e^{z})^{N}M_{n}^{(N)}(z)$ .

(iii)

The sequence $(M_{n}^{(N)}(z))_{n\geq N}$ , hence its limit $M_{\infty}^{(N)}(z)$ , is independent of the $N$ first steps of the construction, and is hence a measurable function of the sequence $(K_{n})_{n\geq N+1}$ .

Using all these observations we deduce that for any $N\geq J$ we have the equality of events $\{M_{\infty}(z)>0\}=\{M^{(N)}_{\infty}(z)>0\}$ . This proves that $\{M_{\infty}(z)>0\}$ is measurable with respect to the tail $\sigma$ -algebra generated by the sequence $(K_{n})_{n\geq 2}$ , which is a sequence of jointly independent random variables. Kolmogorov’s $0$-$1$ law then ensures that this event has probability $0$ or $1$ . By $L^{1}$ convergence we have $\mathbb{E}\left[M_{\infty}(z)\right]=\mathbb{E}\left[M_{J}(z)\right]>0$ and this proves our claim. It follows immediately that the limit $z\mapsto M_{\infty}(z)$ can only have finitely many zeros in any compact subset of $I_{\gamma}$ almost surely, because otherwise, by analyticity of $z\mapsto M_{\infty}(z)$ on the connected component of $\mathscr{V}$ that contains $I_{\gamma}$ , the function would be identically $0$ on $I_{\gamma}$ with positive probability. This ensures that the total number of zeros in $I_{\gamma}$ is at most countable and finishes the proof. ∎

Lemma \thetheorem.

The function $M_{\infty}(z)$ has almost surely no zero on $I_{\gamma}$ .

In order to prove this lemma, we use a self-similarity argument: essentially, if we take two vertices $u_{i}$ and $u_{j}$ in the tree, then conditionally on the sequences of vertices that are grafted above $u_{i}$ or above $u_{j}$ , the subtrees above $u_{i}$ and $u_{j}$ evolve as two independent weighted recursive trees. Using Proposition 3.1 and Lemma 3.1, the normalized Laplace transform of the weighted profile of each of those two subtrees should converge almost surely to some random analytic function on $\mathscr{V}$ which is non-negative on $I_{\gamma}$ and has at most countably many zeros on this interval. Since the two are independent, their zeros should not overlap and hence the sum of their contributions should result in a function that is positive on $I_{\gamma}$ .

Proof.

Let us formalise this line of reasoning. Using Theorem 1.2.2, we know that the measure $\mu$ on $\partial\mathbb{U}$ is almost surely diffuse, hence we can define

[TABLE]

and they are almost surely finite.

Let us consider the sequences $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{I^{(j)}})\right\}}\right)_{n\geq 1}$ for $j\in\{1,2\}$ , which record the times when a vertex is added to $T(u_{I^{(1)}})$ or $T(u_{I^{(2)}})$ , and work conditionally on them for the rest of the proof. We let

[TABLE]

which record respectively the number of vertices among $\{u_{1},u_{2},\dots,u_{n}\}$ that are in $T(u_{I^{(j)}})$ and conversely, the $k$ -th time where a vertex is added to $T(u_{I^{(j)}})$ in the construction of $(\mathtt{T}_{n})_{n\geq 1}$ . We let $w^{(j)}_{k}:=w_{\tau^{(j)}_{k}}$ and $W^{(j)}_{k}:=\sum_{i=1}^{k}w^{(j)}_{i}$ , and also $u_{k}^{(j)}:=u_{\tau_{k}^{(j)}}$ . We also define for $j\in\{1,2\}$ and $k\geq 1$

[TABLE]

the subtree hanging above $u_{I^{(j)}}$ at the time where it contains exactly $k$ vertices (translated to the origin in order to be considered as a plane tree).
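The bookkeeping behind $N_{n}^{(j)}$ , $\tau_{k}^{(j)}$ and $w_{k}^{(j)}$ is elementary and can be written down directly. The Python sketch below is an illustration with hypothetical names: it reads the counting process, the insertion times and the subsequence of weights from the indicator sequence of one subtree, as described in words above.

```python
def subtree_bookkeeping(indicator, weights):
    """Read N_n, tau_k and the sub-weight sequence from an indicator sequence.

    indicator[n-1] is True when u_n belongs to the subtree (times are 1-based).
    Returns (counts, taus, sub_weights) with counts[n-1] = N_n,
    taus[k-1] = tau_k and sub_weights[k-1] = w_{tau_k}.
    """
    counts = []
    taus = []
    running = 0
    for n, inside in enumerate(indicator, start=1):
        if inside:
            running += 1
            taus.append(n)  # the running-th vertex of the subtree arrives at time n
        counts.append(running)
    sub_weights = [weights[t - 1] for t in taus]
    return counts, taus, sub_weights


if __name__ == "__main__":
    indicator = [False, True, False, True, True]
    weights = [1.0, 2.0, 3.0, 4.0, 5.0]
    print(subtree_bookkeeping(indicator, weights))
```

The two sequences are inverse to each other in the sense that $N^{(j)}_{\tau^{(j)}_{k}}=k$ , which is what allows linear growth of $N_{n}^{(j)}$ to be translated into linear growth of $\tau_{k}^{(j)}$ .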

Let us state the following intermediate result, which we will prove at the end of the section. Note that the random sequences $(N_{n}^{(j)})_{n\geq 1}$ , $(\tau^{(j)}_{k})_{k\geq 1}$ and $(w^{(j)}_{k})_{k\geq 1}$ for $j\in\{1,2\}$ can be read from $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{I^{(j)}})\right\}}\right)_{n\geq 1}$ for $j\in\{1,2\}$ .

Lemma \thetheorem.

The following holds.

(i)

For $j\in\{1,2\}$ , we almost surely have $N_{n}^{(j)}\underset{n\rightarrow\infty}{\sim}\mu(T(u_{I^{(j)}}))\cdot n$ .

(ii)

For $j\in\{1,2\}$ , the sequence $(w^{(j)}_{k})_{k\geq 1}$ satisfies ( $\square_{\gamma}^{p}$ ) almost surely.

(iii)

Conditionally on the two sequences $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{I^{(1)}})\right\}}\right)_{n\geq 1}$ and $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{I^{(2)}})\right\}}\right)_{n\geq 1}$ , the sequences of trees $(\mathtt{T}^{(1)}_{k})_{k\geq 1}$ and $(\mathtt{T}^{(2)}_{k})_{k\geq 1}$ are independent and have respective distributions $\operatorname{WRT}((w^{(1)}_{k})_{k\geq 1})$ and $\operatorname{WRT}((w^{(2)}_{k})_{k\geq 1})$ .

Recall the discussion before Lemma 3.1. For $j\in\{1,2\}$ , let $J^{(j)}\geq 1$ be the smallest integer such that for all $k\geq J^{(j)}$ and for all $z\in\mathscr{E}^{\prime}$ we have $1+(e^{z}-1)\frac{w^{(j)}_{k}}{W^{(j)}_{k}}\neq 0$ . Then we can define for $k\geq J^{(j)}$ ,

[TABLE]

These processes are the martingales associated to the weighted profile of the trees $(\mathtt{T}_{k}^{(j)})_{k\geq 1}$ for $j\in\{1,2\}$ . Thanks to Lemma 3.1 (iii) those trees have respective distribution $\operatorname{WRT}((w^{(j)}_{k})_{k\geq 1})$ , for $j\in\{1,2\}$ and thanks to Lemma 3.1 (ii), those weight sequences satisfy ( $\square_{\gamma}^{p}$ ) almost surely. This allows us to apply Proposition 3.1, which entails that for $j\in\{1,2\}$ , the sequence of functions $(z\mapsto M^{(j)}_{k}(z))_{k\geq J^{(j)}}$ converges almost surely to an analytic limit $z\mapsto M_{\infty}^{(j)}(z)$ on the set $\mathscr{V}$ . Now we can write, for $n$ sufficiently large

[TABLE]

Using Lemma 3.1, we have almost surely for $j\in\{1,2\}$ ,

[TABLE]

Using the asymptotics $N_{n}^{(j)}\underset{n\rightarrow\infty}{=}\mu(T(u_{I^{(j)}}))\cdot n\cdot(1+o\mathopen{}\left(1\right))$ from Lemma 3.1 (i) we get

[TABLE]

From the a.s. convergence of the sequence of measures $(\mu_{n})_{n\geq 1}$ , see Theorem 1.2.2, we also get

[TABLE]

which entails that for $j\in\{1,2\}$ , uniformly on all compact subsets of $\mathscr{E}$ , we have the a.s. convergence

[TABLE]

where the limiting function $z\mapsto A_{j}(z)$ is analytic and only takes positive values on $\mathscr{E}\cap\mathbb{R}$ . Then, for any $z\in\mathscr{V}\cap\mathbb{R}$ , taking the limit $n\rightarrow\infty$ in (3.1) yields

[TABLE]

Now, thanks to Lemma 3.1, the function $z\mapsto M^{(1)}_{\infty}(z)$ can only have at most countably many zeros on $I_{\gamma}\subset\mathscr{V}\cap\mathbb{R}$ and for all $z\in I_{\gamma}$ , we have $M^{(2)}_{\infty}(z)>0$ almost surely. Then if we condition on the location of the zeros $z_{1},z_{2}\dots$ of $M^{(1)}_{\infty}$ on $I_{\gamma}$ , since $M^{(2)}_{\infty}$ is independent of $z_{1},z_{2}\dots$ , we have $M^{(2)}_{\infty}(z_{i})>0$ for all $i\geq 1$ almost surely. Hence $M_{\infty}$ has almost surely no zeros on $I_{\gamma}$ . ∎

Now let us prove Lemma 3.1 which we used in the preceding proof.

Proof of Lemma 3.1.

Point (i) follows just from Theorem 1.2.2 and Proposition 2.2.2 and the fact that for $j\in\{1,2\}$ we have $N^{(j)}_{n}=n\nu_{n}(T(u_{I^{(j)}}))$ .

Let us prove (ii). In order to do that we are going to prove that for $j\in\{1,2\}$ , we have

[TABLE]

Let us conclude from here: using the fact that $\boldsymbol{w}$ satisfies ( $\square_{\gamma}$ ), we get

[TABLE]

with a positive constant. We also have

[TABLE]

Because of (42), we have the following almost sure convergence $\frac{\tau_{2n}^{(j)}}{\tau_{n}^{(j)}}\rightarrow 2$ as $n\rightarrow\infty$ , hence almost surely for $n$ large enough we have $\tau_{2n}^{(j)}\leq 4\tau_{n}^{(j)}$ , so

[TABLE]

where in the two last inequalities we used the fact that $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ) and the almost sure linear growth of $\tau_{n}^{(j)}$ ensured by (42).

So it remains only to prove (42). Recall the proof of Theorem 1.2.2. For all $k\geq 1$ the process $(\mu_{n}(T(u_{k})))_{n\geq k}$ is a martingale and almost surely we have

[TABLE]

Using successively Lemma A and then Lemma 3.1, which applies because $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ),

[TABLE]

Using then Lemma A with $q=p$ and $\alpha=0$ and $\delta=(p-1)/2$ , we get that $\absolutevalue{\mu_{n}(T(u_{k}))-\mu(T(u_{k}))}=O\mathopen{}\left(n^{-\epsilon}\right)$ almost surely for some $\epsilon>0$ . Since this is true almost surely for all $k\geq 1$ , we use it with $k\in\{I^{(1)},I^{(2)}\}$ . As by definition for $j\in\{1,2\}$ we have $\mu(T(u_{I^{(j)}}))>0$ , we conclude that $\mu_{n}(T(u_{I^{(j)}}))\underset{n\rightarrow\infty}{\bowtie}\mu(T(u_{I^{(j)}}))$ .

Then, for any $k\geq 1$ , consider the process $\left(n\nu_{n}(T(u_{k}))-\sum_{i=k+1}^{n}\mu_{i}(T(u_{k}))\right)_{n\geq k}$ . It is easy to verify that this process is a martingale in its own filtration and that its increments are bounded by $1$ . Using again Lemma A with $q=2$ and $\alpha=1$ and $\delta=1$ , we get that $n^{-1}\absolutevalue{n\nu_{n}(T(u_{k}))-\sum_{i=k+1}^{n}\mu_{i}(T(u_{k}))}=O\mathopen{}\left(n^{-\epsilon}\right)$ for some $\epsilon>0$ . Using again that for $j\in\{1,2\}$ the limit $\mu(T(u_{I^{(j)}}))$ is almost surely positive, we can write $N_{n}^{(j)}=n\nu_{n}(T(u_{I^{(j)}}))\underset{n\rightarrow\infty}{\bowtie}\mu(T(u_{I^{(j)}}))\cdot n$ . Using the definition of $\tau_{n}^{(j)}$ , we can check that this entails that $\tau_{n}^{(j)}\underset{n\rightarrow\infty}{\bowtie}\mu(T(u_{I^{(j)}}))^{-1}\cdot n$ almost surely. This concludes the proof of (42) and so, (ii) is proved.

Let us now prove (iii). For any $k\geq 1$ , we consider the sequence $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{k})\right\}}\right)_{n\geq 1}$ that encodes the labels of the vertices above $u_{k}$ . Note that the limiting mass $\mu(T(u_{k}))$ can be computed from that sequence. Now, let us sequentially reveal $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{1})\right\}}\right)_{n\geq 1},\left(\mathbf{1}_{\left\{u_{n}\in T(u_{2})\right\}}\right)_{n\geq 1},\dots,\left(\mathbf{1}_{\left\{u_{n}\in T(u_{k})\right\}}\right)_{n\geq 1},\dots$ until we get to a $k$ for which $\mu(T(u_{k}))\in\mathopen{(}0\mathclose{}\mathpunct{},1\mathclose{)}$ . By definition, the first index for which it happens is $I^{(1)}$ .

Then we continue revealing the sequences $\left(\mathbf{1}_{\left\{u_{n}\in T(u_{k})\right\}}\right)_{n\geq 1}$ for $k>I^{(1)}$ but only for the $k$ ’s such that $u_{k}\notin T(u_{I^{(1)}})$ until we get to a $k$ for which $\mu(T(u_{k}))\in\mathopen{(}0\mathclose{}\mathpunct{},1\mathclose{)}$ . By definition, this second index is $I^{(2)}$ . Remark, and this is the key point of the argument, that after determining $I^{(1)}$ and $I^{(2)}$ in this way, the only information that we have about $T(u_{I^{(1)}})$ and $T(u_{I^{(2)}})$ is the list of labels of the vertices that belong to each of them (and the position of $u_{I^{(1)}}$ and $u_{I^{(2)}}$ ).

Now, conditionally on all this information, it is straightforward to see from the attachment dynamics that for any $j\in\{1,2\}$ , when the $i+1$ -st vertex attaches above $u_{I^{(j)}}$ at time $\tau_{i+1}^{(j)}$ , the label $K_{\tau_{i+1}^{(j)}}$ of the vertex to which it attaches is chosen among $\tau_{1}^{(j)},\tau_{2}^{(j)},\dots\tau_{i}^{(j)}$ with probability proportional to their respective weight $w_{\tau_{1}^{(j)}},w_{\tau_{2}^{(j)}},\dots w_{\tau_{i}^{(j)}}$ , independently for different choices of $i\geq 1$ and $j\in\{1,2\}$ . This finishes the proof of (iii) and hence that of the lemma. ∎

3.2 From the weighted to the unweighted sum.

Now we want to transfer these results of convergence to the Laplace transform of the real profile. Recall from (28) the definition of the sequence of functions $(z\mapsto N_{n}(z))_{n\geq 1}$ . We still assume until the end of Section 3.2 that $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ) for some $\gamma>0$ and $p\in\mathopen{(}1\mathclose{}\mathpunct{},2\mathclose{]}$ .

We introduce the following quantity, for $n\geq J$ ,

[TABLE]

The goal of this subsection is to show that the quantity $X_{n}(z)$ is negligible as $n\rightarrow\infty$ compared to any of the two terms in the difference, for $z$ contained in some subset of the complex plane. This way we will transfer the asymptotics that we have proved for $M_{n}(z)$ and $C_{n}(z)$ in the last section to asymptotics for $N_{n}(z)$ , which is the quantity that we want to study in the end. Let us start by proving a lemma.

Lemma \thetheorem.

The process $\left(X_{n}(z)\right)_{n\geq J}$ is a martingale with respect to $(\mathcal{F}_{n})_{n\geq 1}$ . Furthermore, for all $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ ,

[TABLE]

Proof.

This process is of course $(\mathcal{F}_{n})$ -adapted and integrable. For the martingale property we compute

[TABLE]

For $z\in\mathscr{E}$ and $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ , we make the following computation, using Lemma 3.1 and Lemma 3.1,

[TABLE]

and the last exponent reduces to $q\real\phi(z)\vee\phi(q\real z)$ because $(q\real\phi(z)+g(z,q))=\phi(q\real z)+1-q<\phi(q\real z)$ . Hence, using Lemma A, we get

[TABLE]

which finishes the proof of the lemma. ∎

Recall the definition of $z_{+}$ and $z_{-}$ in (12). Let us define the domain $\mathscr{D}$ to which we refer in the statement of Proposition 3 as the connected component of

[TABLE]

that contains $0$ , where $\mathscr{V}$ is defined in (33). In this way, $\mathscr{D}$ is a domain of $\mathbb{C}$ and $\mathscr{D}\cap\mathbb{R}=\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ . Indeed, first, $\mathscr{D}$ is open and connected by definition. Then recall from Lemma 3.1 that $\mathscr{V}\cap\mathbb{R}$ contains $I_{\gamma}=\left\{x\in\mathbb{R}\mathrel{}\middle|\mathrel{}1+\gamma(e^{x}-1-xe^{x})>0\right\}$ , an open interval which contains $0$ and has $z_{+}$ as its right endpoint. It remains to check that $\left\{z\in\mathbb{R}\mathrel{}\middle|\mathrel{}1+\real(\phi(z))>0\right\}=\mathopen{(}z_{-}\mathclose{}\mathpunct{},\infty\mathclose{)}$ and that $z_{-}\in I_{\gamma}$ .

For technical reasons, we also introduce the following subset of $\mathbb{C}$ , here identified as $\mathbb{R}\times\mathbb{R}$ ,

[TABLE]

on which the process $(z\mapsto M_{n}(z))_{n\geq J}$ , and hence also $(z\mapsto X_{n}(z))_{n\geq J}$ , are well-defined. Let us further decompose $\mathscr{D}^{\prime}$ into a union of open sets

[TABLE]

Lemma \thetheorem.

The following holds.

(i)

For all compact $K\subset\mathscr{D}$ there exists $\epsilon(K)>0$ such that almost surely

[TABLE]

(ii)

For all compact $K\subset\mathscr{D}^{\prime}$ , there exists $\epsilon(K)>0$ such that

[TABLE]

(iii)

For all compact $K\subset\mathscr{D}^{\prime}$ , there exists $\epsilon(K)>0$ such that almost surely

[TABLE]

Proof.

For the first one, for any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ we can apply Lemma A on the open set $\mathscr{V}_{q}\cap\left\{z\in\mathbb{C}\mathrel{}\middle|\mathrel{}1+\real(\phi(z))>0\right\}$ with $\alpha(z)=1+\real(\phi(z))>0$ and $\delta(z)=\min(q-1,-g(z,q))>0$ , thanks to Lemma 3.2. Then using the compactness of $K$ , (i) is true for every compact $K\subset\mathscr{V}\cap\left\{z\in\mathbb{C}\mathrel{}\middle|\mathrel{}1+\real(\phi(z))>0\right\}$ , hence for any compact $K\subset\mathscr{D}$ .

Let us prove point (ii). For any $q\in\mathopen{(}1\mathclose{}\mathpunct{},p\mathclose{]}$ , thanks to Lemma 3.1, on the open set $\mathscr{D}^{\prime}_{q}\subset\mathscr{E}$ we have $\mathbb{E}\left[\absolutevalue{M_{2n}(z)-M_{n}(z)}^{q}\right]=O_{\mathscr{D}^{\prime}_{q}}\mathopen{}\left(n^{(1-q)\vee g(z,q)+o_{\mathscr{D}^{\prime}_{q}}\mathopen{}\left(1\right)}\right)$ and

[TABLE]

Applying Lemma A for the martingale $(z\mapsto M_{n}(z))_{n\geq J}$ on any compact subset $K\subset\mathscr{D}^{\prime}_{q}$ with $\alpha(z)=\phi(\real z)-\real\phi(z)>0$ and $\delta(z)=\min(-1+q+q(\phi(\real z)-\real(\phi(z))),-g(\real z,q))>0$ yields:

[TABLE]

Using the estimates of Lemma 3.1, we have $\absolutevalue{C_{n}(z)}=O_{K}\mathopen{}\left(n^{\real\phi(z)}\right)$ , and so $\absolutevalue{C_{i}(z)M_{i}(z)}=O_{K}\mathopen{}\left(i^{\phi(\real z)-\epsilon(K)}\right)$ . Hence $\sum_{i=J}^{n-1}\absolutevalue{C_{i}(z)M_{i}(z)}=O_{K}\mathopen{}\left(n^{0\vee(1+\phi(\real z)-\epsilon(K))}\right)$ which finishes the proof of (ii).

Last, in order to prove (iii), we use Lemma A on $\mathscr{D}^{\prime}_{q}$ for the martingale $(z\mapsto X_{n}(z))_{n\geq J}$ with $\alpha(z)=1+\phi(\real z)>0$ and $\delta(z)=\min(-1+q+q(\phi(\real z)-\real(\phi(z))),-g(\real z,q))$ . ∎

In order to conclude, we will also need the following lemma, which is a direct consequence of Lemma 3.1.

Lemma \thetheorem.

For any compact $K\subset\mathscr{E}\cap\left\{z\in\mathbb{C}\mathrel{}\middle|\mathrel{}1+\real(\phi(z))>0\right\}$ , there exists $\epsilon(K)$ such that

[TABLE]

Proof.

On any compact $K\subset\mathscr{E}\cap\left\{z\in\mathbb{C}\mathrel{}\middle|\mathrel{}1+\real(\phi(z))>0\right\}$ , using Lemma 3.1 we write

[TABLE]

so that

[TABLE]

where in the second line, we use the fact that $\inf_{z\in K}(1+\real\phi(z))>0$ , and we define $\epsilon(K):=\epsilon\wedge\inf_{z\in K}(1+\real\phi(z))$ . This proves the lemma. ∎

We can now prove Proposition 3.

Proof of Proposition 3.

Let us start by proving simultaneously that for any $z\in\mathscr{D}$ , we almost surely have

[TABLE]

and also that both points (i) and (ii) of the proposition hold. For any compact $K\subset\mathscr{D}$ and $z\in K$ , we write

[TABLE]

The first term is $O_{K}\mathopen{}\left(n^{-\epsilon(K)}\right)$ thanks to Lemma 3.2 (i). We bound the second one by the following quantity

[TABLE]

In the above display, we used Lemma 3.2 and then Lemma 3.1 together with Proposition 3.1 on respectively the first and the second term. In the end, the whole expression is $O_{K}\mathopen{}\left(n^{-\epsilon(K)}\right)$ . From (43), it is clear that the limiting function $z\mapsto N_{\infty}(z)$ is analytic and has almost surely no zero on $\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ because of Lemma 3.1. For (iii), let us prove the stronger statement: for any compact subset $K\subset\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ and $0<a<\pi$ , there exists $\epsilon(K,a)>0$ such that almost surely,

[TABLE]

For this, we write

[TABLE]

We apply points (ii) and (iii) of Lemma 3.2 to the compact $K\times\mathopen{[}a\mathclose{}\mathpunct{},\pi\mathclose{]}$ and get the desired bound. ∎

3.3 Height of the tree

In this section, we study the behaviour of the height $\operatorname{ht}(\mathtt{T}_{n})$ of the tree $\mathtt{T}_{n}$ , which is defined as the maximal height of the vertices of $\mathtt{T}_{n}$ , i.e.

$$\operatorname{ht}(\mathtt{T}_{n}):=\max_{1\leq i\leq n}\operatorname{ht}(u_{i}).$$

We start by showing that under the assumption ( $\square_{\gamma}^{p}$ ) we have the convergence (15). Then, for the sake of completeness, we also study the simpler case where $\log n=o\mathopen{}\left(\sum_{i=1}^{n}\frac{w_{i}}{W_{i}}\right)$ .

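One key argument in our proofs is the following equality for the annealed moment generating function of the height of $u_{k}$ , for any fixed $k\geq 2$ , which can be seen as a corollary of Lemma 3.1:

$$\mathbb{E}\left[e^{z\operatorname{ht}(u_{k})}\right]=e^{z}\prod_{i=2}^{k-1}\left(1+(e^{z}-1)\frac{w_{i}}{W_{i}}\right).\tag{44}$$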

Some elementary computations using the Chernoff bound and the last display yield the following lemma.

Lemma 3.3.

Suppose that the sequence of weights $\boldsymbol{w}$ satisfies

[TABLE]

Then almost surely we have

[TABLE]

where $z_{+}(u)$ is the unique positive root of $u(ze^{z}-e^{z}+1)-1=0$ .

Proof.

Using the expression (44) for the moment generating function of $\operatorname{ht}(u_{n})$ we get, for any $z>0$

[TABLE]

where we use the inequality $(1+x)\leq e^{x}$ and the assumption on $\boldsymbol{w}$ . Then, for any $z>0$ and $n\geq 1$ ,

[TABLE]

If we take $z>0$ such that $u(ze^{z}-e^{z}+1)>1$ then the right-hand side is summable, and hence the Borel-Cantelli lemma shows that almost surely, for all $n$ large enough, we have $\operatorname{ht}(u_{n})\leq ue^{z}\log n$ . Letting $z\searrow z_{+}(u)$ , we get the result. ∎
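As a numerical illustration of the constant appearing in this lemma, the following sketch computes $z_{+}(u)$ by bisection, using only the facts that $z\mapsto ze^{z}-e^{z}+1$ vanishes at $0$ and is increasing on $(0,\infty)$. The function names `g`, `z_plus` and `height_constant` are hypothetical helpers introduced for this example and do not come from the paper.

```python
import math

def g(z: float) -> float:
    """The map z -> z e^z - e^z + 1, increasing on (0, infinity) with g(0) = 0."""
    return z * math.exp(z) - math.exp(z) + 1.0

def z_plus(u: float, tol: float = 1e-12) -> float:
    """Unique positive root of u * g(z) - 1 = 0, located by bisection."""
    if u <= 0:
        raise ValueError("u must be positive")
    lo, hi = 0.0, 1.0
    while u * g(hi) < 1.0:      # enlarge the bracket until u * g(hi) >= 1
        hi *= 2.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if u * g(mid) < 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def height_constant(u: float) -> float:
    """The constant u * exp(z_+(u)) bounding ht(u_n) / log n from above."""
    return u * math.exp(z_plus(u))
```

For $u=1$ the equation reads $ze^{z}-e^{z}=0$, so $z_{+}(1)=1$ and the constant equals $e$.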

Let us prove the last claim (15) of Theorem 1.2.1. Here we suppose that the weight sequence $\boldsymbol{w}$ satisfies ( $\square_{\gamma}^{p}$ ) for some $\gamma>0$ and some $p\in\mathopen{(}1\mathclose{}\mathpunct{},2\mathclose{]}$ .

Proof of (15).

Recall the asymptotics (14) in Theorem 1.2.1. It ensures that there almost surely exist vertices at height $\lfloor\gamma e^{z}\log n\rfloor$ , for any fixed $z\in\mathopen{(}z_{-}\mathclose{}\mathpunct{},z_{+}\mathclose{)}$ and $n$ large enough. Hence the height of the tree $\mathtt{T}_{n}$ satisfies

[TABLE]

For the limsup, we use Lemma 3.3 with $u=\gamma$ (this is justified by Lemma 3.1), which yields $\limsup_{n\rightarrow\infty}\frac{\operatorname{ht}(\mathtt{T}_{n})}{\log n}\leq\gamma e^{z_{+}}$ . ∎

To finish the section, we state a proposition.

Proposition.

Let $f(n):=\sum_{i=2}^{n-1}\frac{w_{i}}{W_{i}}$ . If $\log n=o\mathopen{}\left(f(n)\right)$ then we have the almost sure convergences

[TABLE]

Proof.

As we can check from its moment generating function (44), the random variable $\operatorname{ht}(u_{n})-1$ is a sum of independent Bernoulli random variables, with expectation $f(n)$ . Using standard bounds for $\operatorname{ht}(u_{n})-1$ yields

[TABLE]

which is summable in $n$ for any $\epsilon>0$ . The result of the proposition is then obtained using the Borel-Cantelli lemma. ∎
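The identity $\mathbb{E}\left[\operatorname{ht}(u_{n})\right]=1+f(n)$ used in this proof can be checked on small examples. The sketch below is only an illustration under stated assumptions: it computes the expected heights exactly from the recursion obtained by conditioning on the parent of $u_{n}$ (chosen with probability $w_{k}/W_{n-1}$), and compares them with $1+f(n)$. The helper names `expected_heights` and `f` are hypothetical, and the weights are assumed to be positive rationals.

```python
from fractions import Fraction

def expected_heights(weights):
    """Exact E[ht(u_n)] for n = 1..len(weights) in a weighted recursive tree,
    obtained by conditioning on the parent of u_n."""
    w = [Fraction(x) for x in weights]
    heights = [Fraction(0)]                 # ht(u_1) = 0
    for n in range(2, len(w) + 1):
        total = sum(w[: n - 1])             # W_{n-1}
        avg_parent = sum(w[k] * heights[k] for k in range(n - 1)) / total
        heights.append(1 + avg_parent)      # one more edge than the parent
    return heights

def f(weights, n):
    """f(n) = sum_{i=2}^{n-1} w_i / W_i, with 1-based indices as in the text."""
    w = [Fraction(x) for x in weights]
    return sum((w[i - 1] / sum(w[:i]) for i in range(2, n)), Fraction(0))
```

With weights $(1,2,3)$, for instance, the recursion gives $\mathbb{E}\left[\operatorname{ht}(u_{3})\right]=1+\frac{2}{3}$, which matches $1+\frac{w_{2}}{W_{2}}$.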

4 Preferential attachment trees are weighted recursive trees

In this section, we study preferential attachment trees with fitnesses $\mathbf{a}$ as defined in the introduction. First, in Section 4.1, we prove Theorem 1.1 which allows us to see them as weighted recursive trees $\operatorname{WRT}(\boldsymbol{\mathsf{w}}^{\mathbf{a}})$ for some random weight sequence $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ . Then in Section 4.2 we prove Proposition 1.1 which relates the asymptotic behaviour of $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ to the behaviour of $\mathbf{a}$ . Finally, in Section 4.3 we prove Proposition 4.3, which ensures that the sequence $\boldsymbol{\mathsf{m}}^{\mathbf{a}}$ obtained as the scaling limit of the degrees can be expressed as the increments of a Markov chain.

4.1 Coupling with a sequence of Pólya urns: proof of Theorem 1.1

Here we fix an arbitrary sequence $\mathbf{a}$ such that $a_{1}>-1$ and $\forall n\geq 2,\ a_{n}\geq 0$ . Let us recall the notation, for $n\geq 0$ ,

$$A_{n}:=\sum_{i=1}^{n}a_{i},$$

with the convention that $A_{0}=0$ . We consider a sequence of trees $(\mathtt{P}_{n})_{n\geq 1}$ evolving according to the distribution $\operatorname{PAT}(\mathbf{a})$ and we want to prove Theorem 1.1, namely that there exists a random sequence of weights $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ for which the sequence evolves as a $\operatorname{WRT}(\boldsymbol{\mathsf{w}}^{\mathbf{a}})$ . The proof uses a decomposition of this process into an infinite number of Pólya urns. This is very close to what is used in the proofs of [4, Theorem 2.1] or [8, Section 1.2] in similar settings. The novelty of our approach is to express this result using weighted recursive trees, since it allows us to apply all the results developed in the preceding section.

Pólya urns.

For us, a Pólya urn process $(\mathsf{Urn}(n))_{n\geq 0}=(X(n),\mathrm{Total}(n))_{n\geq 0}$ is a Markov chain on $E:=\left\{(x,z)\in\mathbb{R}_{+}\times\mathbb{R}_{+}^{*}\mathrel{}\middle|\mathrel{}x\leq z\right\}$ with transition probabilities given by the matrix $P$ where for all $(x,z)\in E$ ,

$$P\big((x,z),(x+1,z+1)\big)=\frac{x}{z}\qquad\text{and}\qquad P\big((x,z),(x,z+1)\big)=1-\frac{x}{z}.$$

The quantities $X(n)$ and $\mathrm{Total}(n)$ represent respectively the number of red balls and the total number of balls at time $n$ in an urn containing red and black balls, in which we add a ball at each time, the colour of which is chosen at random according to the current proportions in the urn. Starting at time $0$ from the state $(a,a+b)$ , i.e. with $a$ red balls and $b$ black balls, it is well-known that the sequence $(\Delta X(n))_{n\geq 1}=(X(n)-X(n-1))_{n\geq 1}$ of random variables is exchangeable, and an application of de Finetti’s representation theorem ensures that it has the same distribution as i.i.d. samples of Bernoulli random variables with a random parameter $\beta$ , which has distribution $\mathrm{Beta}(a,b)$ , where we use the convention that $\mathrm{Beta}(a,b)=\delta_{1}$ if $b=0$ .

Note that the process $(\mathsf{Urn}(n))_{n\geq 0}$ is entirely determined from $(\Delta X(n))_{n\geq 1}$ and that the random variable $\beta$ is a measurable function of the sequence $(\mathsf{Urn}(n))_{n\geq 0}$ because it can almost surely be obtained as $\beta=\lim_{n\rightarrow\infty}\frac{X(n)}{n}$ .
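The exchangeability and the Beta mixture representation recalled above can be verified exactly on short sequences of draws. The following sketch is an illustration only. It assumes rational parameters $a,b>0$ so that exact arithmetic applies, and the function names `sequence_probability` and `beta_mixture_probability` are hypothetical. It compares the probability that the urn produces a given sequence of increments with $\mathbb{E}\left[\beta^{r}(1-\beta)^{s}\right]$ for $\beta\sim\mathrm{Beta}(a,b)$, where $r$ and $s$ count the red and black balls added.

```python
from fractions import Fraction
from itertools import permutations

def sequence_probability(a, b, draws):
    """Probability that a Polya urn started from (a, a + b) produces the given
    sequence of increments (1 = red ball added, 0 = black ball added)."""
    red, total = Fraction(a), Fraction(a) + Fraction(b)
    prob = Fraction(1)
    for d in draws:
        prob *= red / total if d == 1 else 1 - red / total
        red += d
        total += 1
    return prob

def beta_mixture_probability(a, b, draws):
    """E[beta^r (1 - beta)^s] for beta ~ Beta(a, b), written with rising
    factorials: a^(r) b^(s) / (a + b)^(r + s)."""
    r = sum(draws)
    s = len(draws) - r

    def rising(x, m):
        out = Fraction(1)
        for j in range(m):
            out *= Fraction(x) + j
        return out

    return rising(a, r) * rising(b, s) / rising(Fraction(a) + Fraction(b), r + s)
```

Permuting the draws leaves `sequence_probability` unchanged, which is the exchangeability of $(\Delta X(n))_{n\geq 1}$ on finite windows.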

Nested structure of urns in the tree.

For all $k\geq 1$ we define the following process in $n\geq k$

[TABLE]

the "total fitness" of the vertices $\{u_{1},u_{2},\dots,u_{k}\}$ , for which we remark that for any $k\geq 1$ we have

[TABLE]

Imagine that $\mathtt{P}_{n}$ is constructed and we add a new vertex $u_{n+1}$ to the tree. We choose its parent in a downward sequential way:

•

we first determine whether the parent is $u_{n}$ , this happens with probability

[TABLE]

•

then with the complementary probability $\frac{W_{n-1}(n)}{W_{n}(n)}$ it is not, so conditionally on this we determine whether it is $u_{n-1}$ , this happens with (conditional) probability

[TABLE]

•

then with the complementary probability $\frac{W_{n-2}(n)}{W_{n-1}(n)}$ it is not, etc… We continue this process until we stop at some $u_{i}$ .

Now let us fix $k\geq 1$ and introduce the following time-change: for all $N\geq 0$ , we let

[TABLE]

be the $N$ -th time that a vertex is attached to one of the vertices $\{u_{1},\dots,u_{k+1}\}$ after time $k+1$ , where by definition, we have $\theta_{k}(0)=k+1$ . Remark that it can be the case that $\theta_{k}(N)$ is not defined for large $N$ , if there is only a finite number of vertices attaching to $\{u_{1},\dots,u_{k+1}\}$ . Let us ignore this possible problem for the moment, and only consider sequences $\mathbf{a}$ for which $A_{n}=O\mathopen{}\left(n\right)$ , for which this will almost surely not happen. In this case for all $N\geq 0$ we set

[TABLE]

Now, the three following facts are the key observations in order to prove Theorem 1.1:

(i)

for all $k\geq 1$ , the process $\mathsf{Urn}_{k}=\left(\mathsf{Urn}_{k}(N)\right)_{N\geq 0}$ has the distribution of a Pólya urn starting from the state $(A_{k}+k,A_{k+1}+k)$ ,

(ii)

these processes are jointly independent for $k\geq 1$ ,

(iii)

the whole sequence $(\mathtt{P}_{n})_{n\geq 1}$ is a function of the collection of processes $\left(\mathsf{Urn}_{k},\ k\geq 1\right)$ .

Point (i) already follows from the discussion above. A moment of thought shows that (ii) holds as well: of course the processes $(W_{k}(n),W_{k+1}(n))_{n\geq k+1}$ for different $k$ are not independent at all but the point is that they only interact through the time-changes $(\theta_{k}(\cdot),k\geq 1)$ . Last, for (iii), let us note that we can reconstruct the tree $\mathtt{P}_{n}$ at time $n$ from the random variables $(W_{i}(k))_{1\leq i,k\leq n}$ and that these random variables can be entirely determined using

[TABLE]

Reversing the construction and using the exchangeability.

Let us now reverse the construction and start with an independent family $(\mathsf{Urn}_{k},\ k\geq 1)$ of processes which have for each $k\geq 1$ the distribution of a Pólya urn starting from the state $(A_{k}+k,A_{k+1}+k)$ , so that they have the same joint distribution as the ones described in (i) and (ii). From what we did above, the sequence $(\mathtt{P}_{n})_{n\geq 1}$ that they determine through (iii) has distribution $\operatorname{PAT}(\mathbf{a})$ . A moment of thought shows that this argument actually still holds for a completely arbitrary sequence of fitnesses $\mathbf{a}$ .

Now, using de Finetti’s theorem, each of the processes $\mathsf{Urn}_{k}$ can be produced by sampling $\beta_{k}\sim\mathrm{Beta}(A_{k}+k,a_{k+1})$ and adding a red ball at each step independently with probability $\beta_{k}$ and a black ball with probability $1-\beta_{k}$ . This is of course done independently for different $k\geq 1$ .

In terms of our downward sequential procedure defined above for finding the parent of each newcomer, it amounts to saying that each time that we have to choose between attaching to $u_{k+1}$ or attaching to a vertex among $\{u_{1},\dots,u_{k}\}$ , the former is chosen with probability $1-\beta_{k}$ and the latter with probability $\beta_{k}$ . Let us verify that the law of $(\mathtt{P}_{n})_{n\geq 1}$ conditionally on the sequence $(\beta_{k})_{k\geq 1}$ is indeed that of a $\operatorname{WRT}$ with the random sequence of weights $\boldsymbol{\mathsf{w}}^{\mathbf{a}}$ from Theorem 1.1, which is defined from the sequence $(\beta_{k})_{k\geq 1}$ as

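$$\mathsf{W}^{\mathbf{a}}_{n}:=\prod_{k=1}^{n-1}\beta_{k}^{-1}\quad\text{for all }n\geq 2\qquad\text{and}\qquad\mathsf{w}^{\mathbf{a}}_{n}:=\mathsf{W}^{\mathbf{a}}_{n}-\mathsf{W}^{\mathbf{a}}_{n-1}\quad\text{for all }n\geq 1,$$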

with the convention that $\mathsf{W}^{\mathbf{a}}_{1}=1$ and $\mathsf{W}^{\mathbf{a}}_{0}=0$ . Let us reason conditionally on the sequence $(\beta_{k})_{k\geq 1}$ (or equivalently the sequence $(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}$ ). When determining the parent of $u_{n+1}$ , whose label we denote $J_{n+1}$ as in (2), we successively try to attach to $u_{n},u_{n-1},\dots$ until we stop at $u_{J_{n+1}}$ . Using the independence, we get that for every $k\in\{1,2,\dots,n\}$ ,

[TABLE]

This proves Theorem 1.1. Let us explain how Corollary 1.1 follows from the proof that we developed here. From the discussion in the previous paragraph, in the case of a sequence $\mathbf{a}$ for which $A_{n}=O\mathopen{}\left(n\right)$ , each of the processes $(\mathsf{Urn}_{k}(N))_{N\geq 0}$ for $k\geq 1$ is a measurable function of $(\mathtt{P}_{n})_{n\geq 1}$ , and hence the associated $\beta_{k}$ also is. In the end, the sequence $(\mathsf{w}^{\mathbf{a}}_{n})_{n\geq 1}$ is a measurable function of $(\mathtt{P}_{n})_{n\geq 1}$ and it is easy to check that it corresponds to the one described in the statement of Corollary 1.1.
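As a sanity check of this representation at the level of one-dimensional marginals, one can compare the exact law of the parent label $J_{n+1}$ in $\operatorname{PAT}(\mathbf{a})$ with the annealed law obtained from the weights, namely $\mathbb{E}\left[(1-\beta_{k-1})\prod_{i=k}^{n-1}\beta_{i}\right]$ for $k\geq 2$ and $\mathbb{E}\left[\prod_{i=1}^{n-1}\beta_{i}\right]$ for $k=1$, which factorises by independence. The sketch below is a minimal illustration under the assumption of rational fitnesses. The names `pat_parent_law` and `wrt_parent_law` are hypothetical, and the check concerns marginals only, not the joint law of the trees.

```python
from fractions import Fraction

def pat_parent_law(a, n):
    """Exact law of the parent label J_{n+1} in PAT(a), for n >= 1, where `a`
    lists (a_1, ..., a_n). Uses a recursion on the expected out-degrees."""
    a = [Fraction(x) for x in a]
    deg = [Fraction(1)] + [Fraction(0)] * (n - 1)   # expected degrees in P_2
    law = [Fraction(1)]                              # u_2 always attaches to u_1
    for m in range(2, n + 1):                        # parent of u_{m+1}
        total = sum(a[:m]) + (m - 1)                 # A_m + number of edges of P_m
        law = [(a[k] + deg[k]) / total for k in range(m)]
        for k in range(m):
            deg[k] += law[k]                         # expected degrees in P_{m+1}
    return law

def wrt_parent_law(a, n):
    """Annealed law of J_{n+1} under the random weights built from independent
    beta_k ~ Beta(A_k + k, a_{k+1}), using only the means of the beta_k."""
    a = [Fraction(x) for x in a]
    A = [sum(a[:k]) for k in range(n + 1)]           # A[k] = a_1 + ... + a_k
    mean_beta = {k: (A[k] + k) / (A[k + 1] + k) for k in range(1, n)}
    law = []
    for k in range(1, n + 1):
        p = Fraction(1) if k == 1 else 1 - mean_beta[k - 1]
        for i in range(k, n):
            p *= mean_beta[i]
        law.append(p)
    return law
```

For $\mathbf{a}=(-\frac{1}{2},2,0)$ both functions return $(\frac{1}{5},\frac{4}{5},0)$ as the law of $J_{4}$.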

4.2 Proof of Proposition 1.1

Let $(\mathsf{W}^{\mathbf{a}}_{n})_{n\geq 1}$ be the random sequence of cumulated weights defined in Theorem 1.1, whose distribution depends on a sequence $\mathbf{a}$ of fitnesses, and is expressed using a sequence of independent Beta-distributed random variables $(\beta_{k})_{k\geq 1}$ . We are going to prove Proposition 1.1, which relates the growth of $(\mathsf{W}^{\mathbf{a}}_{n})_{n\geq 1}$ to the one of $(A_{n})_{n\geq 1}$ .

Proof of Proposition 1.1.

As in [21, Proof of Lemma 1.1], we introduce

[TABLE]

It is easy to see that $X_{n}$ is a positive martingale, hence it almost surely converges to a limit $X_{\infty}$ as $n\rightarrow\infty$ . Now, using the fact that the $(\beta_{n})_{n\geq 1}$ are independent and that for any integer $q\geq 0$ , the $q$ -th moment of a random variable with $\mathrm{Beta}(a,b)$ distribution is given by

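$$\mathbb{E}\left[\beta^{q}\right]=\prod_{k=0}^{q-1}\frac{a+k}{a+b+k}\qquad\text{for }\beta\sim\mathrm{Beta}(a,b),\tag{50}$$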

we can compute

[TABLE]

Now from ( $H_{c}$ ), there exists $\epsilon>0$ such that $A_{n}=c\cdot n+O\mathopen{}\left(n^{1-\epsilon}\right)$ and without loss of generality we can assume that $\epsilon<1$ . For all $k\in\mathopen{\llbracket}0\mathclose{}\mathpunct{},p-1\mathclose{\rrbracket}$ we can write

[TABLE]

Hence

[TABLE]

In the end, since $\left(\prod_{k=0}^{p-1}\frac{1+A_{1}+k}{n+A_{n}+k-1}\right)=\operatorname{cst}\cdot\prod_{k=0}^{p-1}\frac{1}{(c+1)n+O\mathopen{}\left(n^{1-\epsilon}\right)}=\operatorname{cst}\cdot n^{-p}\cdot(1+O\mathopen{}\left(n^{-\epsilon}\right))$ , we get

[TABLE]

where $C_{p}$ is a positive constant which depends on the sequence $\mathbf{a}$ and $p$ . This entails that, under our assumptions, for any $p\geq 1$ , we have

[TABLE]

which shows that this martingale is bounded in $L^{p}$ for all $p\geq 1$ and hence it is uniformly integrable. Consequently, it converges a.s. and in $L^{p}$ to a limit random variable $X_{\infty}$ , with moments determined by

[TABLE]

Furthermore, we have

[TABLE]

Since $\beta_{n}\sim\mathrm{Beta}(n+A_{n},a_{n+1})$ , we get

[TABLE]

Using (53), (54), Lemma A and then summing over $n\leq k\leq 2n-1$ and using the fact that $\mathbf{a}$ satisfies ( $H_{c}$ ) we get that

[TABLE]

Using Lemma A, we get that almost surely, for any $\epsilon^{\prime}<\frac{1}{2}$ ,

[TABLE]

Since $\beta_{i}>0$ almost surely for every $i\geq 1$ , the event $\{X_{\infty}=0\}$ is a tail event for the filtration generated by the $\beta_{i}$ and has probability $0$ or $1$ . In the end, it has probability $0$ because $\mathbb{E}\left[X_{\infty}\right]=1$ . We deduce that

[TABLE]

Hence, we have,

[TABLE]

Whenever $a_{n}\leq n^{c^{\prime}+o\mathopen{}\left(1\right)}$ as $n\rightarrow\infty$ , we can show the following (we postpone the proof to the end of the section)

Lemma 4.2.

For any $\delta>0$ small enough, we have

[TABLE]

Since the last quantity is summable in $k$ we can use the Borel-Cantelli lemma (and a sequence of $\delta$ going to $0$ ) to show that almost surely $1-\beta_{k}\leq k^{-1+c^{\prime}+o_{\omega}(1)}$ as $k\rightarrow\infty$ , where the term $o_{\omega}(1)$ denotes a random function of $k$ that tends to $0$ when $k\rightarrow\infty$ . Combining this with (56), we finish proving the proposition by writing

[TABLE]

We finish by giving a proof of Lemma 4.2.

Proof of Lemma 4.2.

Let $x\geq 0$ and $y>1$ and let $X$ be a random variable with distribution $\mathrm{Beta}(x+1,y)$ and $Y$ with distribution $\mathrm{Beta}(x,1)$ , independent of $X$ . By standard results on Beta distributions, the product $Z=X\cdot Y$ has distribution $\mathrm{Beta}(x,y)$ .

Then for any $z\in\mathopen{[}0\mathclose{}\mathpunct{},1\mathclose{]}$ we have, using the explicit expression of the density of $X$ ,

[TABLE]

and the last display is increasing in $x$ . We are going to use this inequality for well-chosen sequences $(x_{n})$ , $(y_{n})$ and $(z_{n})$ taking the place of the values of $x,y,z$ . Let us first remark that for any two non-negative sequences $(x_{n})$ and $(y_{n})$ with $(y_{n})$ going to infinity and $x_{n}=o\mathopen{}\left(y_{n}\right)$ , we have the following estimate using Stirling’s approximation:

[TABLE]

Now let us apply the above computations for every $n\geq 1$ with $z_{n}:=n^{-1+c^{\prime}+\delta}$ to the random variables $(1-\beta_{n})$ which have distribution $\mathrm{Beta}\left(x_{n},y_{n}\right)$ , with $x_{n}:=a_{n+1}$ and $y_{n}:=A_{n}+n$ . In particular, in this context we have $x_{n}=a_{n+1}\leq(n+1)^{c^{\prime}+o\mathopen{}\left(1\right)}$ and $y_{n}=A_{n}+n=(n+1)^{1+o\mathopen{}\left(1\right)}$ , so that all of the above applies and

[TABLE]

which is what we wanted. ∎

4.3 The distribution of the limiting sequence

Let us stay in the setting of Section 4.2. Suppose that we are working with a sequence of fitnesses $\mathbf{a}$ that satisfies ( $H_{c}$ ) for some $c>0$ . The sequence $\left(\mathsf{M}_{n}^{\textbf{a}}\right)_{n\geq 1}$ is defined in (9) as some random multiple of the sequence $\left(\mathsf{W}_{n}^{\textbf{a}}\right)_{n\geq 1}$ , whose distribution is described in Theorem 1.1 from a sequence $(\beta_{n})_{n\geq 1}$ of independent random variables with $\beta_{n}\sim\mathrm{Beta}(A_{n}+n,a_{n+1})$ , so that for all $n\geq 1$ ,

[TABLE]

where the random variable $Z$ is the one that appears in (56), and depends on the whole sequence $(\beta_{n})_{n\geq 1}$ .

Proposition 4.3.

For any sequence $\mathbf{a}$ that satisfies the condition ( $H_{c}$ ), the sequence $\left(\mathsf{M}_{k}^{\textbf{a}}\right)_{k\geq 1}$ is a (possibly time-inhomogeneous) Markov chain such that for all $k\geq 1$ , $\mathsf{M}_{k+1}^{\textbf{a}}$ is independent of $\beta_{1},\beta_{2},\dots,\beta_{k}$ . The fact that for all $k\geq 1$ we have $\mathsf{M}_{k}^{\textbf{a}}=\beta_{k}\cdot\mathsf{M}_{k+1}^{\textbf{a}}$ with $\beta_{k}\sim\mathrm{Beta}(A_{k}+k,a_{k+1})$ independent of $\mathsf{M}_{k+1}^{\textbf{a}}$ characterises the backward transitions of the chain.

Proof.

We follow the same steps as [21, Lemma 1.1]. Recall the definition of the random variable $X_{\infty}$ as the limit of the sequence $(X_{n})_{n\geq 1}$ defined in (49), the definition (51) of the constant $C_{1}$ and their relation to the random variable $Z$ . We have

[TABLE]

It then follows that we can write, for $k\geq 1$ ,

[TABLE]

which ensures that $\mathsf{M}_{k+1}^{\mathbf{a}}$ is independent of $\beta_{1},\beta_{2},...,\beta_{k}$ . The limit in the last equality exists almost surely thanks to the results of the preceding section.

Now we prove the Markov property of the chain. Let $k\geq 1$ . Because of the definition of the chain as a product, the distribution of $\mathsf{M}_{k+1}^{\mathbf{a}}$ conditional on the past trajectory $\mathsf{M}_{1}^{\mathbf{a}},\mathsf{M}_{2}^{\mathbf{a}},\dots,\mathsf{M}_{k}^{\mathbf{a}}$ is the same as the distribution of $\mathsf{M}_{k+1}^{\mathbf{a}}$ conditional on $\mathsf{M}_{k}^{\mathbf{a}},\beta_{1},\dots,\beta_{k-1}$ . Since $\mathsf{M}_{k+1}^{\mathbf{a}}=\beta_{k}^{-1}\cdot\mathsf{M}_{k}^{\mathbf{a}}$ and since $\beta_{k}$ and $\mathsf{M}_{k}^{\mathbf{a}}$ are both independent of $\beta_{1},\dots,\beta_{k-1}$ , this conditional distribution corresponds to the one of $\mathsf{M}_{k+1}^{\mathbf{a}}$ conditional on the present state of the chain $\mathsf{M}_{k}^{\mathbf{a}}$ . ∎

Computing the moments.

In some cases where the sequence $\mathbf{a}$ is sufficiently regular, we can compute explicitly every moment of the random variable $\mathsf{M}^{\mathbf{a}}_{k}$ for every $k\geq 1$ . Indeed, using (52) and (57) and the independence, we get

[TABLE]

In general, if the collection $(\mu_{p})_{p\geq 1}$ of $p$ -th moments of some random variable satisfies the so-called Carleman’s condition: $\sum_{p=1}^{\infty}\mu_{2p}^{-1/(2p)}=\infty$ , then its distribution is uniquely determined from those moments.

5 Examples and applications

In this section, we compute the explicit distribution of $(\mathsf{M}_{n}^{\mathbf{a}})$ for some particular sequences $\mathbf{a}$ . We then describe some applications of our results to a model of Pólya urn with immigration and then to a model of preferential attachment graphs.

5.1 The limit chain for particular sequences $\mathbf{a}$

As stated in the preceding section, we can compute the distribution of $\mathsf{M}^{\mathbf{a}}_{k}$ for some fixed $k$ by the expression of its moments (4.3), provided that they satisfy Carleman’s condition. Knowing these distributions and the backward transitions given in Proposition 4.3 then characterises the law of the whole process. For two particular examples, this law has a nice expression.

Proposition 5.1.

In the two following cases, the distribution of the chain $(\mathsf{M}_{n}^{\mathbf{a}})$ is explicit.

(i)

If $\mathbf{a}$ is of the form $\mathbf{a}=(a,b,b,b,\dots)$ with $a>-1$ and $b>0$ , then the limiting sequence $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ is a Mittag-Leffler Markov chain $\operatorname{MLMC}\left(\frac{1}{b+1},\frac{a}{b+1}\right)$ .

(ii)

If $\mathbf{a}$ is of the form $\mathbf{a}=(a,b_{1},b_{2},\dots,b_{\ell},b_{1},b_{2},\dots b_{\ell},b_{1},\dots)$ , periodic of period $\ell$ starting from the second term, with $a>-1$ and non-negative integers $b_{1},b_{2},\dots,b_{\ell}$ , at least one of which is non-zero, then, letting $S=b_{1}+b_{2}+\dots+b_{\ell}$ , the sequence $\frac{\ell^{\frac{-\ell}{S+\ell}}}{S+\ell}\cdot(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ has the distribution of an Intertwined Product of Generalised Gamma Processes with parameters $(a,b_{1},b_{2},\dots,b_{\ell})$ , which we denote $\mathrm{IPGGP}(a,b_{1},b_{2},\dots,b_{\ell})$ .

Note that the two cases (i) and (ii) are not mutually exclusive. We will prove the two points of this proposition in separate subsections. The proper definitions of the distributions to which we refer in the statement are given along the proof.

5.1.1 Mittag-Leffler Markov chains

Let us study the case where the underlying preferential attachment tree has a sequence of fitnesses $\mathbf{a}$ of the form $(a,b,b,b,\dots)$ . We start by recalling the definitions of Mittag-Leffler distributions and Mittag-Leffler Markov chains, introduced in [21] and also studied in [25].

Mittag-Leffler distributions.

Let $0<\alpha<1$ and $\theta>-\alpha$ . The generalized Mittag-Leffler $\mathrm{ML}(\alpha,\theta)$ distribution has $p$ -th moment

[TABLE]

and the collection of $p$ -th moments for $p\in\mathbb{N}$ uniquely characterizes this distribution thanks to Carleman’s criterion.

Mittag-Leffler Markov Chains.

For any $0<\alpha<1$ and $\theta>-\alpha$ , we introduce the (a priori) inhomogeneous Markov chain $(\mathsf{M}^{\alpha,\theta}_{n})_{n\geq 1}$ , the distribution of which we call the Mittag-Leffler Markov chain of parameters $(\alpha,\theta)$ , or $\operatorname{MLMC}(\alpha,\theta)$ . This type of Markov chain was already defined in [21], for some choice of parameters $\alpha$ and $\theta$ . It is a Markov chain such that for any $n\geq 1$ ,

[TABLE]

and the transition probabilities are characterised by the following equality in law:

[TABLE]

with $B_{n}\sim\mathrm{Beta}\left(\frac{\theta+n-1}{\alpha}+1,\frac{1}{\alpha}-1\right)$ , independent of $\mathsf{M}_{n+1}^{\alpha,\theta}$ . These chains are constructed (for some values of $\theta$ depending on $\alpha$ ) in [21]. In fact, our proof of Proposition 5.1 (i) ensures that these chains exist for any choice of parameters $0<\alpha<1$ and $\theta>-\alpha$ . Let us mention that the proof of [21, Lemma 1.1] is still valid for the whole range of parameters $0<\alpha<1$ and $\theta>-\alpha$ , which proves that these Markov chains are in fact time-homogeneous. We provide, in a later paragraph, another proof of this time-homogeneity using an argument that relies on preferential attachment trees.

The limiting Markov chain is a Mittag-Leffler Markov chain.

Recall the definition of the sequence $(\beta_{k})_{k\geq 1}$ and their respective distributions $\beta_{k}\sim\mathrm{Beta}(A_{k}+k,a_{k+1})$ . From our assumption that $\mathbf{a}=a,b,b,b\dots$ we have for all $k\geq 1$ ,

[TABLE]

Proof of Proposition 5.1 (i).

For $p\geq 1$ , we can make the following computation, using (50), one change of indices and several times the property of the Gamma function that for any $z>0$ we have $\Gamma\left(z+1\right)=z\Gamma\left(z\right)$ :

[TABLE]

Using Stirling’s formula, we can then compute the numbers $C_{p}$ introduced in (51),

[TABLE]

Using (4.3), the moments of $\mathsf{M}_{k}$ are given, for any $p\in\mathbb{N}$ by the formula:

[TABLE]

Using (59), these moments identify the distribution of $\mathsf{M}_{k}^{\mathbf{a}}$ for all $k\geq 1$ ,

[TABLE]

From this, and the form of the backward transitions, we can identify $(\mathsf{M}_{k}^{\mathbf{a}})_{k\geq 1}$ as having a distribution $\mathrm{MLMC}\left(\frac{1}{b+1},\frac{a}{b+1}\right)$ . ∎

Time-homogeneity of MLMC.

Let us keep the notation from the previous paragraph with a sequence $\mathbf{a}=a,b,b,b\dots$ and let us show the time-homogeneity of the corresponding Mittag-Leffler Markov chain $(\mathsf{M}^{\mathbf{a}}_{k})_{k\geq 1}\sim\mathrm{MLMC}\left(\frac{1}{b+1},\frac{a}{b+1}\right)$ using its connection with preferential attachment trees.

For any $x>-1$ , consider the sequence $\boldsymbol{x}=x,b,b,b\dots$ and $(\mathtt{P}^{x}_{n})_{n\geq 1}\sim\operatorname{PAT}(\boldsymbol{x})$ in such a way that, using Theorem 1.1,

[TABLE]

By choosing $x$ appropriately, we can make $(\mathsf{M}_{1}^{\boldsymbol{x}},\mathsf{M}_{2}^{\boldsymbol{x}})$ have the distribution of any of the couples $(\mathsf{M}_{k},\mathsf{M}_{k+1})$ for $k\geq 1$ . Thus, in order to prove the time-homogeneity of the transitions, it suffices to prove that the conditional distribution of $\mathsf{M}_{2}^{\boldsymbol{x}}$ with respect to $\mathsf{M}_{1}^{\boldsymbol{x}}$ does not depend on $x$ .

Recall from Section 1.2.2 in the introduction that we see $(\mathtt{P}_{n}^{x})_{n\geq 1}$ as an increasing sequence of plane trees, defined as subsets of $\mathbb{U}$ . Also recall that for any $u\in\mathbb{U}$ , we denote $T(u)$ the subtree descending from $u$ . At every time $n\geq 1$ , we can consider the sequence $(\#(\mathtt{P}_{n}^{x}\cap T(1)),\#(\mathtt{P}_{n}^{x}\cap T(2)),\dots)$ , which counts the number of vertices in the subtrees descending from the children of $u_{1}=\emptyset$ in $\mathtt{P}_{n}^{x}$ , in order of creation (completed by a sequence of zeros). We can check that this sequence evolves as $n$ grows with the same distribution as the number of customers sitting at different tables in a Chinese Restaurant Process with seating plan $(\frac{1}{b+1},\frac{x}{b+1})$ , see [41, Section 3.2] for a definition.

Then, conditionally on the evolution of this sequence, every time that a vertex is added to one of those subtrees, it is attached to any vertex already present in the subtree with probability proportional to its out-degree plus $b$ (and in particular this does not depend on the value of $x$ ).

Thanks to [41, Corollary 3.9], two Chinese Restaurant Processes with respective seating plan $(\frac{1}{b+1},\frac{x}{b+1})$ and $(\frac{1}{b+1},\frac{x^{\prime}}{b+1})$ with $x,x^{\prime}>-1$ have a density with respect to each other and this density is a function of the scaling limit of the number of tables created in the process, which corresponds in our case to $\mathsf{M}_{1}^{\boldsymbol{x}}$ .

These observations allow us to conclude that the distribution of $(\mathtt{P}^{x}_{n})_{n\geq 1}$ for any $x>-1$ has a positive density with respect to $(\mathtt{P}^{0}_{n})_{n\geq 1}$ , and this density is a function of $\mathsf{M}_{1}^{\boldsymbol{x}}$ . From here, it is clear that conditionally on $\mathsf{M}_{1}^{\boldsymbol{x}}$ , the distribution of the quantity $(\mathsf{M}_{2}^{\boldsymbol{x}}-\mathsf{M}_{1}^{\boldsymbol{x}})=\lim_{n\rightarrow\infty}\deg_{\mathtt{P}_{n}^{x}}^{+}(u_{2})$ does not depend on $x$ , which concludes the argument.
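The correspondence with the Chinese Restaurant Process used in this argument comes down to a one-step computation: in $\operatorname{PAT}(x,b,b,\dots)$ a subtree of the root containing $m$ vertices carries total weight $mb+(m-1)$, while the root, with $k$ children, carries weight $x+k$. The sketch below compares the resulting attachment probabilities with the usual $(\alpha,\theta)$ seating rule for $\alpha=\frac{1}{b+1}$ and $\theta=\frac{x}{b+1}$. It is an illustration under the assumption of rational parameters, and both function names are hypothetical.

```python
from fractions import Fraction

def pat_subtree_probabilities(x, b, subtree_sizes):
    """In PAT(x, b, b, ...), probabilities that the next vertex joins each
    existing subtree of the root, followed by the probability that it becomes
    a new child of the root."""
    x, b = Fraction(x), Fraction(b)
    k = len(subtree_sizes)                            # children of the root
    weights = [m * b + (m - 1) for m in subtree_sizes]  # fitnesses + out-degrees
    root_weight = x + k
    total = root_weight + sum(weights)
    return [w / total for w in weights] + [root_weight / total]

def crp_probabilities(alpha, theta, table_sizes):
    """Seating plan (alpha, theta): join a table of size m with probability
    (m - alpha) / (n + theta), open a new one with (theta + k alpha) / (n + theta)."""
    alpha, theta = Fraction(alpha), Fraction(theta)
    n, k = sum(table_sizes), len(table_sizes)
    joins = [(m - alpha) / (n + theta) for m in table_sizes]
    return joins + [(theta + k * alpha) / (n + theta)]
```

Both functions return the same list when called with `alpha = 1/(b+1)` and `theta = x/(b+1)`, whatever the current subtree sizes.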

5.1.2 Products of generalised Gamma.

The following paragraphs aim at proving Proposition 5.1 (ii). In the first and second paragraphs we define the families of distributions of $\mathrm{GGP}$ - and $\mathrm{IPGGP}$ -processes. Some special cases of these processes already appeared in [38, 36]. In the third one we prove that the distribution of $(\mathsf{M}_{k}^{\mathbf{a}})_{k\geq 1}$ belongs to this family whenever the sequence $\mathbf{a}$ is of the form assumed in Proposition 5.1 (ii).

Construction of a $\mathrm{GGP}(z,r)$ -process.

For $z,r>0$ real numbers, let $(Z_{i})_{i\geq 1}$ be a family of independent variables with the following distribution:

[TABLE]

where, for any $u>0$ , the distribution $\mathrm{Gamma}(u)$ has density $x\mapsto\frac{x^{u-1}e^{-x}}{\Gamma(u)}\mathbf{1}_{\left\{x>0\right\}}$ with respect to the Lebesgue measure. Then for all $k\geq 1$ we define $\mathsf{G}_{k}$ as,

[TABLE]

We say that the process $(\mathsf{G}_{k})_{k\geq 1}$ has the distribution of a Generalised Gamma process with parameters $(z,r)$ which we denote $\mathrm{GGP}(z,r)$ .

Let us note that, using standard distributional equalities with Gamma and Beta distributions, for every $k\geq 1$ , we have $(\mathsf{G}_{k})^{r}\sim\mathrm{Gamma}\left(k-1+\frac{z}{r}\right)$ and

[TABLE]

and $V_{k}^{1/r}$ is independent of $\mathsf{G}_{k+1}$ . In fact, we can further show that $V_{1},V_{2},\dots,V_{k},\mathsf{G}_{k+1}$ are jointly independent with the corresponding distribution and that this characterizes the finite dimensional marginals of this process.

Remark 5.1.2.

For $z=r$ , the process $(\mathsf{G}_{k})_{k\geq 1}$ has exactly the distribution of the points of a Poisson process on $\mathopen{(}0\mathclose{}\mathpunct{},\infty\mathclose{)}$ with intensity $r\cdot t^{r-1}\differential t$ , listed in increasing order.
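The description in this remark suggests a simple way to sample such points: since the intensity $r\cdot t^{r-1}\differential t$ gives mass $t^{r}$ to $(0,t]$, the points can be obtained by applying $t\mapsto t^{1/r}$ to the arrival times of a unit-rate Poisson process, whose $k$-th arrival time is $\mathrm{Gamma}(k)$-distributed. The sketch below follows this standard mapping argument. It is not taken from the paper, the function names are hypothetical, and the Monte Carlo estimate is only a rough consistency check against $(\mathsf{G}_{k})^{r}\sim\mathrm{Gamma}(k)$.

```python
import random

def poisson_points(r, k, rng):
    """First k points, in increasing order, of a Poisson process on (0, infinity)
    with intensity r * t**(r - 1) dt, obtained by mapping a unit-rate Poisson
    process through t -> t**(1 / r)."""
    points, s = [], 0.0
    for _ in range(k):
        s += rng.expovariate(1.0)        # arrival times of a unit-rate process
        points.append(s ** (1.0 / r))
    return points

def mean_of_rth_power(r, k, samples=20000, seed=0):
    """Monte Carlo estimate of E[(G_k)^r], to be compared with the mean k of
    a Gamma(k) random variable."""
    rng = random.Random(seed)
    total = sum(poisson_points(r, k, rng)[-1] ** r for _ in range(samples))
    return total / samples
```

For instance, `mean_of_rth_power(2.5, 3)` should be close to $3$, up to Monte Carlo error.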

Intertwined Products of $\mathrm{GGP}$ -processes.

Let $a>-1$ and $b_{1},b_{2},\dots,b_{\ell}$ be non-negative integers with at least one being non-zero. We let $B_{r}:=\sum_{s=1}^{r}b_{s}$ for all $0\leq r\leq\ell$ , with the convention that $B_{0}:=0$ . We also let $S=B_{\ell}$ . Then we define the set

[TABLE]

Start with independent $\mathrm{GGP}$ -processes $\left\{\mathsf{G}^{(q)}\mathrel{}\middle|\mathrel{}q\in\mathcal{S}\right\}$ indexed by $\mathcal{S}$ such that for all $q\in\mathcal{S}$ ,

[TABLE]

Now $\mathsf{G}=(\mathsf{G}_{k})_{k\geq 1}$ is defined in such a way that for all $n\geq 1$ and $1\leq r\leq\ell$ we have

[TABLE]

The process $(\mathsf{G}_{k})_{k\geq 1}$ defined above is said to have the distribution of an Intertwined Product of Generalized Gamma Processes with parameters $(a,b_{1},b_{2},\dots,b_{\ell})$ , denoted $\mathrm{IPGGP}(a,b_{1},b_{2},\dots,b_{\ell})$ . Its finite-dimensional marginals can be obtained in the same way as was done in the preceding paragraph for Generalized Gamma processes.

Identification of the limiting chain.

Fix $\ell\geq 1$ and $b_{1},b_{2},\dots,b_{\ell}\geq 0$ some integers (where at least one is non-zero) and suppose that the sequence $\mathbf{a}$ has the following form,

[TABLE]

meaning that the sequence is periodic with period $\ell$ starting from the second term, with $a>-1$ .

For any $j\geq 0$ and $1\leq r\leq\ell$ we have

[TABLE]

for the $(\beta_{k})_{k\geq 1}$ as defined in Theorem 1.1. For any $j\geq 1,p\geq 1$ , we use the moments (50) of a Beta random variable and a telescoping argument to write

[TABLE]

Using the last display, we get that for any $n\geq 1$ ,

[TABLE]

Using Stirling’s approximation we get

[TABLE]

Hence, recalling the definition of $C_{p}$ in (51), we get

[TABLE]

Then using (4.3) with $c=S/\ell$ ,

[TABLE]

Using the last display and the fact that a random variable with distribution $\mathrm{Gamma}(u)$ has $p$ -th moment equal to $\frac{\Gamma\left(u+p\right)}{\Gamma\left(u\right)}$ , we can identify the distribution of the one-dimensional marginals $\frac{\ell^{\frac{\ell}{S+\ell}}}{S+\ell}\cdot\mathsf{M}_{k}^{\mathbf{a}}$ for any $k\geq 1$ with those of the process described in (63). The identification of the distribution of the process $\frac{\ell^{\frac{\ell}{S+\ell}}}{S+\ell}\cdot(\mathsf{M}_{k}^{\mathbf{a}})_{k\geq 1}$ as $\mathrm{IPGGP}(a,b_{1},b_{2},\dots,b_{\ell})$ is then obtained by comparing their finite-dimensional distributions, which are characterized by Proposition 4.3 and, respectively, by (63) together with the discussion below (62).
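The moment formula for Gamma random variables used in this identification is easy to check numerically. The sketch below is only an illustration; the helper name `gamma_moment` is hypothetical, and $\mathrm{Gamma}(u)$ is taken with shape $u$ and unit scale, as in the text.

```python
import math

def gamma_moment(u, p):
    """p-th moment of a Gamma(u) variable (shape u, scale 1):
    Gamma(u + p) / Gamma(u), computed on the log scale for stability."""
    return math.exp(math.lgamma(u + p) - math.lgamma(u))

# Sanity check on an integer case: Gamma(5) / Gamma(3) = 24 / 2 = 12,
# which matches variance + mean**2 = 3 + 9 for a Gamma(3) variable.
print(gamma_moment(3.0, 2.0))
```

Working with `math.lgamma` rather than `math.gamma` avoids overflow when $u+p$ is large.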

Sparse sequences.

Let us treat a particular example of parameters $a,b_{1},b_{2},\dots b_{\ell}$ for which the distribution $\mathrm{IPGGP}(a,b_{1},b_{2},\dots,b_{\ell})$ has a simpler description than in the general case. Suppose that only one of the parameters $b_{1},b_{2},\dots,b_{\ell}$ is non-zero, say $b_{\ell}$ for example. Keeping the notation introduced above, the corresponding set $\mathcal{S}$ contains only $b_{\ell}$ elements, namely $\mathcal{S}=\{\ell,\ell+1,\dots,\ell+b_{\ell}-1\}$ . Following the definition (63), the process $(\mathsf{G}_{k})_{k\geq 1}$ with distribution $\mathrm{IPGGP}(a,0,0,\dots,0,b_{\ell})$ is constant on every interval $\mathopen{\llbracket}(n-1)\ell+1\mathclose{}\mathpunct{},n\ell\mathclose{\rrbracket}$ for any integer $n\geq 1$ , and the process $(\mathsf{G}_{(n-1)\ell+1})_{n\geq 1}$ is just given by a product of $b_{\ell}$ independent $\mathrm{GGP}$ -processes

[TABLE]

where for all $\ell\leq q\leq\ell+b_{\ell}-1$ , the process $(\mathsf{G}^{(q)}_{k})_{k\geq 1}$ has distribution $\mathrm{GGP}(a+q,\ell+b_{\ell})$ .

In the particular case where $a=1$ and $(b_{1},b_{2},\dots,b_{\ell-1},b_{\ell})=(0,0,\dots,0,1)$ , the picture is even simpler because the last display becomes a product over only one term. We can check using Remark 5.1.2 that the process $(\mathsf{G}_{(k-1)\ell+1})_{k\geq 1}$ then has exactly the distribution of the points of a Poisson process on $\mathopen{(}0\mathclose{}\mathpunct{},\infty\mathclose{)}$ with intensity $(\ell+1)t^{\ell}\differential t$ , listed in increasing order, which was already noted in [36, Remark 2].

5.2 Application to Pólya urns with immigration

Define the following generalisation of Pólya’s urn, which depends on a sequence of numbers $(a_{n})_{n\geq 1}$ : start at time $1$ with an urn containing $a_{1}$ red balls. At every time $n\geq 2$ , we sample a ball uniformly at random from the urn and return it to the urn with $1$ additional ball of the same colour, plus an immigration of $a_{n}$ additional white balls. The outcome of the first step being deterministic, it is equivalent to consider that we start at time $2$ with $a_{1}+1$ red balls and $a_{2}$ white balls in the urn, so that we allow ourselves to consider any (possibly negative) value $a_{1}>-1$ . This model was studied in the sequence of papers [37, 38, 36] in specific cases of periodic immigration, and also in [2] for a larger class of periodic immigration.
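The urn dynamics described above can be sketched in a few lines. This is an illustration under stated assumptions rather than code from the paper: the function name `polya_urn_with_immigration` is hypothetical, ball counts are stored as floats so that non-integer immigration is allowed, and the immigration values $a_{n}$ for $n\geq 2$ are assumed non-negative.

```python
import random

def polya_urn_with_immigration(a, rng):
    """Simulate the urn for times 1..len(a), where a[n - 1] stands for a_n.

    Returns [R_1, ..., R_N], the number of red balls at each time.
    """
    red, white = float(a[0]), 0.0
    history = [red]
    for n in range(2, len(a) + 1):
        # The draw at time 2 is deterministic: the urn holds red balls only.
        if n == 2 or rng.random() * (red + white) < red:
            red += 1.0
        else:
            white += 1.0
        white += a[n - 1]  # immigration of a_n white balls
        history.append(red)
    return history

# Illustrative run with a_1 = 0.5 and constant immigration a_n = 1.
reds = polya_urn_with_immigration([0.5] + [1.0] * 999, random.Random(0))
print(reds[-1])
```

Treating the first draw separately mirrors the remark that the first step is deterministic, and it keeps the sketch valid for $-1<a_{1}\leq 0$.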

Denote by $R_{n}$ the number of red balls in the urn at time $n$ and let us state a scaling limit result for $R_{n}$ as $n\rightarrow\infty$ . We also identify the speed of convergence and the Gaussian fluctuations around the limit, provided that the immigration is sufficiently regular.

Recall from the introduction the assumption ( $H_{c}$ ) defined for a real number $c>0$ . We introduce the following more precise assumption of the same type, for any $c>0$ and $\delta>0$ .

[TABLE]

Note that for any $0<\delta<\frac{1}{2}$ , this assumption is satisfied by periodic sequences $\mathbf{a}$ , and almost surely satisfied by sequences of i.i.d. non-negative random variables with a finite second moment.

Proposition.

Assume that the sequence $\mathbf{a}=(a_{n})_{n\geq 1}$ satisfies ( $H_{c}$ ) for some $c>0$ . Then for $D_{n}:=n^{-\frac{1}{c+1}}\cdot R_{n}$ ,

(i) we have the following almost sure convergence,

$$D_{n}\underset{n\rightarrow\infty}{\longrightarrow}D_{\infty},$$

where $D_{\infty}$ has the same law as $\mathsf{M}_{1}^{\mathbf{a}}$ , defined in (9).

(ii) if moreover $\mathbf{a}$ satisfies ( $H_{c}^{\delta}$ ) for some $\delta>\frac{1}{2(c+1)}$ , then we have

[TABLE]

Remark.

If the sequence $(a_{n})_{n\geq 1}$ has one of the particular forms treated in Proposition 5.1 of the previous section, we can identify the distribution of the limiting random variable as being Mittag-Leffler or a product of independent generalised Gamma random variables. This gives us an alternative proof for the similar statement [1, Theorem 3.8].

Proof.

Let $(\mathtt{P}_{n})_{n\geq 1}$ be a sequence of trees with distribution $\operatorname{PAT}(\mathbf{a})$ and let $R_{n}:=a_{1}+\deg^{+}_{\mathtt{P}_{n}}(u_{1})$ . With this definition, the sequence $(R_{n})_{n\geq 1}$ has exactly the same distribution as the number of red balls in a Pólya urn with immigration driven by the sequence $\mathbf{a}$ .
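To make this identification concrete, here is a minimal sketch that grows a preferential attachment tree and records $R_{n}=a_{1}+\deg^{+}_{\mathtt{P}_{n}}(u_{1})$. The function name `pat_root_weight` is hypothetical, and the fitnesses $a_{n}$ for $n\geq 2$ are assumed non-negative. The weight of $u_{1}$ plays the role of the red balls, and the total weight of all other vertices plays the role of the white balls.

```python
import random

def pat_root_weight(a, rng):
    """Grow a tree with law PAT(a), where a[n - 1] stands for a_n, and return
    [R_1, ..., R_N] with R_n = a_1 + (number of children of u_1 at time n)."""
    # weights[k] = fitness of u_{k+1} plus its current number of children
    weights = [float(a[0])]
    history = [weights[0]]
    for n in range(2, len(a) + 1):
        if n == 2:
            parent = 0  # u_1 is the only possible parent of u_2
        else:
            parent = rng.choices(range(n - 1), weights=weights)[0]
        weights[parent] += 1.0           # the chosen parent gains one child
        weights.append(float(a[n - 1]))  # u_n arrives with no children
        history.append(weights[0])
    return history

# Illustrative run with a_1 = 0.5 and a_n = 1 for n >= 2.
print(pat_root_weight([0.5] + [1.0] * 499, random.Random(1))[-1])
```

The sketch recomputes cumulative weights at every step, which is quadratic in the number of vertices; this is acceptable for illustration but not for large simulations.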

If the sequence $\mathbf{a}$ satisfies our assumption ( $H_{c}$ ) for some $c>0$ then using (10) we can write the following almost sure convergence

[TABLE]

where the sequence $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ is defined in (9), so this proves (i).

Let us turn to the proof of (ii). We will prove this convergence in two steps, by first proving some corresponding result for the degree of the first vertex in a $\operatorname{WRT}$ , and then using Theorem 1.1 and Proposition 1.1 to transfer the result to the corresponding $\operatorname{PAT}$ distribution. Indeed, let $(\mathtt{T}_{n})_{n\geq 1}$ be a sequence of trees with distribution $\operatorname{WRT}(\boldsymbol{w})$ with a sequence $\boldsymbol{w}$ satisfying the following assumption

[TABLE]

for some $\gamma\in\mathopen{(}0\mathclose{}\mathpunct{},1\mathclose{)}$ . In this context, recalling (19), the degree of the first vertex can be written as

[TABLE]

Now, using our assumption on the sequence $(W_{n})_{n\geq 1}$ we get $\frac{1}{W_{n}}=\frac{1-\gamma}{n^{\gamma}}+o\mathopen{}\left(n^{-\frac{\gamma}{2}-\frac{1}{2}}\right)$ , so that

[TABLE]

Rearranging the terms, we get

[TABLE]

and using the Lindeberg-Feller theorem (see [17, Theorem 3.4.5] for example), we get that the latter expression converges in distribution when $n\rightarrow\infty$ to a Gaussian distribution $\mathcal{N}(0,w_{1})$ . Recalling that $n^{-(1-\gamma)}\cdot\deg^{+}_{\mathtt{T}_{n}}(u_{1})\rightarrow w_{1}$ a.s. as $n\rightarrow\infty$ , we can also write using Slutsky’s lemma

[TABLE]
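The first-order behaviour $n^{-(1-\gamma)}\cdot\deg^{+}_{\mathtt{T}_{n}}(u_{1})\rightarrow w_{1}$ can be illustrated numerically. The sketch below makes an assumption that is not in the text: it takes the specific cumulated weights $W_{k}=k^{\gamma}/(1-\gamma)$, so that $w_{1}=W_{1}=1/(1-\gamma)$ and each vertex $u_{i}$ attaches to $u_{1}$ independently with probability $w_{1}/W_{i-1}=(i-1)^{-\gamma}$. Both function names are hypothetical.

```python
import random

def wrt_first_vertex_degree(gamma, n, rng):
    """Out-degree of u_1 at time n in a WRT with the illustrative choice
    W_k = k**gamma / (1 - gamma), hence w_1 = 1 / (1 - gamma)."""
    w1 = 1.0 / (1.0 - gamma)
    degree = 0
    for i in range(2, n + 1):
        W_prev = (i - 1) ** gamma / (1.0 - gamma)
        # u_i picks u_1 as its parent with probability w_1 / W_{i-1}
        if rng.random() < w1 / W_prev:
            degree += 1
    return degree

def expected_first_vertex_degree(gamma, n):
    """Mean of the degree above: the sum of the attachment probabilities."""
    return sum((i - 1) ** (-gamma) for i in range(2, n + 1))

gamma, n = 0.5, 100_000
w1 = 1.0 / (1.0 - gamma)
# Ratio of the mean degree to w_1 * n**(1 - gamma); it approaches 1 as n grows.
print(expected_first_vertex_degree(gamma, n) / (w1 * n ** (1 - gamma)))
# One random realisation, rescaled by n**(1 - gamma), to compare with w_1.
print(wrt_first_vertex_degree(gamma, n, random.Random(0)) / n ** (1 - gamma), w1)
```

A single realisation fluctuates around $w_{1}$ at scale $n^{-(1-\gamma)/2}$, which is the order of the Gaussian fluctuations discussed above.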

Now let us transfer this result to the case of preferential attachment trees. For this, it suffices to prove that if $\mathbf{a}$ satisfies the condition ( $H_{c}^{\delta}$ ) with $\delta>\frac{1}{2(c+1)}$ , then the corresponding sequence $(\mathsf{M}^{\mathbf{a}}_{n})_{n\geq 1}$ defined in (9) almost surely satisfies (64) for $\gamma=\frac{c}{c+1}$ . From Proposition 1.1 and the definition of $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ as a scaled version of $(\mathsf{W}_{n}^{\mathbf{a}})_{n\geq 1}$ , we know that we have $\mathsf{M}_{n}^{\mathbf{a}}\underset{n\rightarrow\infty}{=}\frac{1}{1-\gamma}\cdot n^{\gamma}\cdot(1+O\mathopen{}\left(n^{-\epsilon}\right))$ almost surely, for $\gamma=\frac{c}{c+1}$ and some $\epsilon>0$ . Going along the proof of Proposition 1.1, we get from (4.2) that

[TABLE]

for any $\zeta<\delta\wedge\frac{1}{2}$ , so that (64) is almost surely satisfied by $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ if

[TABLE]

Now, thanks to Theorem 1.1, conditionally on the sequence $(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ the distribution of $(\mathtt{P}_{n})_{n\geq 1}$ is $\operatorname{WRT}((\mathsf{m}^{\mathbf{a}}_{n})_{n\geq 1})$ . Applying (65) in this case finishes the proof of (ii).

∎

5.3 Applications to some other models of preferential attachment

Let us present here another model of preferential attachment which appears in the literature, for example in [36]. This model does not produce a tree as ours does, but we can couple them in such a way that some of their features coincide. We only focus on one particular graph model here, but the method can be adapted to other similar models.

A model of $(m,\alpha)$ -preferential attachment

Let $\mathtt{S}$ be a non-empty graph with vertex set $\{v_{1}^{(1)},\dots,v_{1}^{(k)}\}$ , whose vertices have degrees $(d_{1},\dots d_{k})$ , let $m\geq 2$ be an integer and let $\alpha>-m$ be a real number such that $\alpha+d_{i}>0$ for all $1\leq i\leq k$ . The model is then the following: we let $\mathtt{G}_{1}=\mathtt{S}$ . Then, at any time $n\geq 1$ , the graph $\mathtt{G}_{n+1}$ is constructed from the graph $\mathtt{G}_{n}$ by:

• adding a new vertex labelled $v_{n+1}$ with $m$ outgoing edges,

• choosing sequentially to which other vertex each of these edges is pointed, each vertex being chosen with probability proportional to $\alpha$ plus its degree (the degrees of the vertices are updated after each edge creation).

The degree of a vertex in a graph refers in this section to the number of edges incident to it. Here the growth procedure in fact produces multigraphs, in which it is possible for two vertices to be connected to each other by more than one edge. In this case, all those edges contribute to the count of their degrees.
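The growth rule above can be sketched as follows. This is an illustration only: the function name `m_alpha_preferential_attachment` is hypothetical, the seed graph is represented solely by its degree sequence, and the new vertex is excluded from the possible targets of its own edges, in line with the wording "other vertex".

```python
import random

def m_alpha_preferential_attachment(seed_degrees, m, alpha, steps, rng):
    """Degree sequence after `steps` growth steps, starting from a seed graph
    whose vertices have degrees `seed_degrees`."""
    degrees = list(seed_degrees)
    for _step in range(steps):
        old = len(degrees)   # number of vertices present before the new one
        degrees.append(0)    # the new vertex, not yet connected
        for _edge in range(m):
            # Only older vertices are eligible; weights use current degrees.
            weights = [alpha + d for d in degrees[:old]]
            target = rng.choices(range(old), weights=weights)[0]
            degrees[target] += 1  # updated before the next edge is drawn
            degrees[old] += 1     # each edge also counts for the new vertex
    return degrees

# Illustrative run: seed made of two vertices joined by a double edge.
degs = m_alpha_preferential_attachment([2, 2], m=2, alpha=-1.0, steps=50,
                                       rng=random.Random(0))
print(sum(degs), max(degs))
```

Every edge raises the total degree by $2$, so the sum of the degrees after $n$ steps is the seed total plus $2mn$, which is a convenient consistency check.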

We can couple this model to a preferential attachment tree with sequence of fitnesses $\mathbf{a}$ defined as:

[TABLE]

where $w(\mathtt{S}):=d_{1}+d_{2}+\dots+d_{k}+k\alpha$ .

Indeed, we can construct $(\mathtt{P}_{n})$ with distribution $\operatorname{PAT}(\mathbf{a})$ . Then, for any $n\geq 1$ , consider the tree $\mathtt{P}_{1+m(n-1)}$ and for all $2\leq i\leq n$ , merge each vertex with fitness $m+\alpha$ together with the $m-1$ vertices with fitness $0$ that arrived just before it. If $\mathtt{G}_{1}$ only contains one vertex, it is immediate that the obtained sequence of graphs has exactly the same distribution as $(\mathtt{G}_{n})_{n\geq 1}$ . For general seed graphs $\mathtt{S}$ , we can still use the same construction and the obtained sequence of graphs has the same evolution as some sequence $(\widetilde{\mathtt{G}}_{n})_{n\geq 1}$ which would be obtained from $(\mathtt{G}_{n})_{n\geq 1}$ by merging all the vertices $\{v_{1}^{(1)},\dots,v_{1}^{(k)}\}$ into a unique vertex $v_{1}$ .

Note that a similar construction would also be possible if the degrees of the vertices $v_{2},v_{3},\dots$ were given by a sequence of integers $(m_{2},m_{3},\dots)$ instead of all being equal to some constant value $m$ . This is for example the case in the model studied in [14], where the degrees are random.

We have the following convergence for degrees of vertices in the graph, as $n\rightarrow\infty$ .

Proposition.

The following convergence holds almost surely in any $\ell^{p}$ with $p>2+\frac{\alpha}{m}$ :

[TABLE]

where

[TABLE]

and the process $(\mathsf{N}_{n})_{n\geq 1}$ is independent of $(B^{(1)},B^{(2)},\dots,B^{(k)})$ .

Furthermore, whenever $\alpha\in\mathbb{Z}$ with $\alpha>-m$ or $m=1$ then the distribution of $(\mathsf{N}_{n})_{n\geq 1}$ is explicit and given by:

• if $\alpha\in\mathbb{Z}$ with $\alpha>-m$ , then

[TABLE]

• if $m=1$ , then

[TABLE]

This result strengthens the one of [36, Theorem 1, Theorem 2 and Proposition 1] which corresponds (up to some definition convention) to the case $\alpha=1-m$ . We emphasize that the convergence here is almost sure in an $\ell^{p}$ space.

Proof of Proposition 5.3.

Using the coupling argument, we know that we can construct jointly the sequence of graphs $(\mathtt{G}_{n})_{n\geq 1}$ and a sequence of trees $(\mathtt{P}_{n})_{n\geq 1}\sim\operatorname{PAT}(\mathbf{a})$ with fitness sequence

[TABLE]

in such a way that for every $n\geq 1$ , the sequence

[TABLE]

coincides with

[TABLE]

Using this connection and Theorem 1.1, Proposition 1.1 and Proposition 2.1 we get

[TABLE]

almost surely in $\ell^{p}$ for all $p>2+\frac{\alpha}{m}$ , for some random sequence $(\mathsf{N}_{n})_{n\geq 1}$ . Note that the time-change between $(\mathtt{G}_{n})_{n\geq 1}$ and $(\mathtt{P}_{n})_{n\geq 1}$ is responsible for an extra factor in the scaling, so that the sequence $(\mathsf{N}_{n})_{n\geq 1}$ has the distribution of $m^{\frac{m}{2m+\alpha}}\cdot(\mathsf{M}_{n}^{\mathbf{a}})_{n\geq 1}$ . In the case $\alpha\in\mathbb{Z}$ or $m=1$ , Proposition 5.1 identifies the distribution of the limiting sequence.

Last, the convergence of $\frac{1}{\sum_{j=1}^{k}\deg_{\mathtt{G}_{n}}(v^{(j)}_{1})}\cdot(\deg_{\mathtt{G}_{n}}(v^{(1)}_{1}),\deg_{\mathtt{G}_{n}}(v^{(2)}_{1}),\dots,\deg_{\mathtt{G}_{n}}(v^{(k)}_{1}))$ just follows from the classical result of convergence for the proportion of balls in a Pólya urn. ∎

Appendix A Technical proofs and results

This appendix contains the proofs of technical results that are used throughout this paper. Let us start by stating a useful conditional version of the Borel-Cantelli lemma.

Lemma.

Let $(\mathcal{F}_{n})$ be a filtration and let $(B_{n})_{n\geq 1}$ be a sequence of events adapted to this filtration. For all $n\geq 1$ , let $\mathsf{p}_{n}:=\mathbb{P}\left(B_{n}\mathrel{}\middle|\mathrel{}\mathcal{F}_{n-1}\right)$ . We have

[TABLE]

and also

[TABLE]

Proof.

The first convergence is the content of Theorem 5.4.11 and the second one is an application of Theorem 5.4.9 to the martingale $\left(\sum_{i=1}^{n}(\mathbf{1}_{B_{i}}-\mathsf{p}_{i})\right)_{n\geq 1}$ , both taken from [17]. ∎

The following lemma is a rewriting of [6, Lemma 1]. We provide the proof for completeness.

Lemma.

Let $(M_{n})_{n\geq 1}$ be a complex-valued martingale with finite $q$ -th moment for some $q\in\mathopen{[}1\mathclose{}\mathpunct{},2\mathclose{]}$ . Then for every $n\geq 1$ we have

[TABLE]

Proof.

Let $X_{n+1}:=M_{n+1}-M_{n}$ and let $X_{n+1}^{\prime}$ be a random variable such that conditionally on $(M_{1},\dots,M_{n})$ the random variable $X_{n+1}^{\prime}$ is independent of, and has the same distribution as $X_{n+1}$ . Then

[TABLE]

where the first equality comes from the fact that $\mathbb{E}\left[X_{n+1}^{\prime}\mathrel{}\middle|\mathrel{}M_{1},\dots M_{n+1}\right]=0$ . The first inequality is the one of Jensen for conditional expectation, applied to the convex function $z\mapsto\absolutevalue{z}^{q}$ . The second inequality is due to Clarkson, see [51, Lemma 1], and can be applied because the distribution of $X_{n+1}-X_{n+1}^{\prime}$ conditional on $M_{n}$ is symmetric and $1\leq q\leq 2$ . The last inequality comes from the triangle inequality for the $L^{q}$ -norm. ∎

Let us state another result about martingales, which we use numerous times throughout the paper. Recall our uniform big- $O$ and small- $o$ notation, introduced in (30).

Lemma.

Suppose that $(z\mapsto Z_{n}(z))_{n\geq 1}$ is a sequence of analytic functions on some open set $\mathscr{O}\subset\mathbb{C}$ , adapted to some filtration $(\mathcal{G}_{n})$ . Suppose that for every $z\in\mathscr{O}$ , the sequence $(Z_{n}(z))_{n\geq 1}$ is a martingale with respect to the filtration $(\mathcal{G}_{n})$ . If there exist a parameter $q>1$ and continuous functions $\alpha:\mathscr{O}\rightarrow\mathbb{R}$ and $\delta:\mathscr{O}\rightarrow\mathopen{(}0\mathclose{}\mathpunct{},\infty\mathclose{)}$ such that for all $n\geq 1$ we have

[TABLE]

then for any compact subset $K\subset\mathscr{O}$ , there exists $\epsilon(K)>0$ such that

(i) if $\alpha>0$ on $\mathscr{O}$ we have $n^{-\alpha(z)}\cdot\absolutevalue{Z_{n}(z)-Z_{1}(z)}=O_{K}\mathopen{}\left(n^{-\epsilon(K)}\right)$ almost surely and also in expectation,

(ii) if $\alpha\leq 0$ on $\mathscr{O}$ , the almost sure limit $Z_{\infty}(z)$ exists for $z\in\mathscr{O}$ and we have $n^{-\alpha(z)}\cdot\absolutevalue{Z_{n}(z)-Z_{\infty}(z)}=O_{K}\mathopen{}\left(n^{-\epsilon(K)}\right)$ almost surely and also in expectation.

Proof of Lemma A.

First, without loss of generality, we can consider that the term $o_{\mathscr{O}}\mathopen{}\left(1\right)$ is identically equal to $0$ , otherwise we just replace the function $z\mapsto\delta(z)$ by $z\mapsto\frac{1}{2}\cdot\delta(z)$ . Second, by compactness, it is sufficient to prove the result for a small disk around each $x\in K$ . Since $\mathscr{O}$ is an open set, let $\rho>0$ be such that $\mathrm{D}(x,2\rho)\subset\mathscr{O}$ , where $\mathrm{D}(x,2\rho)$ is the closed disk in the complex plane with centre $x$ and radius $2\rho$ . We denote

[TABLE]

and choose $\rho$ small enough so that $\underline{\alpha}-\overline{\alpha}+\frac{1}{q}\underline{\delta}>0$ . Then if we let $\xi:\mathopen{[}0\mathclose{}\mathpunct{},2\pi\mathclose{]}\rightarrow\mathbb{C}$ such that $\xi(t)=x+2\rho e^{it}$ , we have for any $n$ and $m$ , using the Cauchy formula

[TABLE]

Now,

[TABLE]

Applying successively Jensen’s inequality and Doob’s maximal inequality in $L^{q}$ gives us, for every $z\in\mathrm{D}(x,\rho)$ :

[TABLE]

So using (A), Fubini’s theorem and (A), we get

[TABLE]

Now let us treat the two cases $\alpha>0$ and $\alpha\leq 0$ separately. Remark that the quantity $\left(\overline{\alpha}-\frac{1}{q}\cdot\underline{\delta}\right)$ is negative when $\alpha\leq 0$ , but can be of any sign in the case $\alpha>0$ .

$\bullet$ For $\alpha>0$ and $r\geq 0$ , we let

[TABLE]

Using (A), we have

[TABLE]

Thanks to our assumptions, the number $\beta:=\underline{\alpha}-0\vee(\overline{\alpha}-\frac{1}{q}\cdot\underline{\delta})$ is positive. Using Markov’s inequality and the last display yields

[TABLE]

which is summable, so the Borel-Cantelli lemma ensures that $A_{r}=O\mathopen{}\left(2^{-\frac{\beta}{2}r}\right)$ almost surely as $r\rightarrow\infty$ . Now for any $n\geq 1$ , there is a unique integer $r_{n}$ such that $2^{r_{n}}\leq n<2^{r_{n}+1}$ , namely $r_{n}=\lfloor\log_{2}(n)\rfloor$ , and we write

[TABLE]

which proves point (i), because almost surely $A_{r_{n}}=O\mathopen{}\left(n^{-\frac{\beta}{2}}\right)$ as $n\rightarrow\infty$ .
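The passage from the dyadic scale back to $n$ uses only the bracketing $2^{r_{n}}\leq n<2^{r_{n}+1}$ and the positivity of $\beta$ . Spelled out as a short worked step, starting from the almost sure bound $A_{r}=O\mathopen{}\left(2^{-\frac{\beta}{2}r}\right)$ obtained above:

```latex
n < 2^{r_n + 1}
\;\Longrightarrow\;
2^{r_n} > \frac{n}{2}
\;\Longrightarrow\;
2^{-\frac{\beta}{2} r_n} < 2^{\frac{\beta}{2}} \cdot n^{-\frac{\beta}{2}},
\qquad\text{hence}\qquad
A_{r_n} = O\left(2^{-\frac{\beta}{2} r_n}\right) = O\left(n^{-\frac{\beta}{2}}\right)
\quad\text{as } n \to \infty .
```

The constant $2^{\frac{\beta}{2}}$ does not depend on $n$ , so it is absorbed in the big- $O$ .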

$\bullet$ For $\alpha\leq 0$ , the reasoning is similar so we use the same notation for slightly different quantities. For any integer $r\geq 0$ we let

[TABLE]

Then, thanks to (A), we have

[TABLE]

and thanks to our assumption the number $\beta:=\underline{\alpha}-\overline{\alpha}+\frac{1}{q}\cdot\underline{\delta}$ is positive. Using the same arguments as in the case $\alpha>0$ we have $A_{r}=O\mathopen{}\left(2^{-\frac{\beta}{2}r}\right)$ almost surely as $r\rightarrow\infty$ and taking $r_{n}:=\lfloor\log_{2}(n)\rfloor$ yields

[TABLE]

Again we almost surely have $A_{r_{n}}=O\mathopen{}\left(n^{-\frac{\beta}{2}}\right)$ as $n\rightarrow\infty$ . This ensures that the sequence of functions $(z\mapsto Z_{n}(z))_{n\geq 1}$ is almost surely a Cauchy sequence for the uniform convergence on the disc $\mathrm{D}(x,\rho)$ (so that its limit $z\mapsto Z_{\infty}(z)$ is well-defined on the disk) and that (ii) is satisfied. ∎

Finally, let us give a proof of Lemma 3.1.

Proof of Lemma 3.1.

From the assumption, we know that there exists $\epsilon>0$ such that $W_{n}=\operatorname{cst}\cdot n^{\gamma}+O\mathopen{}\left(n^{\gamma-\epsilon}\right)$ as $n\rightarrow\infty$ . Without loss of generality, we can assume that $\epsilon<1$ . Then it is immediate that $w_{n}=W_{n}-W_{n-1}=O\mathopen{}\left(n^{\gamma-\epsilon}\right)$ . Then

[TABLE]

and the first point follows by summing over intervals of the type $\mathopen{\llbracket}n2^{k}\mathclose{}\mathpunct{},n2^{k+1}\mathclose{\rrbracket}$ .

Now write

[TABLE]

Since $\frac{w_{i}}{W_{i}}\rightarrow 0$ as $i\rightarrow\infty$ , we get

[TABLE]

Putting everything together, we get

[TABLE]

Last, just remark that $\log W_{n}=\log(\operatorname{cst}\cdot n^{\gamma}\cdot(1+O\mathopen{}\left(n^{-\epsilon}\right)))=\gamma\log n+\operatorname{cst}+O\mathopen{}\left(n^{-\epsilon}\right),$ which finishes the proof. ∎

Bibliography


[1] Cyril Banderier, Philippe Marchal and Michael Wallner “Periodic Pólya urns and an application to Young tableaux” In 29th International Conference on Probabilistic, Combinatorial and Asymptotic Methods for the Analysis of Algorithms 110, LIPIcs. Leibniz Int. Proc. Inform. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2018, pp. Art. No. 11, 13
[2] Cyril Banderier, Philippe Marchal and Michael Wallner “Periodic Pólya Urns, the Density Method, and Asymptotics of Young Tableaux”, 2019 arXiv:1912.01035 [math.PR]
[3] Albert-László Barabási and Réka Albert “Emergence of Scaling in Random Networks” In Science 286.5439, 1999, pp. 509–512 DOI: 10.1126/science.286.5439.509
[4] Noam Berger, Christian Borgs, Jennifer T. Chayes and Amin Saberi “Asymptotic behavior and distributional limits of preferential attachment graphs” In The Annals of Probability 42.1, 1992, pp. 1–40 DOI: 10.1214/12-AOP755
[5] Shankar Bhamidi “Universal techniques to analyze preferential attachment trees: Global and local analysis” In Preprint available at http://www. unc. edu/ bhamidi , 2007
[6] John D. Biggins “Uniform Convergence of Martingales in the Branching Random Walk” In The Annals of Probability 20.1, 1992, pp. 137–151 DOI: 10.1214/aop/1176989921
[7] Patrick Billingsley “Convergence of probability measures”, Wiley series in probability and statistics. Probability and statistics section New York: Wiley, 1999
[8] Benjamin Bloem-Reddy and Peter Orbanz “Preferential Attachment and Vertex Arrival Times”, 2017 arXiv:1710.02159 [math.PR]
