Geometry of weighted recursive and affine preferential attachment trees
Delphin S\'enizergues

TL;DR
This paper analyzes the geometry of weighted recursive and affine preferential attachment trees, establishing their equivalence and deriving asymptotic properties of various tree statistics.
Contribution
It shows that affine preferential attachment trees can be represented as weighted recursive trees with random weights and proves convergence results for key tree metrics.
Findings
Affine preferential attachment trees are distributionally equivalent to weighted recursive trees with random weights.
Established almost sure scaling limits for degree sequences, height, and profile of the trees.
Proved weak convergence of measures associated with the tree structure.
Abstract
We study two models of growing recursive trees. For both models, initially the tree only contains one vertex and at each time a new vertex is added to the tree and its parent is chosen randomly according to some rule. In the \emph{weighted recursive tree}, we choose the parent of among with probability proportional to , where is some deterministic sequence that we fix beforehand. In the \emph{affine preferential attachment tree with fitnesses}, the probability of choosing any is proportional to , where denotes its current number of children, and the sequence of \emph{fitnesses} is deterministic and chosen as a parameter of the model. We show that for any sequence , the corresponding preferential attachment tree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\DeclareSourcemap\maps
[datatype=bibtex] \map[overwrite=true] \step[fieldsource=fjournal] \step[fieldset=journal, origfieldval]
Geometry of weighted recursive and affine preferential attachment trees
Delphin Sénizergues
Abstract
We study two models of growing recursive trees. For both models, initially the tree only contains one vertex and at each time a new vertex is added to the tree and its parent is chosen randomly according to some rule. In the weighted recursive tree, we choose the parent of among with probability proportional to , where is some deterministic sequence that we fix beforehand. In the affine preferential attachment tree with fitnesses, the probability of choosing any is proportional to , where denotes its current number of children, and the sequence of fitnesses is deterministic and chosen as a parameter of the model.
We show that for any sequence , the corresponding preferential attachment tree has the same distribution as some weighted recursive tree with a random sequence of weights (with some explicit distribution). We then prove almost sure scaling limit convergences for some statistics associated with weighted recursive trees as time goes to infinity, such as degree sequence, height, profile and also the weak convergence of some measures carried on the tree. Thanks to the connection between the two models, these results also apply to affine preferential attachment trees.
1 Introduction
The uniform recursive tree has been introduced in the 70’s as an example of random graphs constructed by addition of vertices: starting from a tree with a single vertex, the vertices arrive one by one and the -th vertex picks its parent uniformly at random from the already present vertices. Many properties of this tree were then investigated due to its particularly simple dynamics: number of leaves, profile, height, degrees, size of subtrees and others. We refer to the survey [48] and the more recent book [16, Section 6] for an overview of the results obtained for this model.
We consider a generalisation of the uniform recursive tree called the weighted recursive tree (WRT), which was introduced in [9] in 2006. In this model, each vertex is assigned a non-negative weight, constant in time. When a newcomer randomly picks its parent, it does so with probability proportional to those weights. Although more general than the uniform recursive tree, WRT’s have attracted far fewer contributions, see e.g. [31, 24]. In [31] those trees are studied because of their connection to a model of random walk with preferential relocation (a.k.a. "monkey walk"). The authors prove some limiting results for the distribution of the weight of vertices at different heights in the tree, for different assumptions on the weight sequence which cover a wide range of behaviours.
In this paper, we prove asymptotic results for this model about the degree sequence, the height, the profile and the convergence of some probability measures carried on the tree, mainly under some assumptions that ensure that the sequence describing the weights of the vertices in order of creation behaves roughly as a power of . Our deepest result is the one that concerns the asymptotic behaviour of the profile of the tree, which is the function that maps each integer to the number of vertices in the tree at height . Both the statement and the proof of this result are inspired from the work carried out in the last 20 years for different models of logarithmic trees, see [10, 11, 49, 46, 28]. They rely on the study of the Laplace transform of the profile using tools that ultimately date back to Biggins [6] in the context of the branching random walk, together with a Fourier inversion argument, which in our case is handled by a very precise theorem of [28]. The rest of our results and proofs on WRT’s are less involved and mostly rely on more elementary arguments, as well as a connection with Pemantle’s time-dependent Pólya urns, introduced in [40].
We will also consider another model of trees which we call the affine preferential attachment tree (PAT) with fitnesses. In this one, every vertex has a fixed fitness, and the probability of picking any vertex to be the parent of a newcomer is proportional to its fitness plus its current number of children.
The term "preferential attachment", coined by Barabási and Albert in [3], refers to the property that a vertex in the graph that has a high degree tends to increase its degree even more over time, also referred to as a "rich-get-richer" effect. Many different preferential attachment mechanisms have then been studied in the last two decades because the degree distribution that emerges from this type of construction shares some quantitative properties with real-world networks, see [50, 34] for good overviews of the vast literature on this subject.
In our case, one of our motivations for studying those trees arises from the analysis of some growing random graphs, developed in the companion paper [47]. The class of models that we study there is designed to encompass Rémy’s algorithm, described in [43], which creates a sequence of binary trees, and a lot of its natural generalizations, studied in [20, 32, 12, 22, 23, 44]. In particular, we show that the sequences graphs obtained using these constructions, considered as metric spaces, almost surely converge in the so-called Gromov–Hausdorff–Prokhorov scaling limit towards a limiting random continuous metric space. This proof relies on a decomposition of our graphs along the structure of a tree, whose evolution is that of an affine preferential attachment tree with fitnesses. Notably, a crucial result that is needed in this argument is a uniform control over the degree of all the vertices the tree, which we prove in this paper.
Let us note that only a few contributions in the literature concern this particular model, where the fitness can depend on the vertex. In the case where the fitnesses are i.i.d., the model is considered for the first time in [18] and the first rigorous mathematical result can be found in [5]. Very recently, still in the case of i.i.d. fitnesses, it has been studied in more detail in [30] along with some other similar models. The authors study the asymptotic degree distribution and maximum degree in the tree and show that these can exhibit different behaviours according to the tail of the fitness distribution, which the authors classify as weak, strong and extreme disorder. Let us also mention two models that do not fall in our setting but are somewhat related, studied in [14] and [8], in which the reinforcement is affine in the degree of the vertices but there is some inhomogeneity between vertices. Instead of coming from different fitnesses associated to vertices like in our model, this inhomogeneity is introduced using a random initial degree, or respectively a random time of creation.
Our approach for studying this model relies on the connection between the PAT and the WRT models (this was already known in the field in the case of constant fitnesses but stated in a slightly different form, see [8, 4]). Indeed we shall see that using a de Finetti-type argument, a PAT can be seen as a WRT with a random sequence of weights that almost surely decays like a power of . This enables us to translate all of the results obtained for WRT’s to corresponding results for PAT’s, and hence prove asymptotics for degrees, height and profile of the tree. In particular, we prove the almost sure scaling limit convergence of the sequence of degrees of the vertices in the tree in an norm. For some regular sequences of fitnesses, we can explicitly describe the distribution of the limiting sequence using Beta, Gamma and Mittag-Leffler distributions. This relates in various ways to other results that can be found in the literature associated to preferential attachment trees or to urn models, contained in [33, 26, 37, 25, 38, 36, 35, 2].
1.1 Two related models of growing trees
Definitions.
For any sequence of non-negative real numbers with , we define the distribution on sequences of growing rooted labelled trees111In fact, in the rest of the paper we will see them as plane trees, see Section 1.2.2., which is called the weighted recursive tree with weights . We construct a sequence of rooted trees starting from containing only one root-vertex with label and let it evolve in the following manner: the tree is obtained from by adding a vertex with label . The parent of this new vertex is chosen to be the vertex with label with probability proportional to its weight, that is
[TABLE]
Remark that this conditional distribution does not depend on the evolution up to time , which ensures in particular that the random variables are independent. In this definition, we also allow sequences of weights that are random and in this case the distribution denotes the law of the random tree obtained by the above process conditionally on , so that the obtained distribution on growing trees is a mixture of WRT with deterministic sequences of weights.
Similarly, for any sequence of real numbers, with and for , we define another model of growing tree. The construction goes on as before: contains only one root-vertex with label and is obtained from by adding a vertex with label and the parent of the newcomer is chosen to be the vertex with label , where now
[TABLE]
where denotes the number of children in the tree . In the particular case where , the second vertex is always defined as a child of , even in the case for which the last display does not make sense. We call this sequence of tree an affine preferential attachment tree with fitnesses and its law is denoted by .
Notation.
Here and in the rest of the paper, whenever we have any sequence of real numbers , we write in a bold font as a shorthand for the sequence itself, and with a capital letter to denote the sequence of partial sums defined for all as . In particular, we do so for sequences of fitnesses , for deterministic sequences of weights and for random sequence of weights .
Representation result.
The following result gives a connection between these two models of growing trees. It is an analogue of the so-called "Pólya urn-representation" result described in [4, Theorem 2.1] or [8, Section 1.2] for related models, which already cover the case of constant sequences .
For the distribution has density with respect to Lebesgue measure. If and , we use the convention that the distribution is a Dirac mass at . {theorem}[WRT-representation of PAT’s] For any sequence of fitnesses, we define the associated random sequence as
[TABLE]
where the are independent with respective distribution . Then, the distributions and coincide.
The result of the theorem is obtained by studying the evolution of the degrees in the preferential attachment model . The key argument lies in the fact that we can describe the whole process using a sequence of Pólya urns, related to the degrees of the vertices. The connection of the evolution of the degrees to Pólya urns in the context of preferential attachment models is well-know and was observed for the first time in [33]. It explains why Beta-distributed random variables appear in the limit. In our case, the theorem relies on applying the de Finetti theorem to this sequence of urns and on proving that those urns are jointly independent.
In fact, the result stated in the theorem can be made a bit more precise than an equality in distribution as soon as the sequence is chosen in such way that almost surely the degree of the first vertex tends to infinity as . For example, it is easy to check that the condition is sufficient to ensure this behaviour, and in this case we can state the following corollary.
Corollary \thetheorem.
For a sequence such that , we can construct the sequence from in such a way that for all :
[TABLE]
The obtained sequence has the distribution described in Theorem 1.1 and conditionally on this sequence has distribution .
In fact, and this is the content of Proposition 1.1 below, if grows linearly as some with some then the sequence almost surely grows as some power of which depends on . This is done using moment computations under the explicit definition of given by the theorem. In the rest of the paper, we investigate several properties of the WRT under this type of assumptions for the sequence of weights, such as convergence of height, profile and measures carried on the tree. Thanks to this connection, our results will then also hold for the PAT under the assumption that grows linearly.
Assumptions on the sequences.
For two sequences and we say that
[TABLE]
Our main assumption for sequences of fitnesses is the following (), which is parametrised by some positive and ensures that the fitness of vertices is on average
[TABLE]
For sequences of weights , we introduce the following hypothesis, which depends on a parameter
[TABLE]
where denotes a positive constant. The following proposition ensures in particular that our assumption on sequences of fitnesses translates to a power-law behaviour for the random sequence of cumulated weights defined in Theorem 1.1.
Proposition \thetheorem.
Suppose that there exists such that satisfies (), then the random sequence defined in Theorem 1.1 almost surely satisfies () with
[TABLE]
If furthermore is such that for some , then almost surely , where is a random function of which tends to [math] when .
Convergence of degrees using the WRT representation.
In the WRT with a deterministic sequence of weights that satisfies
[TABLE]
for some , the degree of a fixed vertex evolves as a sum of independent Bernoulli random variables and it is possible to handle it with elementary methods and obtain
[TABLE]
Further calculations allow us to improve this statement to an almost sure convergence
[TABLE]
in the space of -th power summable sequences, for weight sequences that satisfy some additional control. A precise version of this statement is given in Proposition 2.1.
Suppose that satisfies () and consider which has distribution . Then, according to Theorem 1.1 and Corollary 1.1, we know that conditionally on the sequence obtained as in (4), the sequence has distribution . Also, thanks to Proposition 1.1, we know that there exists some random variable such that almost surely as , with . So let us introduce
[TABLE]
Applying the convergence (8) conditionally on the sequence (or equivalently conditionally on ) yields an almost sure convergence in the product topology on sequences, which can be improved to an convergence if satisfies some additional control, thanks Proposition 2.1. This is stated below as a theorem. {theorem} Suppose that satisfies (). Then for a sequence we obtain the following almost sure convergence in the product topology
[TABLE]
Furthermore, if for some , the previous convergence also takes place in the space of -th power summable sequences, for all .
Let us emphasize that the function that outputs the maximum of a sequence is a continuous function, so that the scaling limit of the maximal degree in the tree is ensured by the theorem whenever the appropriate condition on the sequence is satisfied. Convergence of the rescaled degree of fixed vertices in preferential attachment trees is a well-know phenomenon in the case a preferential attachment trees with constant fitnesses, as is the convergence of the maximum of that sequence, see [33]. However, to the best of the author’s knowledge, Theorem 1.1 is the first result that ensures an almost sure convergence of the rescaled degrees as a sequence in such a topology. This improves the convergence proved in distribution in [36] for a related model, which we treat in Proposition 5.3.
The distribution of the limiting sequence can be characterized, and even has a reasonable description for certain regular sequences of fitnesses , as it is explained in the following paragraph. This result is actually related to the study of some urn models like the Pólya urns with immigration of [38] or the periodic Pólya urns of [2] and allows us to provide some alternative proofs and complete some of the known results about those processes. This is developed in Section 5.2.
Distribution of the limiting chain.
Let us say a word on the properties of the non-decreasing sequence that corresponds using our notation to the sequence defined in (9). Of course, using the random variables defined in Theorem 1.1, we can write for any ,
[TABLE]
but then because the random variable depends on the whole sequence , the sequence is not just an iterated product of independent random variables, as it was the case for . Nevertheless, the sequence still has the nice property of being a time-inhomogeneous Markov chain with a simple backward transition, characterised by the equality
[TABLE]
where is independent of and has distribution . This is the content of Proposition 4.3.
For some specific choices of sequences , the distribution of the chain is explicit. Whenever is of the form
[TABLE]
we retrieve Goldschmidt and Haas’ Mittag-Leffler Markov chain family, introduced in [21] and also studied by James [25].
The other case where the chain is explicit is when is periodic starting from the second term, of the form
[TABLE]
Then the sequence has an explicit distribution defined using products of Gamma-distributed random variables. We define it in Section 5.1.2.
1.2 Other geometric properties of weighted random trees
Let us now state the convergence for other statistics of weighted random trees, namely profile, height and probability measures. Here we let be a sequence of trees evolving according to the distribution for some deterministic sequence and state our results in this setting. Our results will also apply to random sequences of weights that satisfy the assumptions of the theorems almost surely, they will hence apply to PAT with appropriate sequences of fitnesses, thanks to Theorem 1.1 and Proposition 1.1.
1.2.1 Height and profile of WRT
Let
[TABLE]
be the number of vertices of at height . The function is called the profile of the tree . The height of the tree is the maximal distance of a vertex to the root, which we can also express as . We are interested in the asymptotic behaviour of and as .
In order to express our results, we need to introduce some quantities. For , we define the function as
[TABLE]
This function is increasing on and decreasing on with and and . We define and as
[TABLE]
We are going to assume that we work with a sequence which satisfies the following assumption () for some and ,
[TABLE]
Thanks to Proposition 1.1, this property is almost surely satisfied for by the random sequence for any sequence of fitnesses satisfying and . {theorem} Suppose that there exists and such that the sequence satisfies (). Then, for a sequence of random trees , we have the almost sure asymptotics for the profile
[TABLE]
where the error term is uniform in . Also for any compact we have almost surely for all
[TABLE]
where the error term is uniform in . Moreover, we have the almost sure convergence
[TABLE]
The proof of this result follows the path used for many similar results for trees with logarithmic growth (see [10, 11, 29]): we study the Laplace transform of the profile on a domain of the complex plane and prove its convergence to some random analytic function when appropriately rescaled. Then, we apply [28, Theorem 2.1], which consists of a fine Fourier inversion argument and hence allows to obtain precise asymptotics for . The application of the theorem in its full generality proves a so-called Edgeworth expansion for , which we express here in a weaker form by equations (13) and (14). The convergence (13) expresses that the profile is asymptotically close to a Gaussian shape centred around and with variance , so that a majority of vertices have a height of order . The second equation (14) provides the behaviour of the number of vertices at a given height, for heights that are not necessarily close to (for which the preceding result ensure that there are of order vertices per level). According to this result, at height for any there are of order vertices.
Remark that the exponent is continuous in and tends to [math] when . Although this does not directly prove the convergence (15), it already provides a lower-bound for since it ensures that asymptotically there always exist vertices at height , for any small . The convergence of the height (15) can then be obtained by proving a corresponding upper-bound, which can be done using quite rough estimates.
This result includes the well-known asymptotics as for the uniform random tree, proved for example in [15, 42]. Using the connection of preferential attachment trees to weighted recursive trees given by Theorem 1.1, it also includes the case of preferential attachment trees with constant fitnesses, for which similar results were proved, in [42] for the height and in [29, 28] for the asymptotic behaviour of the profile (13).
Remark \thetheorem.
One can also notice that, in the case , the function tends to a negative value as so that it is understandable that the asymptotics (14) couldn’t be valid for values of below a certain threshold. Nevertheless, one can check that and wonder what happens for values of that are slightly below . In fact, in the proof, we first study the weighted profile of the tree, which corresponds to the total weight of vertices at every height instead their number. In this case, the study of the corresponding Laplace transform is easier and would lead to a statement similar to Theorem 1.2.1 for the asymptotics of the weighted profile that would hold for any such that . A subsequent part of the proof then consists in transferring this result to the Laplace transform of the "true" profile of the tree, and this is the part that breaks down if is chosen smaller than .
As a complement to this result, let us mention that there is another case where we can compute the asymptotic height of the tree, which corresponds to sequences that grow fast to infinity. For any sequence of weights , a quantity of interest is , which is the expected height of a vertex taken with probability proportional to its weight in . When this quantity grows faster than logarithmically, we have the almost sure convergence (see Proposition 3.3 in Section 3.3)
[TABLE]
which indicates that all the action takes place at the very tip of the tree.
1.2.2 Convergence of the weight measure
We also study the convergence of some natural probability measures defined on the trees . This will prove useful for the applications developed in the companion paper [47].
Plane-tree framework.
For this result it will be easier to work with plane trees. We introduce the Ulam-Harris tree , where , with the convention that . Classically, a plane tree is defined as a non-empty subset of such that
- (i)
if and for some , then , 2. (ii)
for all , there exists such that for all , iff .
We choose to construct our sequence of weighted recursive trees as plane trees by considering that each time a vertex is added, it becomes the right-most child of its parent. In this way the vertices of the trees , listed in order of arrival, form a sequence of elements of . In fact, from now on, we will always assume that we use this particular embedded construction, both for the WRT and the PAT. Note that with this representation as unlabelled subsets of , the tree itself, for any , does not contain information relative to the labelling (hence the weight) of its vertices, but this piece of information can be read from the sequence .
We also denote , which we can be interpreted as the set of infinite paths from the root to infinity, and write . We classically endow this set with the distance
[TABLE]
where denotes the most recent common ancestor of and , and the height of a vertex is defined as the only such that . Note that even when , their most recent common ancestor belongs to , as long as . Endowed with this distance, is then a complete separable metric space.
In the paper, except when proving results related to the weak convergence of measures, for which we use the topology generated by , we consider as a graph and that we compute distances between vertices using the corresponding graph distance, which we denote . In particular, the height of a vertex is always its graph distance to the root .
Convergence of measures.
For every , we define the measure on , which only charges the set of vertices of , with for any ,
[TABLE]
We refer to as the natural weight measure on . The following theorem classifies the possible behaviours of for any weight sequence.
{theorem}
The sequence converges almost surely weakly towards a limiting probability measure on . There are three possible behaviours for :
- (i)
If , then is carried on . 2. (ii)
If and , then is diffuse and supported on . 3. (iii)
If then is concentrated on one point of .
This convergence can be extended to other natural measures on the tree, such as the uniform measure on , or some "preferential attachment measure" which charges each vertex proportionally to some affine function of its degree. This is the content of Proposition 2.2.2. Note that in our case of interest, when the sequence satisfies the assumption () for some , we are in case (ii) of the theorem.
In the specific case of (resp. ) with a sequence of weights (resp. of fitnesses) that is constant starting from the second term, the measure has an explicit description: if any , writing for the sub-tree descending from , the sequences
[TABLE]
are independent and have an explicit distribution (see [41] for a definition). Furthermore, the corresponding sequence of trees, conditionally on , can be described as a split tree. This result, along with other other properties of these families of growing trees can be found in [27].
1.3 Organisation of the paper
The paper is organised as follows.
We first investigate some properties of weighted random trees with deterministic weight sequence . In Section 2.1 we first prove Proposition 2.1 which states the convergence of the degree sequence using elementary methods. Then in Section 2.2, we prove the weak convergence of the weight measure to some limit and describe three regimes for its behaviour. We also study other natural measures related to the sequence of trees and prove that they also converge towards . For all these measures, our main tool consists in introducing martingales related to the mass of a subtree descending from a fixed vertex. This is the content of Theorem 1.2.2 and Proposition 2.2.2. In Section 3, we prove Theorem 1.2.1 about the convergence of the height and the profile of WRT. This is achieved by first proving the uniform convergence of a rescaled version of the Laplace transform of the profile on a complex domain, which is the content of Proposition 3. This ensures that we can use [28, Theorem 2.1] for the convergence of the profile. This convergence provides a lower-bound for the height of the tree; we then prove a matching upper-bound to obtain asymptotics for the height. We also prove Proposition 3.3, which identify the asymptotic behaviour of the height of the tree in the case where the weights increase very fast.
Then we switch to studying a sequence of preferential attachment trees with sequence of fitnesses . In Section 4, we present a proof of Theorem 1.1 and Corollary 1.1 using a coupling between the preferential attachment process with a sequence of Pólya urn processes and this establishes that can also be described as having distribution for a random sequence ; we then prove Proposition 1.1 which relates the properties of to the ones of . We finish the section by stating and proving Proposition 4.3 in which we prove that the sequence defined above as some random multiple of is a Markov chain. In Section 5, we identify in Proposition 5.1 the distribution of the chain for particular sequences using moment identifications. We then present two applications of this result, one concerning a model of Pólya urn with immigration and the other concerning another model of preferential attachment graphs, in Proposition 5.3.
Some technical results can be found in Appendix A.
Acknowledgements
The author would like to thank the anonymous referees for their numerous comments and suggestions that helped improve the presentation of this paper. He would also like to thank Philippe Marchal whose remarks led to an improvement in the generality of one of the results.
2 Measures and degrees in weighted random trees
In this section, we work with a sequence of trees that has distribution for a deterministic sequence . We start with two statistics of the tree that are quite easy to analyse, namely the sequence of degrees of the vertices of the tree and also some natural measures defined on the tree.
2.1 Convergence of the degree sequence
We start the section by proving convergence for the sequence of degrees of the vertices in their order of creation under the model. We suppose here that the sequence of weights is such that there exists constants and for which
[TABLE]
We write for the out-degree of the vertex in . For a fixed remark that, as a sequence of random variables indexed by , we have the equality in distribution
[TABLE]
with a sequence of independent uniform variables in . With this description of the distribution of the degrees of fixed vertices, only using some law of large numbers for the convergence and Chernoff bounds for the fluctuations we obtain the following result.
Proposition \thetheorem.
For a sequence of weights satisfying (18), the following holds.
- (i)
We have the almost sure pointwise convergence
[TABLE] 2. (ii)
If the sequence furthermore satisfies for some constant , then there exists a function of which goes to [math] as , also denoted , such that all large enough, we have for all
[TABLE]
and the convergence (20) holds almost surely in the space for all .
Proof.
To prove (i), first remark that for any such that , thanks to (18), we have
[TABLE]
Using a version of the Borel-Cantelli lemma (see Lemma A in the appendix), we get that almost surely
[TABLE]
and hence . For the indices for which , we of course have almost surely for all , and so the convergence also holds. This finishes the proof of (i).
For the second part of the statement, let us first compute
[TABLE]
where we have used the inequality . Now let be a constant such that for all , we have (such a constant exists because of the assumption (18)). For all , we introduce the following
[TABLE]
where the real number is chosen in such a way that the function is decreasing on . Using Markov’s inequality, we get for any integers and such that
[TABLE]
Using a union bound, the fact that for any , and the definition of , we get that for all
[TABLE]
The last display is summable over all and hence using the Borel-Cantelli lemma, we almost surely have for large enough
[TABLE]
We can conclude by noting that under our assumptions we have . The convergence in for is just obtained by dominated convergence using the componentwise convergence (20) and the domination (21). ∎
2.2 Convergence of measures
The goal of this section is to prove Theorem 1.2.2, which concerns the convergence of the sequence of weight measures seen as measures on . One of the key arguments is the fact that the weight of the subtree descending from a fixed vertex can be described using a generalised Pólya urn scheme, as studied by Pemantle [40]. We also prove Proposition 2.2.2, which states the weak convergence of other measures.
Time-dependent Pólya urn scheme.
Let us start by describing an urn process, following Pemantle [40]. Let be two non-negative real numbers, with , and be an integer and be a sequence of non-negative real numbers. We refer to the following process as a time-dependent Pólya urn starting at time with red balls and black balls and weight sequence :
- •
At time , the urn contains red balls and black balls222Those numbers of balls are not required to be integers..
- •
Then at every time , a ball is drawn at random and replaced in the urn, along with additional balls of the same colour.
For any we call the proportion of red balls in the urn at time . We can easily check that is a martingale in its own filtration, with values in . As a result, it converges as a.s. and in towards some random variable .
Characterization of the convergence of probability measures over .
Recall from the introduction the definition of the Ulam-Harris tree and its completed version , which is endowed with the distance defined in (16). Recall that is a separable and complete metric space.
For any , we write the subtree descending from . In there is an easy characterisation of the weak convergence of sequences of probability measures defined on the Borel--field associated to , which a direct consequence of the Portmanteau theorem (see e.g. [7, Theorem 2.1]):
Lemma \thetheorem.
Let be a sequence of Borel probability measures on . Then converges weakly to a probability measure if and only if for any ,
[TABLE]
Let us provide a proof of this lemma for completeness.
Proof.
We can check that the sets of the form for , or for , are clopen for the topology generated by , so by the Portmanteau theorem this already proves the "only if" part of the lemma. Now reciprocally, suppose that the condition on is satisfied. We can check that every open set can be written as a countable disjoint union of these clopen sets (see for example [45, Lemma 1.2] for a similar statement for the topology of ), which we write . Then, using Fatou’s lemma and the -additivity of measures we get
[TABLE]
We conclude using the Portmanteau theorem again. ∎
2.2.1 Proof of Theorem 1.2.2.
We are going to apply this criterion to our sequence , which, we recall, is defined in such a way that for all , the measure charges only the vertices of the tree , and such that for any ,
[TABLE]
Proof of Theorem 1.2.2.
We can already see that if is bounded and hence converges to some we have as . In this case it is easy to verify that weakly converges to the measure which is such that . In this case and so is carried on .
Let us now assume that as and show that in this case, converges weakly to some limit that is carried on . In this case we have as . Let us denote for every integers ,
[TABLE]
the proportion of the total mass above vertex at time . Remark that this quantity evolves as the proportion of red balls in a time-dependent Pólya urn scheme with weights , starting at time with black balls and red balls. Hence for all , the sequence almost surely converges to a limit . Also, for any that does not receive a label in the process, the sequence (and also ) is identically equal to zero. Hence we have convergence of and for all .
The last step in order to prove the weak convergence of is to prove that the quantities that we obtain in the limit indeed define a probability measure on . If for all we have
[TABLE]
then it entails that , where is the unique probability measure on such that for all ,
[TABLE]
The existence of such a measure is ensured by the Kolmogorov extension theorem on the product space .
For any , the equality (23) is immediate, so let us prove it for all for . For any , let
[TABLE]
Using what we just proved, we know that for any , the quantity almost surely converges as to some limit . Proving (23) reduces to proving that for any , we almost surely have . By construction, the sequence is non-negative and non-increasing, hence it converges almost surely, so it suffices to prove that its almost sure limit is [math].
We define , the time when the vertex receives its -th child in the growth procedure. Conditionally on the event , the process evolves as the proportion of red balls in a time-dependent Pólya urn scheme, starting at time with red balls (that correspond to the weight of ) and blacks balls (that correspond to the total weight of other vertices in the tree), and weights . Hence we have
[TABLE]
On the event , we have for and for , which decreases almost surely to [math], so a.s. on that event.
Using the crude bound , which entails that almost surely, we get
[TABLE]
hence in , so its almost sure limit is also [math]. In the end, by Lemma 2.2, the sequence of measures almost surely converges weakly to a limit , and this measure only charges the set .
We just finished proving that, for any sequence of weight , the sequence almost surely converges weakly to a probability measure . Furthermore, we proved that is carried on when is bounded and carried on when . The proof is then finished by applying the lemma stated below. ∎
Lemma \thetheorem.
Suppose that so that is carried on . Then either and then is almost surely diffuse, or and then is carried on one point of .
Proof.
For any the process evolves as the proportion of red balls in a time-dependent Pólya urn started at time with red balls, black balls and a weight sequence . By the work of Pemantle in [39], if we assume then the limiting proportion almost surely belongs to the set . This translates into the fact that almost surely for any , which entails that is almost surely carried on one leaf of .
On the contrary, let us suppose that and prove that this entails that the limiting measure is diffuse almost surely. Consider the function which associates to each couple their most recent common ancestor in the completed tree . This function is continuous with respect to the distance . Then, since almost surely weakly, we also get the following almost sure weak convergence of the push-forward of the product measure on by the function :
[TABLE]
Let us fix and conditionally on , let and be two independent vertices taken under . Then, an argument taken from the proof of [13, Lemma 3.8] in a slightly different setting ensures that
[TABLE]
The argument goes as follows:
- •
with probability , we have ,
- •
with probability , it is not the case, and we can check that conditionally on this event, the vertices and defined as the most recent ancestor in of respectively and , are independent and taken under the measure , and that .
It suffices to then apply this in cascade to get the last display.
Note that thanks to the summability condition, the infinite product is non-zero, and this suffices to ensure that the obtained sequence is a probability distribution. Thanks to the weak convergence (24), it corresponds to the (annealed) distribution , where and are two independent points taken under the measure , conditionally on . Now we can write
[TABLE]
where the inequality is due to the fact that the vertices have a height smaller than . Hence . So, almost surely, two points taken independently under are different, and this ensures that is diffuse. ∎
2.2.2 Convergence of other sequences of measures.
We also study two other sequences of measures and carried on the Ulam tree . For every , these measures only charge the vertices in such a way that for any ,
[TABLE]
where is a sequence of real numbers such that and for all . We write . We suppose that and that there exists such that . The assumptions on the sequence are chosen such that they are satisfied by a sequence of fitnesses that satisfies () for some .
Proposition \thetheorem.
Under the assumptions and , the sequences and converge almost surely weakly towards the limiting measure on that is defined in Theorem 1.2.2.
The rest of this section is devoted to proving Proposition 2.2.2. We treat the two sequences of measures separately.
The degree measure.
Consider the sequence on . Since the sequence tends to infinity, we have for every . Indeed, using the equality in distribution (19) and Lemma A in the appendix, it is easy to see that either in which case the degrees are eventually constant as ; or , in which case we have the almost sure asymptotic behaviour . In both cases, for all , we have almost surely as .
For all , we keep the notation introduced in the proof of Theorem 1.2.2 and let
[TABLE]
We can check that
[TABLE]
Now, using that and that , we get
[TABLE]
Hence, if we denote , then the last computation shows that is a martingale for the filtration generated by . More precisely we can write
[TABLE]
hence we have
[TABLE]
Then, using [19, Chapter VII.9, Theorem 3], we get that if
[TABLE]
then a.s. as , which would prove that as . In our case, we can verify that (25) holds. Indeed, using the fact that we assumed that and , we have
[TABLE]
which is summable under our assumptions. In the end, using Lemma 2.2, we have the almost sure convergence
[TABLE]
The uniform measure on the vertices of .
Consider the sequence on . Fix . For any we can write . For any , we have , which tends a.s. to some limit as . Using Lemma A in the appendix, we have
[TABLE]
and also
[TABLE]
In both cases we get almost surely. We also have for any ,
[TABLE]
so we can conclude using Lemma 2.2 that almost surely weakly.
3 Height and profile of WRT
The main goal of this section is to prove Theorem 1.2.1 which gives asymptotics for the profile and height of the tree. Recall that we denote
[TABLE]
the number of vertices at height in the tree . In order to get information on the sequence of functions we study their Laplace transform
[TABLE]
where the last expression is given using an integral against the probability measure defined in Section 2.2 as the uniform measure on the vertices of . The key result in our approach is to prove the convergence of this sequence of analytic functions when appropriately rescaled, uniformly in on an open neighbourhood of [math] in the complex plane. It then allows us to use [28, Theorem 2.1] and hence derive a convergence result for the profile. We actually start in Section 3.1 by studying the convergence of the similarly defined sequence of functions
[TABLE]
where we integrate with respect to the weight measure instead of the uniform measure as before. This one is easier to study because for every fixed , it defines a martingale as grows, up to some deterministic scaling. Then in Section 3.2, we make use of this first convergence and show that up to some deterministic multiplicative constant, the two sequences of integrals appearing in (26) and (27) are almost surely equivalent when tends to infinity.
Let us fix some for this whole section. Throughout this section, we always work under the assumption that () holds for the sequence . For some results, we will assume that their exists such that the stronger condition () holds, i.e.
[TABLE]
We let be a function of a complex parameter and let be the following rescaled version of the Laplace transform of the profile
[TABLE]
The proposition below ensures that the sequence converges uniformly on all compact subsets of some domain to some limiting function which does not vanish anywhere on the set , along with some more technical statements.
Proposition \thetheorem.
Suppose that the weight sequence satisfies () for some and some . Then there exists a domain such that where and are defined as in (12), such that the following properties are satisfied.
- (i)
With probability , the sequence of random analytic functions converges uniformly on all compact subsets of , as , to some random analytic function which satisfies . 2. (ii)
For every compact subset and , we can find an a.s. finite random variable such that for all ,
[TABLE] 3. (iii)
For every compact subset , every and ,
[TABLE]
Under the results of Proposition 3 we can apply [28, Theorem 2.1] whose conclusions for the sequence are the following. For any and , we denote
[TABLE]
Then, for every integer and every compact subset , we have the convergence
[TABLE]
where for all , the (random) functions are polynomials of degree at most in and are entirely determined from and , with , see [28, Equation (16)] for their complete definition. The asymptotics (13) and (14) stated in Theorem 1.2.1 follow from the last display. Indeed, (13) is obtained by letting and and using the fact that almost surely. For (14), we let , and use .
In Section 3.3, we complete the proof of Theorem 1.2.1 by computing the asymptotic behaviour of the height of the tree. Since the convergence of the profile already ensures that there almost surely are vertices at height for small enough and all large enough, it suffices to prove a corresponding upper-bound in order to finish proving the convergence (15) in Theorem 1.2.1.
3.1 Study of the Laplace transform of the weighted profile
We study the sequence . The following lemma is the starting point of our analysis. In this section, we will use the notation .
Lemma \thetheorem.
For all and all , we have
[TABLE]
Proof.
Recall that conditionally on , the -st vertex of is a child of the vertex , where . We compute
[TABLE]
Taking conditional expectation with respect to yields:
[TABLE]
This concludes the proof. ∎
Let be an integer that we are going to fix later on. The last result ensures that if is such that , then we can define for all
[TABLE]
and the sequence is a martingale. We want to prove results about the asymptotic behaviour of , uniformly in on an appropriate set. If is fixed, then there exist parameters with for which the sequence takes the value [math]. Under the assumption () on the sequence , we know that as . If we restrict ourselves to a set of the form for some , then
[TABLE]
hence it suffices to take large enough in order for the sequence to only take non-zero values for all and all . In what follows we work on the set
[TABLE]
where is as defined in Proposition 3. For technical reasons, we also sometimes consider the larger set
[TABLE]
Using the preceding discussion, we fix such that the sequence does not have any zero on , so that is well-defined for all .
We introduce the following notation. Let and be two functions of a complex parameter and an integer . For a set of the complex plane we write
[TABLE]
to express the fact that is a big (resp. small) o of as , uniformly on every compact . Note that later in the paper, we will also use this notation for random functions of and when such a comparison holds almost surely.
Now, let us derive some information on the asymptotic behaviour of .
Lemma \thetheorem.
Suppose that satisfies (). Then there exists and an analytic function on such that
[TABLE]
Remark that the lemma implies that for any , we have
[TABLE]
as . It is also immediate that satisfies the same asymptotics up to a constant, as soon as is such that .
Before proving the lemma, we state the following result which follows from elementary calculus. Its proof can be found in the appendix.
Lemma \thetheorem.
Suppose that satisfies (). Then there exists such that
[TABLE]
Proof of Lemma 3.1.
We write for the principal value of the complex logarithm. For such that we have . If for every and , we let
[TABLE]
which is well-defined thanks to our choice of , then is summable in and the rest of the series is
[TABLE]
for some , thanks to Lemma 3.1. Then we write
[TABLE]
which yields using (31) and Lemma 3.1
[TABLE]
and is an analytic function of , which finishes the proof. ∎
Convergence of the martingales .
When the parameter is a positive real number, the sequence is a positive martingale and so it converges almost surely to some limit. We want to prove that these martingales converge almost surely and in for the largest possible range of parameters . For the rest of Section 3.1 and also in the subsequent Section 3.2, we assume that the weight sequence satisfies () for some fixed parameters and .
We align our notation with the one used in [11, Theorem 2.2] which states something similar to our forthcoming Proposition 3.1 for another model, the binary search tree.
For any and , we let
[TABLE]
For any , let , and denote
[TABLE]
Lemma \thetheorem.
The set is open and contains the open interval of real numbers which contains [math].
Proof.
Of course is open as a union of open sets. For any real we have . So, if then there exists for which . Since , the set contains the interval defined above. Since , we have . ∎
Proposition \thetheorem.
The sequence of functions converges uniformly almost surely and in towards an analytic function on every compact subset of . Furthermore, for any compact subset , there exists a real such that almost surely
[TABLE]
The proof of the proposition will follow from the next lemma, together with Lemma A, stated in the appendix.
Lemma \thetheorem.
For any and we have
[TABLE]
and also
[TABLE]
Proof.
For any and , we write
[TABLE]
Taking the -th power of the modulus on both sides and using the inequality , we get
[TABLE]
Using Lemma A in the appendix, we have for any ,
[TABLE]
Using the last display and equation (3.1), we get a recurrence inequality of the form
[TABLE]
where
[TABLE]
Applying (37) in cascade we get
[TABLE]
Now notice that from our assumption on the sequence we have
[TABLE]
On the other hand, since then , so we can use Lemma 3.1 to get
[TABLE]
We conclude using the following lemma which is an application of Hölder’s inequality using the assumption ().
Lemma \thetheorem.
For any we have .
Together with (39), this proves that is summable and so . Also
[TABLE]
and so . Replacing this in (38) finishes to prove (34). In order to prove (35), we use Lemma A again and write
[TABLE]
Using Lemma 3.1 we get which finishes the proof of the lemma. ∎
Proof of Proposition 3.1.
Any compact subset can be covered by a finite number of . The convergence result is then an application of Lemma A, on the set with and, say . The limiting function is analytic as a uniform limit of analytic functions. ∎
Zeros of the limit.
Now that we have proved that their exists a limiting function defined on the set , we are interested in the possible location of the zeros of this random function. In fact, the function is related to the function of Proposition 3, for which we aim to prove that it has almost surely no zero on some real interval which contains [math]. We will prove a similar result for in Lemma 3.1, and we start by proving the following weaker statement. Recall the definition of the interval in Lemma 3.1.
Lemma \thetheorem.
For all , we have almost surely . As a consequence, the number of zeros of the map on the interval is almost surely at most countable.
Let us recall from (1) the definition of the sequence of independent random variables that is used to construct the trees .
Proof of Lemma 3.1.
This follows from an application of Kolmogorov’s law. Indeed, fix and and for all , let
[TABLE]
where denotes the graph distance in , and the distance between a vertex and a subset of vertices is defined the usual way. The idea behind is that, up to a positive multiplicative constant (i.e. a deterministic constant that depends on and but not on ), it has the same distribution as the sequence associated to the growth of the tree that one informally obtains by contracting all the vertices into one. Note that the growth of such a tree can be described as that of a weighted recursive tree with weight sequence , which shares the same asymptotic property () as the original sequence .
We claim the following:
- (i)
Due to the above remarks, is a positive martingale which satisfies the same assumptions as so it converges a.s. and in towards a non-negative limit, , thanks to Proposition 3.1, which we can apply here because . 2. (ii)
We have . 3. (iii)
The sequence , hence its limit , is independent of the first steps of the construction, and is hence a measurable function of the sequence .
Using all these observations we deduce that for any we have the equality of events . This proves that is measurable with respect to the tail -algebra generated by the sequence , which is a sequence of jointly independent random variables. Kolmogorov’s [math]- law then ensures that this event has probability [math] or . By convergence we have and this proves our claim. It follows immediately that the limit can only have finitely many zeros in any compact subset of almost surely, because otherwise, by analyticity of on the connected component of that contains , the function would be identically [math] on with positive probability. This ensures that the total number of zeros in is at most countable and finishes the proof. ∎
Lemma \thetheorem.
The function has almost surely no zero on .
In order to prove this lemma, we use an argument of self-similarity: essentially, if we take two vertices and in the tree, then conditionally on the sequences of vertices that are grafted above or above , the subtrees above and evolve as two independent weighted recursive trees. Using Proposition 3.1 and Lemma 3.1, the normalized Laplace transform of the weighted profile of each of those two subtrees should converge almost surely to some random analytic function on which is non-negative on and has at most countably many zeros on this interval. Since the two are independent, their zeros should not overlap and hence the sum of their contribution should result in a function that is positive on .
Proof.
Let us formalise this line of reasoning. Using Theorem 1.2.2, we know that the measure on is almost surely diffuse, hence we can define
[TABLE]
and they are almost surely finite.
Let us consider the sequences for , which record the times when a vertex is added to or , and work conditionally on them for the rest of the proof. We let
[TABLE]
which record respectively the number of vertices among that are in and conversely, the -th time where a vertex is added to in the construction of . We let and , and also . We also define for and
[TABLE]
the subtree hanging above at the time where it contains exactly vertices (translated to the origin in order to be considered as a plane tree).
Let us state the following intermediate result, which we will prove at the end of the section. Note that the random sequences , and for can be read from for .
Lemma \thetheorem.
The following holds.
- (i)
For , we almost surely have . 2. (ii)
For , the sequence satisfies () almost surely. 3. (iii)
Conditionally on the two sequences and , the sequences of trees and are independent and have respective distributions and .
Recall the discussion before Lemma 3.1. For , let be the smallest integer such that for all and for all we have . Then we can define for ,
[TABLE]
These processes are the martingales associated to the weighted profile of the trees for . Thanks to Lemma 3.1(iii) those trees have respective distribution , for and thanks to Lemma 3.1(ii), those weight sequences satisfy () almost surely. This allows us to apply Proposition 3.1, which entails that for , the sequence of functions converges almost surely to an analytic limit on the set . Now we can write, for sufficiently large
[TABLE]
Using Lemma 3.1, we have almost surely for ,
[TABLE]
Using the asymptotics from Lemma 3.1(i) we get
[TABLE]
From the a.s. convergence of the sequence of measures , see Theorem 1.2.2, we also get
[TABLE]
which entails that for , uniformly on all compact subsets of , we have the a.s. convergence
[TABLE]
where the limiting function is analytic and only takes positive values on . Then, for any , taking the limit in (3.1) yields
[TABLE]
Now, thanks to Lemma 3.1, the function can only have at most countably many zeros on and for all , we have almost surely. Then if we condition on the location of the zeros of on , since is independent of , we have for all almost surely. Hence has almost surely no zeros on . ∎
Now let us prove Lemma 3.1 which we used in the preceding proof.
Proof of Lemma 3.1.
Point (i) follows just from Theorem 1.2.2 and Proposition 2.2.2 and the fact that for we have .
Let us prove (ii). In order to do that we are going to prove that for , we have
[TABLE]
Let us conclude from here: using the fact that satisfies (), we get
[TABLE]
with a positive constant. We also have
[TABLE]
Because of (42), we have the following almost sure convergence as , hence almost surely for large enough we have , so
[TABLE]
where in the two last inequalities we used the fact that satisfies () and the almost sure linear growth of ensured by (42).
So it remains only to prove (42). Recall the proof of Theorem 1.2.2. For all the process is a martingale and almost surely we have
[TABLE]
Using successively Lemma A and then Lemma 3.1, which applies because satisfies (),
[TABLE]
Using then Lemma A with and and , we get that almost surely for some . Since this is true almost surely for all , we use it with . As by definition for we have , we conclude that .
Then, for any , consider the process . It is easy to verify that this process is a martingale in its own filtration and that its increments are bounded by . Using again Lemma A with and and , we get that for some . Using again that for the limit is almost surely positive, we can write . Using the definition of , we can check that this entails that almost surely. This concludes the proof of (42) and so, (ii) is proved.
Let us now prove (iii). For any , we consider the sequence that encodes the labels of the vertices above . Note that the limiting mass can be computed from that sequence. Now, let us sequentially reveal until we get to a for which . By definition, the first index for which it happens is .
Then we continue revealing the sequences for but only for the ’s such that until we get to a for which . By definition, this second index is . Remark, and this is the key in this argument, that after determining and in this way, the only information that we have about and is the list of labels of the vertices that belong each of them (and the position of and ).
Now, conditionally on all this information, it is straightforward to see from the attachment dynamics that for any , when the -st vertex attaches above at time , the label of the vertex to which it attaches is chosen among with probability proportional to their respective weight , independently for different choices of and . This finishes the proof of (iii) and hence that of the lemma. ∎
3.2 From the weighted to the unweighted sum.
Now we want to transfer these results of convergence to the Laplace transform of the real profile. Recall from (28) the definition of the sequence of functions . We still assume until the end of Section 3.2 that satisfies () for some and .
We introduce the following quantity, for ,
[TABLE]
The goal of this subsection is to show that the quantity is negligible as compared to any of the two terms in the difference, for contained in some subset of the complex plane. This way we will transfer the asymptotics that we have proved for and in the last section to asymptotics for , which is the quantity that we want to study in the end. Let us start by proving a lemma.
Lemma \thetheorem.
The process is a martingale with respect to . Furthermore, for all ,
[TABLE]
Proof.
This process is of course -adapted and integrable. For the martingale property we compute
[TABLE]
For and , we make the following computation, using Lemma 3.1 and Lemma 3.1,
[TABLE]
and the last exponent reduces to because . Hence, using Lemma A, we get
[TABLE]
which finishes the proof of the lemma. ∎
Recall the definition of and in (12). Let us define the domain to which we refer in the statement of Proposition 3 as the connected component of
[TABLE]
that contains [math], where is defined in (33). In this way, is a domain of and . Indeed, first, is open and connected by definition. Then recall from Lemma 3.1 that contains an open interval which contains [math] and has as its right endpoint. Now just check that and that .
For technical reasons, we also introduce the following subset of , here identified as ,
[TABLE]
on which the process , and hence also , are well-defined. Let us further decompose into a union of open sets
[TABLE]
Lemma \thetheorem.
The following holds.
- (i)
For all compact there exists such that almost surely
[TABLE] 2. (ii)
For all compact , there exists such that
[TABLE] 3. (iii)
For all compact , there exists such that almost surely
[TABLE]
Proof.
For the first one, for any we can apply Lemma A on the open set with and , thanks to Lemma 3.2. Then using the compactness of , (i) is true for every compact , hence for any compact .
Let us prove point (ii). For any , thanks to Lemma 3.1, on the open set we have and
[TABLE]
Applying Lemma A for the martingale on any compact subset with and yields:
[TABLE]
Using the estimates of Lemma 3.1, we have , and so . Hence which finishes the proof of (ii).
Last, in order to prove (iii), we use Lemma A on for the martingale with and . ∎
In order to conclude, we will also need the following lemma, which is a direct consequence of Lemma 3.1.
Lemma \thetheorem.
For any compact , there exists such that
[TABLE]
Proof.
On any compact , using Lemma 3.1 we write
[TABLE]
so that
[TABLE]
where in the second line, we use the fact that , and we define . This proves the lemma. ∎
We can now prove Proposition 3.
Proof of Proposition 3.
Let us start by proving simultaneously that for any , we almost surely have
[TABLE]
and also that both points (i) and (ii) of the proposition hold. For any compact and , we write
[TABLE]
The first term is thanks to Lemma 3.2(i). We bound the second one by the following quantity
[TABLE]
In the above display, we used Lemma 3.2 and then Lemma 3.1 together with Proposition 3.1 on respectively the first and the second term. In the end, the whole expression is . From (43), it is clear that the limiting function is analytic and has almost surely no zero on because of Lemma 3.1. For (iii), let us prove the stronger statement: for any compact subset and , there exists such that almost surely,
[TABLE]
For this, we write
[TABLE]
We apply points (ii) and (iii) of Lemma 3.2 to the compact and get the desired bound. ∎
3.3 Height of the tree
In this section, we study the behaviour of the height of the tree , which is defined as the maximal height of the vertices of , i.e.
[TABLE]
We start by showing that under the assumption () we have the convergence (15). Then, for the sake of completeness, we also study the simpler case where .
One key argument in our proofs is the following equality for the annealed moment generating function of the height of , for any fixed , which can be seen as a corollary of Lemma 3.1
[TABLE]
Some elementary computations using the Chernoff bound and the last display yield the following lemma.
Lemma \thetheorem.
Suppose that the sequence of weights satisfies
[TABLE]
Then almost surely we have
[TABLE]
where is the unique positive root of .
Proof.
Using the expression (44) for the moment generating function of we get, for any
[TABLE]
where we use the inequality and the assumption on . Then, for any and ,
[TABLE]
If we take such that then the right-hand-side is summable and hence using the Borel-Cantelli lemma shows that for all large enough, we have . Letting , we get the result. ∎
Let us prove the last claim (15) of Theorem 1.2.1. Here we suppose that the weight sequence satisfies () for some and some .
Proof of (15).
Recall the asymptotics (14) in Theorem 1.2.1. It ensures that there almost surely exist vertices at height , for any fixed and large enough. Hence the height of the tree satisfies
[TABLE]
For the limsup, we use Lemma 3.3 with (this is justified by Lemma 3.1), which yields . ∎
To finish the section, we state a proposition.
Proposition \thetheorem.
Let . If then we have the almost sure convergences
[TABLE]
Proof.
As we can check from its moment generating function (44), the random variable is a sum of independent Bernoulli random variables, with expectation . Using standard bounds for yields
[TABLE]
which is summable in for any . The result of the proposition is then obtained using the Borel-Cantelli lemma. ∎
4 Preferential attachment trees are weighted recursive trees
In this section, we study preferential attachment trees with fitnesses as defined in the introduction. First, in Section 4.1, we prove Theorem 1.1 which allows us to see them as weighted random trees for some random weight sequence . Then in Section 4.2 we prove Proposition 1.1 which relates the asymptotic behaviour of to the behaviour of . Finally, in Section 4.3 we prove Proposition 4.3, which ensures that the sequence obtained as the scaling limit of the degrees can be expressed as the increments of a Markov chain.
4.1 Coupling with a sequence of Pólya urns: proof of Theorem 1.1
Here we fix an arbitrary sequence such that and . Let us recall the notation, for ,
[TABLE]
with the convention that . We consider a sequence of trees evolving according to the distribution and we want to prove Theorem 1.1, namely that there exists a random sequence of weights for which the sequence evolves as a . The proof uses a decomposition of this process into an infinite number of Pólya urns. This is very close to what is used in the proofs of [4, Theorem 2.1] or [8, Section 1.2] in similar settings. The novelty of our approach is to express this result using weighted random trees, since it allows us to apply all the results developed in the preceding section.
Pólya urns.
For us, a Pólya urn process is a Markov chain on with transition probabilities given by the matrix where for all ,
[TABLE]
The quantities and represent respectively the number of red balls and the total number of balls at time in a urn containing red and blacks balls, in which we add a ball at each time, the colour of which is chosen at random proportionally to the current proportion in the urn. Starting at time [math] from the state , i.e. with red balls and black balls, it is well-known that the sequence of random variables is exchangeable, and an application of de Finetti’s representation theorem ensures that it has the same distribution as i.i.d. samples of Bernoulli random variables with a random parameter , which has distribution , where we use the convention that if .
Note that the process is entirely determined from and that the random variable is a measurable function of the sequence because it can almost surely be obtained as .
Nested structure of urns in the tree.
For all we define the following process in
[TABLE]
the "total fitness" of the vertices , for which we remark that for any we have
[TABLE]
Imagine that is constructed and we add a new vertex to the tree. We choose its parent in a downward sequential way:
- •
we first determine whether the parent is , this happens with probability
[TABLE]
- •
then with the complementary probability it is not, so conditionally on this we determine whether it is , this happens with (conditional) probability
[TABLE]
- •
then with the complementary probability it is not, etc… We continue this process until we stop at some .
Now let us fix and introduce the following time-change: for all , we let
[TABLE]
be the -th time that a vertex in attached on one of the vertices after time , where by definition, we have . Remark that it can be the case that is not defined for large , if there is only a finite number of vertices attaching to . Let us ignore this possible problem for the moment, and only consider sequences for which , for which this will almost surely not happen. In this case for all we set
[TABLE]
Now, the three following facts are the key observations in order to prove Theorem 1.1:
- (i)
for all , the process has the distribution of a Pólya urn starting from the state , 2. (ii)
those process are jointly independent for , 3. (iii)
the whole sequence is a function the collection of processes .
Point (i) already follows from the discussion above. A moment of thought shows that (ii) holds as well: of course the processes for different are not independent at all but the point is that they only interact through the time-changes . Last, for (iii), let us note that we can reconstruct the tree at time from the random variables and that these random variables can be entirely determined using
[TABLE]
Reversing the construction and using the exchangeability.
Let us now reverse the construction and start with an independent family of processes which have for each the distribution of a Pólya urn starting from the state , so that they have the joint same distribution as the ones described in (i) and (ii). From what we did above, the sequence that they determine through (iii) has distribution . A moment of thought shows that this argument actually still holds for a completely arbitrary sequence of fitnesses .
Now, using de Finetti’s theorem, each of the processes can be produced by sampling and adding a red ball at each step independently with probability and a black ball with probability . This is of course done independently for different .
In terms of our downward sequential procedure defined above for finding the parent of each newcomer, it amounts to saying that each time that we have to choose between attaching to or attach to a vertex among , the former is chosen with probability and the latter with probability . Let us verify that the law of conditionally on the sequence can indeed be expressed as WRT with the random sequence of weights defined in Theorem 1.1, which is defined from the sequence as,
[TABLE]
with the convention that and . Let us reason conditionally on the sequence (or equivalently the sequence ). When determining the parent of , whose label we denote as in (2), we successively try to attach to until we stop at . Using the independence, we get that for every ,
[TABLE]
This proves Theorem 1.1. Let us explain how Corollary 1.1 follows from the proof that we developed here. From the discussion in the previous paragraph, in the case of a sequence for which , each of the processes for is a measurable function of , and hence the associated also is. In the end, the sequence is a measurable function of and it is easy to check that it corresponds to the one described in the statement of Corollary 1.1.
4.2 Proof of Proposition 1.1
Let be the random sequence of cumulated weights defined Theorem 1.1, whose distribution depends on a sequence of fitnesses, and is expressed using a sequence of independent Beta-distributed random variables . We are going to prove Proposition 1.1, which relates the growth of to the one of .
Proof of Proposition 1.1.
As in [21, Proof of Lemma 1.1], we introduce
[TABLE]
It is easy to see that is a positive martingale, hence it almost surely converges to a limit as . Now, using the fact that the are independent and that for any integer , the -th moment of a random variable with distribution is given by
[TABLE]
we can compute
[TABLE]
Now from (), there exists such that and without loss of generality we can assume that . For all we can write
[TABLE]
Hence
[TABLE]
In the end, since , we get
[TABLE]
where is a positive constant which depends on the sequence and . This entails that, under our assumptions, for any , we have
[TABLE]
which shows that this martingale is bounded in for all and hence it is uniformly integrable. Consequently, it converges a.s. and in to a limit random variable , with moments determined by
[TABLE]
Furthermore, we have
[TABLE]
Since , we get
[TABLE]
Using (53), (54), Lemma A and then summing over and using the fact that satisfies () we get that
[TABLE]
Using Lemma A, we get that almost surely, for any ,
[TABLE]
Since almost surely for every , the event is a tail event for the filtration generated by the and has probability [math] or . In the end, it has probability [math] because . We deduce that
[TABLE]
Hence, we have,
[TABLE]
Whenever as , we can show the following (we postpone the proof to the end of the section)
Lemma \thetheorem.
For any small enough, we have
[TABLE]
Since the last quantity is summable in we can use the Borel-Cantelli lemma (and a sequence of going to [math]) to show that almost surely as , where the term denotes a random function of that tends to [math] when . Combining this with (56), we finish proving the proposition by writing
[TABLE]
We finish by giving a proof of Lemma 4.2.
Proof of Lemma 4.2.
Let and and let be a random variable with distribution and with distribution , independent of . By standard results on Beta distributions, the product has distribution .
Then for any we have, using the explicit expression of the density of ,
[TABLE]
and the last display in increasing in . We are going to use this inequality for well-chosen sequences , and taking place of the values of . Let us first remark that for any two non-negative sequences and with going to infinity and , we have the following estimate using Stirling’s approximation:
[TABLE]
Now let us apply the above computations for every with to the random variables which have distribution , with and . In particular, in this context we have and , so that the all of the above applies and
[TABLE]
which is what we wanted. ∎
4.3 The distribution of the limiting sequence
Let us stay in the setting of Section 4.2. Suppose that we are working with a sequence of fitnesses that satisfies () for some . The sequence is defined in (9) as some random multiple of the sequence , whose distribution is described in Theorem 1.1 from a sequence of independent random variables with , so that for all ,
[TABLE]
where the random variable is the one that appears in (56), and depends on the whole sequence .
Proposition \thetheorem.
For any sequence a that satisfies the condition (), the sequence is a (possibly time-inhomogeneous) Markov chain such that for all , is independent of . The fact that for all we have with independent of characterises the backward transitions of the chain.
Proof.
We follow the same steps as [21, Lemma 1.1]. Recall the definition of the random variable as the limit of the sequence defined in (49), the definition (51) of the constant and their relation to the random variable . We have
[TABLE]
It then follows that we can write, for ,
[TABLE]
which ensures that is independent of . The limit in the last equality exists almost surely thanks to the results of the preceding section.
Now we prove the Markov property of the chain. Let . Because of the definition of the chain as a product, the distribution of conditional on the past trajectory is the same as the distribution of conditional on . Since and that and are both independent of , this conditional distribution corresponds to the one of conditional on the present state of the chain . ∎
Computing the moments.
In some cases where the sequence is sufficiently regular, we can compute explicitly every moment of the random variable for every . Indeed, using (52) and (57) and the independence, we get
[TABLE]
In general, if the collection of -th moments of some random variable satisfies the so-called Carleman’s condition: , then its distribution is uniquely determined from those moments.
5 Examples and applications
In this section, we compute the explicit distribution of for some particular sequences . We then describe some applications of our results to a model of Pólya urn with immigration and then to a model of preferential attachment graphs.
5.1 The limit chain for particular sequences
As stated in the preceding section, we can compute the distribution of for some fixed by the expression of its moments (4.3), provided that they satisfy Carleman’s condition. Knowing these distributions and the backward transitions given in Proposition 4.3 then characterises the law of the whole process. For two particular examples, this law has a nice expression.
Proposition \thetheorem.
In the two following cases, the distribution of the chain is explicit.
- (i)
If is of the form with and , then the limiting sequence is a Mittag-Leffler Markov chain . 2. (ii)
If is of the form , periodic of period starting from the second term with and integers with at least one being non-zero, then, letting , the sequence has the distribution of an Intertwined Product of Generalised Gamma Processes with parameters , which we denote .
Note that the two cases (i) and (ii) are not mutually exclusive. We will prove the two points of this proposition in separate subsections. The proper definitions of the distributions to which we refer in the statement are given along the proof.
5.1.1 Mittag-Leffler Markov chains
Let us study the case where the underlying preferential attachment tree has a sequence of fitnesses that are of the form . We start by recalling the definitions of Mittag-Leffler distributions and Mittag-Leffler Markov chains and introduced in [21], and also studied in [25].
Mittag-Leffler distributions.
Let and . The generalized Mittag-Leffler distribution has th moment
[TABLE]
and the collection of -th moments for uniquely characterizes this distribution thanks to Carleman’s criterion.
Mittag-Leffler Markov Chains.
For any and , we introduce the (a priori) inhomogenous Markov chain , the distribution of which we call the Mittag-Leffler Markov chain of parameters , or . This type of Markov chain was already defined in [21], for some choice of parameters and . It is a Markov chain such that for any ,
[TABLE]
and the transition probabilities are characterised by the following equality in law:
[TABLE]
with , independent of . These chains are constructed (for some values of depending on ) in [21]. In fact, our proof of Proposition 5.1(i) ensures that these chains exist for any choice of parameters and . Let us mention that the proof of [21, Lemma 1.1] is still valid for the whole range of parameters and , which proves that these Markov chains are in fact time-homogeneous. We provide, in a later paragraph, another proof of this time-homogeneity using an argument that relies on preferential attachment trees.
The limiting Markov chain is a Mittag-Leffler.
Recall the definition of the sequence and their respective distributions . From our assumption that we have for all ,
[TABLE]
Proof of Proposition 5.1 (i).
For , we can make the following computation, using (50), one change of indices and several times the property of the Gamma function that for any we have :
[TABLE]
Using Stirling formula, we can then compute the numbers introduced in (51),
[TABLE]
Using (4.3), the moments of are given, for any by the formula:
[TABLE]
These moments identify using (59) the distribution of for all ,
[TABLE]
From this, and the form of the backward transitions, we can identify as having a distribution . ∎
Time-homogeneity of MLMC.
Let us keep the notation from the previous paragraph with a sequence and let us show the time-homogeneity of the corresponding Mittag-Leffler Markov chain using its connection with preferential attachment trees.
For any , consider the sequence and in such a way that, using Theorem 1.1,
[TABLE]
By choosing appropriately, we can make have the distribution of any of the couples for . Thus, in order to prove the time-homogeneity of the transitions, it suffices to prove that the conditional distribution of with respect to does not depend on .
Recall from Section 1.2.2 in the introduction that we see as an increasing sequence of plane trees, defined as subsets of . Also recall that for any , we denote the subtree descending from . At every time , we can consider the sequence , which counts the number of vertices in the subtrees descending from the children of in , in order of creation (completed by an sequence of zeros). We can check that this sequence evolves as grows with the same distribution as the number of customers seating at different tables in a Chinese Restaurant Process with seating plan , see [41, Section 3.2] for a definition.
Then, conditionally on the evolution of this sequence, every time that a vertex is added to one of those subtrees, it is attached to any vertex already present in the subtree with probability proportional to its out-degree plus (and in particular this does not depend on the value of ).
Thanks to [41, Corollary 3.9], two Chinese Restaurant Processes with respective seating plan and with have a density with respect to each other and this density is a function of the scaling limit of the number of tables created in the process, which corresponds in our case to .
These observations allow us to conclude that the distribution of for any has a positive density with respect to , and this density is a function of . From here, it is clear that conditionally on , the distribution of the quantity does not depend on , which concludes the argument.
5.1.2 Products of generalised Gamma.
The following paragraphs aim at proving Proposition 5.1(ii). In the first paragraph and second paragraph we define the families of distributions of and -processes. Some special cases of these processes already appeared in [38, 36]. In the third one we prove that the distribution of belongs to this family whenever the sequence is of the form assumed in Proposition 5.1(ii).
Construction of a -process.
For real numbers, let be a family of independent variables with the following distribution:
[TABLE]
where, for any , the distribution has density with respect to the Lebesgue measure. Then for all we define as,
[TABLE]
We say that the process has the distribution of a Generalised Gamma process with parameters which we denote .
Let us note that,using standard distributional equalities with Gamma and Beta distributions, for every , we have and
[TABLE]
and is independent of . In fact, we can further show that are jointly independent with the corresponding distribution and that this characterizes the finite dimensional marginals of this process.
Remark \thetheorem.
For , the process has exactly the distribution of the points of a Poisson process on with intensity , listed in increasing order.
Intertwined Products of -processes.
Let and be positive integers with at least one being non-zero. We let for all , with the convention that . We also let . Then we define the set
[TABLE]
Start with independent processes indexed by such that for all ,
[TABLE]
Now is defined in such a way that for all and we have
[TABLE]
The process defined above is said to have distribution of an Intertwined Product of Generalized Gamma Processes with parameters , denoted . Its finite dimensional marginals can be obtained in the same way as it was done in the preceding paragraph for Generalized Gamma processes.
Identification of the limiting chain.
Fix and some integers (where at least one is non-zero) and suppose that the sequence has the following form,
[TABLE]
meaning that the sequence is periodic with period starting from the second term, with .
For any and we have
[TABLE]
for the as defined in Theorem 1.1. For any , we use the moments (50) of a Beta random variable and a telescoping argument to write
[TABLE]
Using the last display, we get that for any ,
[TABLE]
Using Stirling’s approximation we get
[TABLE]
Hence, recalling the definition of in (51), we get
[TABLE]
Then using (4.3) with ,
[TABLE]
Using the last display and the fact that random variable with distribution has -th moment equal to , we can identify the distribution of the one-dimensional marginals for any with the ones of the process described in (63). The identification of the distribution of the process as is then obtained by comparing their finite dimensional distribution which are characterized by Proposition 4.3 and, respectively, (63) together with the discussion below (62).
Sparse sequences.
Let us treat a particular example of parameters for which the distribution has a simpler description than the general case. Suppose that only one of the parameters is non-zero, say for example. Keeping the notation introduced above, the corresponding set contains only elements . Following the definition (63), the process with distribution is constant on every interval for any integer and , the process is just given by a product of independent -processes
[TABLE]
where for all , the process has distribution .
In the particular case where and , the picture is even simpler because the last display becomes a product over only one term. We can check using Remark 5.1.2 that the process has then exactly the distribution of the points of a Poisson process on with intensity , listed in increasing order, which was already noted in [36, Remark 2].
5.2 Application to Pólya urns with immigration
Define the following generalisation of Pólya’s urn, which depends on a sequence of numbers : start at time with an urn containing red balls. At every time , we sample a ball uniformly at random from the urn, return it to the urn with additional ball of the same colour, plus an immigration of additional white balls. The outcome of the first step being deterministic, it is equivalent to consider that we start at time with red balls and white balls in the urn, so that we allow ourselves to consider any (possibly negative) value . This model was studied in the sequence of paper [37, 38, 36] in specific cases of periodic immigration and also studied in [2] with a larger class of periodic immigration.
Denote the number of red balls in the urn at time and let us state a scaling limit result for when . We also identify the speed of convergence and the Gaussian fluctuations around the limit, provided that the immigration is sufficiently regular.
Recall from the introduction the assumption () defined for a real number . We introduce the following more precise assumption of the same type, for any and .
[TABLE]
Remark that for any , this assumption is satisfied for periodic sequences , and almost surely satisfied by sequences of i.i.d. non-negative random variables with a second moment.
Proposition \thetheorem.
Assume that the sequence satisfies () for some . Then for ,
- (i)
we have the following almost sure convergence,
[TABLE]
where has the same law as , defined in (9). 2. (ii)
If then we have
[TABLE]
Remark \thetheorem.
If the sequence has one of the particular forms treated in Proposition 5.1 of the previous section, we can identify the distribution of the limiting random variable as being Mittag-Leffler or a product of independent generalised Gamma random variables. This gives us an alternative proof for the similar statement [1, Theorem 3.8].
Proof.
Let be a sequence of trees with distribution and let . With this definition, the sequence has exactly the same distribution as the number of red balls in a Pólya urn with immigration with immigration sequence .
If the sequence satisfies our assumption () for some then using (10) we can write the following almost sure convergence
[TABLE]
where the sequence is defined in (9), so this proves (i).
Let us turn to the proof of (ii). We will prove this convergence in two steps, by first proving some corresponding result for the degree of the first vertex in a , and then using Theorem 1.1 and Proposition 1.1 to transfer the result to the corresponding distribution. Indeed, let be a sequence of trees with distribution with a sequence satisfying the following assumption
[TABLE]
for some . In this context, recalling (19), the degree of the first vertex can be written as
[TABLE]
Now, using our assumption on the sequence we get , so that
[TABLE]
Rearranging the terms, we get
[TABLE]
and using the Lindeberg-Feller theorem (see [17, Theorem 3.4.5] for example), we get that the latter expression converges in distribution when to a Gaussian distribution . Recalling that a.s. as , we can also write using Slutsky’s lemma
[TABLE]
Now let us transfer this result to the case of preferential attachment trees. For this, it suffices to prove that satisfies the condition () with then the corresponding sequence defined (9) almost surely satisfies (64) for . From Proposition 1.1 and the definition of as a scaled version of , we know that we have almost surely, for and some . Going along the proof of Proposition 1.1, we get from (4.2) that
[TABLE]
for any , so that (64) is almost surely satisfied by if
[TABLE]
Now, thanks to Theorem 1.1, conditionally on the sequence the distribution of is . Applying (65) in this case finishes to prove (ii).
∎
5.3 Applications to some other models of preferential attachment
Let us present here another model of preferential attachment which appears in the literature, for example in [36]. This model does not produce a tree as ours does, but we can couple them in such a way that some of their features coincide. We only focus on one particular model of graph here but the method presented here can adapt to other similar models.
A model of -preferential attachment
Let be a non-empty graph, with vertex-set which have degrees , and an integer and a real number such that for all . The model is then the following: we let . Then, at any time , the graph is constructed from the graph by:
- •
adding a new vertex labelled with outgoing edges,
- •
choosing sequentially to which other vertex each of these edges are pointed, each vertex being chosen with probability proportional to plus its degree (the degree of the vertices are updated after each edge-creation).
The degree of a vertex in a graph refers in this section to the number of edges incident to it. Here the growth procedure in fact produces multigraphs, in which it is possible for two vertices to be connected to each other by more than one edge. In this case, all those edges contribute in the count of their degree.
We can couple this model to a preferential attachment tree with sequence of fitnesses defined as:
[TABLE]
where .
Indeed, we can construct with distribution . Then, for any , consider the tree and for all , merge together each vertex with fitness together with the vertices with fitness [math] that arrived just before it. If only contains one vertex, it is immediate that the obtained sequence of graphs has exactly the same distribution as . For general seed graphs , we can still use the same construction and the obtained sequence of graphs has the same evolution as some sequence which would be obtained from by merging all the vertices into a unique vertex .
Note that a similar construction would also be possible if the degrees of the vertices were given by a sequence of integers instead of all being equal to some constant value . This is for example the case in the model studied in [14], where the degrees are random.
We have the following convergence for degrees of vertices in the graph, as .
Proposition \thetheorem.
The following convergence holds almost surely in any with :
[TABLE]
where
[TABLE]
and the process is independent of .
Furthermore, whenever with or then the distribution of is explicit and given by:
- •
if with , then
[TABLE]
- •
if , then
[TABLE]
This result strengthens the one of [36, Theorem 1, Theorem 2 and Proposition 1] which corresponds (up to some definition convention) to the case . We emphasize that the convergence here is almost sure in an space.
Proof of Proposition 5.3.
Using the coupling argument, we know that we can construct jointly the sequence of graphs and a sequence of trees with fitness sequence
[TABLE]
in such a way that for every , the sequence
[TABLE]
coincides with
[TABLE]
Using this connection and Theorem 1.1, Proposition 1.1 and Proposition 2.1 we get
[TABLE]
almost surely in for all , for some random sequence . Note that the time-change between and is responsible for an extra factor in the scaling, so that the sequence has the distribution of . In the case or , Proposition 5.1 identifies the distribution of the limiting sequence.
Last, the convergence of just follows from the classical result of convergence for the proportion of balls in a Pólya urn. ∎
Appendix A Technical proofs and results
This appendix contains the proofs of technical results that are used throughout this paper. Let start by stating a useful conditional version of the Borel-Cantelli lemma.
Lemma \thetheorem.
Let be a filtration and let be a sequence of events adapted to this filtration. For all , let . We have
[TABLE]
and also
[TABLE]
Proof.
The first convergence is the content of Theorem 5.4.11 and the second one is an application of Theorem 5.4.9 to the martingale , both taken from [17]. ∎
The following lemma is a rewriting of [6, Lemma 1]. We provide the proof for completeness.
Lemma \thetheorem.
Let be a complex-valued martingale with finite -th moment for some . Then for every we have
[TABLE]
Proof.
Let and let be a random variable such that conditionally on the random variable is independent of, and has the same distribution as . Then
[TABLE]
where the first equality comes from the fact that . The first inequality is the one of Jensen for conditional expectation, applied to the convex function . The second inequality is due to Clarkson, see [51, Lemma 1], and can be applied because the distribution of conditional on is symmetric and . The last inequality comes from the triangle inequality for the -norm. ∎
Let us state another result about martingales, which we use numerous times throughout the paper. Recall our uniform big- and small- notation, introduced in (30).
Lemma \thetheorem.
Suppose that is a sequence of analytic functions on some open set , adapted to some filtration . Suppose that for every , the sequence is a martingale with respect to the filtration . If there exists a parameters and continuous functions and such that for all we have
[TABLE]
then for any compact subset , there exists such that
- (i)
if on we have almost surely and also in expectation, 2. (ii)
if on , the almost sure limit exists for and we have almost surely and also in expectation.
Proof of Lemma A.
First, without loss of generality, we can consider that the term is identically equal to [math], otherwise we just replace the function by . Second, by compactness, it is sufficient to prove the result for a small disk around each . Since is an open set, let be such that , where is the closed disk in the complex plane with centre and radius . We denote
[TABLE]
and choose small enough so that . Then if we let such that , we have for any and , using the Cauchy formula
[TABLE]
Now,
[TABLE]
Using sequentially Jensen’s inequality and Doob’s maximal inequality in , gives us for every :
[TABLE]
So using (A), Fubini’s theorem and (A), we get
[TABLE]
Now let us treat the two cases and separately. Remark that the quantity is negative when , but can be of any sign in the case .
For and , we let
[TABLE]
Using (A), we have
[TABLE]
Thanks to our assumptions, the number is positive. Using Markov’s inequality and the last display yields
[TABLE]
which is summable, so the Borel-Cantelli lemma ensures that almost surely as . Now for any , there is a unique integer such that , namely , and we write
[TABLE]
which proves point (i), because almost surely as .
For , the reasoning is similar so we use the same notation for slightly different quantities. For any integer we let
[TABLE]
Then, thanks to (A), we have
[TABLE]
and thanks to our assumption the number is positive. Using the same arguments as in the case we have almost surely as and taking yields
[TABLE]
Again we almost surely have as . This ensures that the sequence of functions is almost surely a Cauchy sequence for the uniform convergence on the disc (so that its limit is well-defined on the disk) and that (ii) is satisfied. ∎
Finally, let us give a proof of Lemma 3.1.
Proof of Lemma 3.1.
From the assumption, we know that there exists such that as . Without loss of generality, we can assume that . Then it is immediate that . Then
[TABLE]
and the first point follows by summing over intervals of the type .
Now write
[TABLE]
Since as , we get
[TABLE]
Putting everything together, we get
[TABLE]
Last, just remark that which finishes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Cyril Banderier, Philippe Marchal and Michael Wallner “Periodic Pólya urns and an application to Young tableaux” In 29th International Conference on Probabilistic, Combinatorial and Asymptotic Methods for the Analysis of Algorithms 110 , LIP Ics. Leibniz Int. Proc. Inform. Schloss Dagstuhl. Leibniz-Zent. Inform., Wadern, 2018, pp. Art. No. 11, 13
- 2[2] Cyril Banderier, Philippe Marchal and Michael Wallner “Periodic Pólya Urns, the Density Method, and Asymptotics of Young Tableaux”, 2019 ar Xiv: 1912.01035 [math.PR]
- 3[3] Albert-László Barabási and Réka Albert “Emergence of Scaling in Random Networks” In Science 286.5439 , 1999, pp. 509–512 DOI: 10.1126/science.286.5439.509 · doi ↗
- 4[4] Noam Berger, Christian Borgs, Jennifer T. Chayes and Amin Saberi “Asymptotic behavior and distributional limits of preferential attachment graphs” In The Annals of Probability 42.1 , 2014, pp. 1–40 DOI: 10.1214/12-AOP 755 · doi ↗
- 5[5] Shankar Bhamidi “Universal techniques to analyze preferential attachment trees: Global and local analysis” In Preprint available at http://www. unc. edu/ bhamidi , 2007
- 6[6] John D. Biggins “Uniform Convergence of Martingales in the Branching Random Walk” In The Annals of Probability 20.1 , 1992, pp. 137–151 DOI: 10.1214/aop/1176989921 · doi ↗
- 7[7] Patrick Billingsley “Convergence of probability measures”, Wiley series in probability and statistics. Probability and statistics section New York: Wiley, 1999
- 8[8] Benjamin Bloem-Reddy and Peter Orbanz “Preferential Attachment and Vertex Arrival Times”, 2017 ar Xiv: 1710.02159 [math.PR]
