
TL;DR
This paper establishes new probability bounds for the height, width, and size of Galton-Watson trees, showing that certain ratios have sub-exponential or sub-Gaussian tail behaviors, applicable broadly without distribution assumptions.
Contribution
It introduces general probability bounds for Galton-Watson trees' dimensions, adaptable to specific offspring distributions, improving understanding of their typical shapes.
Findings
H/W ratio has sub-exponential tails
H/|T|^{1/2} ratio has sub-Gaussian tails
Bounds are tight and distribution-agnostic
Abstract
This work proves new probability bounds relating to the height, width, and size of Galton-Watson trees. For example, if is any Galton-Watson tree, and , , and are the height, width, and size of , respectively, then has sub-exponential tails and has sub-Gaussian tails. Although our methods apply without any assumptions on the offspring distribution, when information is provided about the distribution the method can be adapted accordingly, and always seems to yield tight bounds.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\AtBeginShipout
Most trees are short and fat
Louigi Addario-Berry
Department of Mathematics and Statistics, McGill University, Montréal, Canada
[email protected] http://www.problab.ca/louigi/
(Date: March 30, 2017)
Abstract.
This work proves new probability bounds relating to the height, width, and size of Galton-Watson trees. For example, if is any Galton-Watson tree, and , , and are the height, width, and size of , respectively, then has sub-exponential tails and has sub-Gaussian tails.
Although our methods apply without any assumptions on the offspring distribution, when information is provided about the distribution the method can be adapted accordingly, and always seems to yield tight bounds.
2010 Mathematics Subject Classification:
60J80,60G50,60E15,05C05
“Il faudrait souhaiter que …les probabilistes de la jeune génération, soit par complaisance pour les méthodes purement analytiques, soit grâce à l’engouement pour la puissance - d’ailleurs parfaitement justifiée - des méthodes liées aux distributions dans les espaces fonctionnels, n’oublient point les méthodes directes. ”
A.N. Kolmogorov, Sur les propriétés des fonctions de concentrations de M. P. Lévy, [7].
1. Introduction
Given a rooted tree , write for the set of nodes of at distance from the root. The width of is ; the height of is . The volume of is its number of nodes, denoted . We write to mean that is a node of . Note that if either or then .
Next, given a probability distribution supported on the the non-negative integers , write if is a Galton-Watson tree with offspring distribution . In other words, each node of gives birth to a random, -distributed number of offspring, and the number of offspring is independent for distinct nodes.
This work presents new probabilistic relations between the height, width, and volume of general Galton-Watson trees. Here is the most general result.
Theorem 1.1**.**
There exists an absolute constant such that the following holds. Fix a probability measure with support , and let . Then for all ,
[TABLE]
In the above theorem we write instead of . We use similar shorthand in several other places. Also, we say is subcritical if , critical if , and supercritical if ; in the third case, the sum may be infinite.
The method we introduce is rather general, in that when additional information is provided about the tail behaviour of , the preceding bounds can be correspondingly strengthened. For example, we have the following two theorems.
Theorem 1.2**.**
There exists an absolute constant such that the following holds. Fix a critical or subcritical probability measure with support and with , and let . Then there exists depending on such that for all
[TABLE]
Moreover, if is critical then we may choose so that for all and all ,
[TABLE]
Theorem 1.2 is related to a result of [1], which proves sub-Gaussian tail bounds for when has finite variance. The results of [1] apply to Galton-Watson trees conditioned to have a fixed size, and in this sense are stronger than Theorem 1.2. However, in [1] the bounds become weaker as the variance increases, whereas the bounds of Theorem 1.2 exhibit the correct dependence on the variance.
Theorem 1.3**.**
There exists an absolute constant such that the following holds. Fix a critical or subcritical probability measure with support , and let . Suppose that there exists and such that for all . Then for all ,
[TABLE]
To understand the requirement that in Theorem 1.3, note that if for some then so is not subcritical. On the other hand, if is in the domain of attraction of an -stable distribution (with ) then both bounds have optimal dependence on ; see [4, 8, 2]. On the other hand, if for some then is again in the domain of attraction of the -stable (Gaussian) distribution, in which case Theorems 1.1 and 1.2 apply.
Theorem 1.1 is tight in general; we explain why shortly. The next theorem shows that when has infinite variance, Theorem 1.1 is not tight for large trees.
Theorem 1.4**.**
Fix a critical or subcritical probability measure with support , and let . If then for any there exists depending on and such that for all and all ,
[TABLE]
The following example limits the degree to which Theorem 1.4 can be strengthened. Suppose that and fix with . Let be the event that , that contains exactly nodes with children and that all other nodes are leaves. , where . When occurs we have and , so with , we obtain
[TABLE]
This implies that the dependence of on in Theorem 1.4 can not be removed, and that the quadratic relation between the allowed values of and is essentially tight.
If then the same example shows that the bounds of Theorem 1.1 are tight. In this case is the event that is a path of length , and with , we have
[TABLE]
The remainder of the introduction is structured as follows. Section 1.1 provides a few basic definitions, and states a well-known identity in law between the breadth-first queue process of a Galton-Watson tree and a corresponding random walk stopped on its first visit to zero. In Section 1.2 we introduce a key bound, used in the proofs of all the above theorems, which shows that the height of a tree is bounded, up to a scaling factor, by the “inverse harmonic average queue length”. We then provide a brief overview of the overall proof strategy and of the structure of the remainder of the paper.
1.1. Preliminaries
For a rooted tree , write for the root of . For , write for the number of children of in , and write for the parent of in , with the convention that . Also, write for the distance from to , and recall that is the set of nodes of at depth .
The Ulam-Harris tree is the infinite tree with root and non-root nodes indexed by finite sequences of positive integers. A node at depth has parent and children , where by convention if . For nodes and of , write if either , or and precedes lexicographically. We refer to as the breadth-first ordering of the nodes of .
Any countable rooted plane tree may be viewed as a subtree of as follows. First identify the root of with the root of . Next, recursively, if is identified with then identify the th child of with , for each . With this identification, the ordering induces a total order of the nodes of , which we call the breadth-first ordering of the nodes of . If is locally finite then we enumerate the nodes of in breadth-first order as ; writing the indices this way allows us to treat the cases and simultaneously.
For , if then let . The quantity is the breadth-first queue length just before is explored in a lexicographic breadth-first search of . It is not hard to see that for all ,
[TABLE]
recall that by convention.
Proposition 1.5**.**
Let be a probability measure on , and define a probability measure on by taking for . Let be a random walk with initial position and jump distribution , and let . If then and are identically distributed.
For a random walk we write for the probability measure under which the random walk has initial position . Our default notation for the steps of is , so that for all .
1.2. Proof overview
The following chain of identities,
[TABLE]
may seem too trivial to be useful, and it is. However, it contains an idea which, after a little massaging, becomes quite powerful, and indeed undergirds the rest of the proof. The following lemma and proposition show that replacing by within the final sum yields an upper bound on , and allows us to bound the height by studying the breadth-first queue process.
Lemma 1.6**.**
For any sequence of positive integers with , it holds that
[TABLE]
Proof.
In what follows, is always assumed to be an integer from the interval ; so, for example, is shorthand for the set .
First note the easy bound
[TABLE]
A slightly more complicated lower bound on the same sum follows from the identity , which implies that
[TABLE]
Finally,
[TABLE]
It follows that for any , the left-hand side of (1.1) is at least
[TABLE]
Since , setting wraps things up. ∎
Proposition 1.7**.**
For any tree , it holds that .
Proof.
“Shifting time by one” makes the numbers work out a little more smoothly. For let ; this is the length of breadth-first queue just after is explored. Since , setting yields
[TABLE]
For let , and let . Now fix an integer . In a breadth-first exploration of , for all , just after is explored the queue consists of nodes from . This implies that
[TABLE]
Next, with , the nodes of in breadth-first order are . Just after is explored the elements of queue form a subset of , so
[TABLE]
Together, these inequalities yield
[TABLE]
and the proposition follows from Lemma 1.6. ∎
Note that the width is also easily bounded using the breadth-first queue. First, as noted in the preceding proof, if then the nodes counted by form a subset of . Next, if is the first node of with respect to the breadth-first order , then . Together, these facts immediately imply that
[TABLE]
When , we have , where is a simple random walk with whose steps satisfy , and where .
For we write
[TABLE]
Proposition 1.7 and the bounds of (1.2) reduce our task to that of understanding the relations between and the quantities and .
At the heart of our argument is a multi-scale decomposition of the random walk path . When takes values at a given scale – when has order about , say – then for as long as the random walk continues to have order about , the partial sums increase at rate about . In particular, it takes steps for the sequence of partial sums to increase by .
To convert this into a good bound requires controlling how much time the random walk spends at each scale. This in turn impels us to track the times the random walk “changes scale” and how many times it may revisit a given scale before visiting the origin, and to produce bounds on how long the random walk might spend at a given scale. For the latter, we make crucial use of universal bounds on the concentration function of a sum of IID random variables. The bounds may be sharpened if further information is provided about the jump distribution, and such sharpenings seem to generally yield optimal bounds for the heights of the corresponding Galton-Watson trees.
In Section 2, we introduce a general bound on the concentration function of a random walk, and deduce several corollaries which hold under specific hypotheses about the step distribution. In Section 3, we formally define the multi-scale decomposition which is at the core of our proofs, and prove results controlling the amount of time spent by a random walk at a fixed scale. In Section 4, these results are applied to prove probabilistic bounds relating and under various assumptions on the step size; from these, the theorems stated in the introduction are straightforwardly derived, in Section 5. Finally, Section 6 contains some questions and references to the literature.
2. Exit times from intervals and the concentration function
Following Kesten [6], for a random variable , and a constant , write
[TABLE]
This function was introduced by Paul Lévy [9, Section 16], who called it the concentration function of . We will use the following bounds.
Theorem 2.1** ([6]).**
There exists an absolute constant such that the following holds. Let be iid copies of a random variable , fix and let . Then for all ,
[TABLE]
*and *
[TABLE]
The next two corollaries are straightforward applications of Theorem 2.1. Recall that for a random walk , we write for the probability measure under which the walk has initial position .
Corollary 2.2**.**
Let be iid copies of a random variable , and let be a random walk with steps . Suppose that . For , let . Then for any ,
[TABLE]
Proof.
Since , we also have . Fixing , we may thus find such that . It follows by Theorem 2.1 that for all with , for any ,
[TABLE]
The result follows. ∎
Corollary 2.3**.**
There exists an absolute constant such that the following holds. Let be iid copies of an integer random variable with , and let be a random walk with steps . Then for all and all ,
[TABLE]
Proof.
Taking for positive integer , we have
[TABLE]
the final inequality since if then . The result now follows from the first bound of Theorem 2.1.
[TABLE]
The result follows. ∎
Lemma 2.4**.**
There exists an absolute constant such that the following holds. Let be iid copies of an integer random variable with and with , and let be a random walk with steps . Fix and let . Then
[TABLE]
Proof.
Write . By the assumptions of the lemma, , so by the second bound in Theorem 2.1, for any and any we have
[TABLE]
If then provided the above bound yields .
Using the result of the preceding paragraph, we may assume that . In this case, since we must have . If then the bound in (2.1) is at most , so whenever , proving the lemma in this case. For the remainder of the proof we thus assume .
Note that if is such that then either or , so if then . For all we thus have
[TABLE]
Suppose . Then we have , so defining , by Chebyshev’s inequality the latter probability is at most
[TABLE]
If then , so this yields
[TABLE]
The random variable has support and mean at most zero, which implies that . The preceding bound then gives
[TABLE]
proving the lemma in this case.
Finally, suppose that . Then applying the conditional Jensen’s inequality gives
[TABLE]
the last inequality since . Corollary 2.3 yields that
[TABLE]
For , we then have , and the lemma follows. ∎
3. Decomposing a random walk into scales
Throughout Sections 3 and 4, fix an integer random variable with and , and let be a random walk with steps distributed as . Let be the first time the random walk visits the non-positive integers, and for let .
3.1. Exit probabilities and upcrossings
We begin with an easy fact about exit probabilities.
Lemma 3.1**.**
Fix integers , and let . Then .
Proof.
It is easily seen that . Since is a submartingale, it follows that . Now note that since for all , when we have either or . Writing , it follows that
[TABLE]
A random walk makes an upcrossing of an interval each time it travels from a location below to a location above . Let . Then, for , let and let . The random walk finishes its ’th upcrossing of at time . Write . The following result states that the number of upcrossings of an interval before the first visit to zero is stochastically dominated by an appropriate geometric random variable.
Proposition 3.2**.**
Fix integers , and let be the number of upcrossings of by by time . Also, let . Then for any positive integers and ,
[TABLE]
Proof.
Write . For , if and only if . We use that for events , it holds that . By the inclusions we thus have
[TABLE]
Next, observe that . For a random walk starting from , we have precisely if and the random walk exits the interval in the positive direction. Using the strong Markov property, and Lemma 3.1 with , and , it follows that
[TABLE]
By induction this yields , from which the proposition is immediate. ∎
3.2. The scale of a random walk
We now describe a collection of stopping times that formalizes the notion of “change of scale” for the random walk. These scales are designed to be overlapping, the utility of which is that when the walk switches from one scale to another, smaller scale, it is reasonably unlikely to ever revisit a larger scale.
In brief, a change of scale occurs when the walk leaves an interval of the form ; just before this happens the random walk was at scale . When the scale changes, the next scale, , is chosen so that the random walk lies between and ; it now must exit the interval to change scale, and so forth.
Definition 3.3**.**
Let and . Next, for , let
[TABLE]
and let .
The sequence captures the times at which the random walk changes scale, and the sequence contains the scales. For , if then so . Also, since the walk only makes negative steps of size , if then . Thus, if and then . Furthermore, . In summary: when the scale changes, it either increases by at least two, or decreases by exactly two, or jumps to , the latter occurring at the hitting time of [math]. The right mental picture is to stop the walk at time . The definitions imply that once , it also holds that and for all . It is handy to also set . We take by convention.
Definition 3.4**.**
Fix . Let , and for let . Then let .
Note that if there is no with . When , the random variable is the ’th time the walk visits scale , and is the number of changes of scale before time . Also, is the number of visits to scale before the walk hits [math].
Proposition 3.5**.**
For all , .
Proof.
First, for all with we have . If then so at time the random walk has just completed an upcrossing of .
If and then ; the walk has hit zero. If and then in fact and . Since , in this case the random walk must complete at least one upcrossing of between time and . (Indeed, it must complete exactly one.)
In either case, between time and the random walk either hits zero or else completes an upcrossing of either or of . The result follows. ∎
3.3. Occupation times of scales
We next study the amount of time the random walk spends at each scale. For , write for the current scale at time : formally, with , let . Then, for let . We write for the total time spent at scale before hitting [math]. Finally, fix , write , and let
[TABLE]
The first lemma of the section says that the time before the random walk leaves scale is stochastically dominated by times a geometric random variable.
Lemma 3.6**.**
For all , with ,
[TABLE]
Proof.
Use the Markov property. ∎
For the coming lemma, let be a sequence of independent Geometric random variables, so for integers . Also, recall that is the ’th time the random walk visits scale .
Lemma 3.7**.**
For any integers and , for all ,
[TABLE]
Proof.
For any , by the strong Markov property and Lemma 3.6, for integer we have
[TABLE]
The sequence is thus stochastically dominated by . Finally, if then , and the lemma follows. ∎
For , let
[TABLE]
and write . Note that for all and .
Theorem 3.8**.**
For integer and real , for all ,
[TABLE]
Proof.
The first inequality is immediate from the fact that . For the second, we write
[TABLE]
we shall bound the first term by and the second by .
First, precisely if the random walk visits scale before hitting zero. When the random walk has scale its position lies in the interval , so Lemma 3.1 gives that .
In bounding the second term, we assume that , as otherwise the required bound is trivial. We use the strong Markov property at time to write
[TABLE]
We bound the latter by to complete the proof.
For the remainder of the argument, the following description of is more useful. Recalling that is the total number of visits to scale before hitting zero, we have
[TABLE]
It thus suffices to bound .
Assume for the moment that is an integer. Fix any positive integers and such that . If then either or . Proposition 3.5 states that , and Proposition 3.2 then implies
[TABLE]
Provided that , a Chernoff bound then gives
[TABLE]
By Lemma 3.7 we obtain that
[TABLE]
Choose ; using that , straightforward arithmetic shows that the sum on the right is then bounded by . This completes the proof when is integer. For general , the same argument yields the bound ; but this is still less than . ∎
The next corollary provides a cleaner probability tail bound for . For a sequence of positive real numbers, let
[TABLE]
Corollary 3.9**.**
Fix a positive integer and positive real numbers . Then for any positive integer ,
[TABLE]
Proof.
By Theorem 3.8, we have that for any ,
[TABLE]
Now fix , , and as in the statement of the corollary. Then
[TABLE]
so , and therefore
[TABLE]
∎
In the next section we use Corollary 3.9 to derive upper bounds for probabilities of the form , under a range of assumptions on the step size distribution. The intuition behind all these results is that if then the dominant contribution to the sum should typically come from the largest scale reached by the walk before hitting zero. In other words, if then, writing , we expect that is typically around .
4. Relating and
Theorem 4.1**.**
There exists an absolute constant such that the following holds. Fix an integer random variable with and , and let be a random walk with step distribution . Then writing , for all ,
[TABLE]
Proof.
Fix and . For let , and write . By Lemma 2.4 we have , so
[TABLE]
for some absolute constant .
It is straightforward to check that for all , . Thus, provided , we also have
[TABLE]
It follows by Corollary 3.9 that if then for any ,
[TABLE]
Taking gives . On the event that , we have and , so this yields
[TABLE]
Moreover, if then so
[TABLE]
Using the two preceding bounds, we obtain that whenever ,
[TABLE]
Taking , the above bound becomes
[TABLE]
from which the theorem follows easily. ∎
Theorem 4.2**.**
There exists an absolute constant such that the following holds. Fix an integer random variable with and with and , and let be a random walk with step distribution . Then there exists depending only on the law of such that for all and all ,
[TABLE]
Proof.
Observe that
[TABLE]
By monotone convergence, we may thus choose such that
[TABLE]
for all . The first bound of Theorem 2.1 yields that for all we have . Writing , it follows that for all , if then
[TABLE]
We thus have for all . For smaller we use the bound of Lemma 2.4.
Now fix and , and for let . Then with , in the notation of Corollary 3.9 we have
[TABLE]
Provided , the final quantity is at most . Moreover, by reprising the bound on from Theorem 4.1, we obtain that . Letting , it follows by Corollary 3.9 that for , for all we have
[TABLE]
Next, fix , and for let . Then for any we may write
[TABLE]
To bound the final summands, take and , and in (4.1). Then and
[TABLE]
and , so (4.1) implies that
[TABLE]
and summing over yields that
[TABLE]
For any there is such that . For this value of we also have , so using the preceding bound we obtain
[TABLE]
Finally, if then , so
[TABLE]
Theorem 4.3**.**
There exists an absolute constant such that the following holds. Fix and write . Then for any real we have
[TABLE]
Proof.
Fix and an integer . For let , and write . By assumption we have , so
[TABLE]
where depends only on . It is straightforward to check that for if then . Thus, provided , we also have
[TABLE]
It follows from Corollary 3.9 that for any integer we have
[TABLE]
Taking gives . On the event that we have and , so taking the above bound yields
[TABLE]
Moreover, if then . We always have , and we may assume , so it follows that
[TABLE]
Using the two preceding bounds, we obtain that whenever we have
[TABLE]
from which the result follows easily. ∎
Proposition 4.4**.**
There exists an absolute constant such that the following holds. Fix an integer random variable with , , and . Let be a random walk with step distribution . Then for all there exists such that for any integers and , and any real ,
[TABLE]
Proof.
In this proof we write and . Fix small, and let be large enough that , where is the constant from Corollary 2.3. Then for all we have , so for all , if then
[TABLE]
It follows that for all . For smaller we use the bound of Lemma 2.4.
Now fix and , and for let . Then with , in the notation of Corollary 3.9 we have
[TABLE]
where and are absolute constants. Provided , the final quantity is at most . Moreover, by reprising the bound on from Theorem 4.1, we obtain that .
Letting , it follows by Corollary 3.9 that for , for all we have
[TABLE]
Taking small enough that , say, the result then follows. ∎
Corollary 4.5**.**
Under the conditions of Proposition 4.4, for any there exists such that for all and all ,
[TABLE]
Proof.
Fix fix , let , and let be as in Proposition 4.4. Now fix . We consider the cases and separately.
First suppose . For let . For any and we may write
[TABLE]
Taking and , we have ; setting , we also obtain . Proposition 4.4 then implies that
[TABLE]
Now take , sum over , and use that ; this yields that
[TABLE]
For any there is such that . For this value of we have , so the using preceding bound we obtain
[TABLE]
To conclude, recall that , so . Since we have , and the result follows in this case.
Now suppose . Since , by Theorem 4.1 of [3] we have . We may thus choose large enough that for all , . By the cycle lemma, for all we have
[TABLE]
which for yields
[TABLE]
For we have , so for sufficiently large the preceding bound is at most . ∎
5. Proofs of the main theorems
Proof of Theorem 1.1.
First, note that if is supercritical then conditionally given that , almost surely . On the other hand, writing for the measure with , where , then given that , the conditional law of is , and is subcritical or critical. Since the bounds of the theorem only depend on through , and , it thus suffices to prove the theorem for critcal and subcritical trees.
In light of the preceding paragraph, the second bound is now immediate from Propositions 1.5 and 1.7 and Theorem 4.1. For the first, we use that for any tree , . We then have
[TABLE]
which yields the first bound. ∎
Proof of Theorem 1.2.
Propositions 1.5 and 1.7 and Theorem 4.2 together imply that there exists depending only on such that for all and all ,
[TABLE]
where is a universal constant. For such we then have
[TABLE]
This proves the second probability bound, and the first bound follows by the same argument used in proving Theorem 1.1.
Now suppose is critical, and let be the breadth-first queue process of . By Proposition 1.5, this process has the same law as , where is a random walk with and jump distribution defined by , and . Since is critical, is centered, so by the local central limit theorem there is such that for all , . By the cycle lemma, we then have
[TABLE]
Combined with the above probability bound, we obtain that for ,
[TABLE]
for some absolute constant . For we have , so
[TABLE]
For sufficiently large we have
[TABLE]
so the final bound follows. ∎
Proof of Theorem 1.3.
The second bound is immediate from Propositions 1.5 and 1.7 and Theorem 4.3. For the first, arguing as in Theorem 1.1, we have
[TABLE]
which is the first bound. ∎
Proof of Theorem 1.4.
The second bound is immediate from Propositions 1.5 and 1.7 and Corollary 4.5. We deduce the first as in the other theorems: since ,
[TABLE]
from which the first bound follows. ∎
6. Conclusion
This section provides a few pointers to related work, and suggests open questions related to the results presented above, as well as some potential strengthenings of said results.
- (1)
Write for the law of a Galton-Watson tree with offspring distribution conditioned to have size exactly . A natural question is whether the above theorems can be shown to hold for . In any case where this is possible, it yields a stronger result, as the corresponding theorem for can then be obtained by a suitable averaging over .
- (a)
For Theorem 1.1, such a generalization is false; if then is almost surely a path of length . An extension to conditioned Galton-Watson trees may still be possible, but its bounds must include some dependence on both and .
- (b)
As mentioned in the introduction, when is critical with finite variance, sub-Gaussian tail bounds for were proved in [1]. It should be possible to strengthen the bounds of [1] to exhibit the same dependence on the variance as in Theorem 1.2; such a strengthening would then yield Theorem 1.2 as a corollary.
- (c)
The work [8] provides a version of Theorem 1.3 which applies to conditioned Galton-Watson trees. However, this result requires that is in the domain of attraction of a stable law; it should be possible to weaken this requirement, insisting only on upper bounds for the tail probabilities of the offspring distribution.
- (d)
As for an analogue of Theorem 1.4, it should hold that for all there exists such that for ,
[TABLE]
These bounds would imply that in probability and that in probability; both limits were conjectured by Janson (see [5], Conjectures 21.5 and 21.6)). 2. (2)
An intuition which motivated the development of this paper is that for Galton-Watson trees we expect that is typically of order . This is true, for example, for conditioned Galton-Watson trees in the domain of attraction of a stable tree. However, in general this need not hold. For example, suppose that , and . Taking , the tree typically contains exactly one node of degree ; it has width of order and height of order . Are there examples where of offspring distributions for which is much larger than with non-vanishing probability? 3. (3)
More generally, the range of possible joint behavior of and for supercritical conditioned Galton-Watson trees is unclear, and deserves investigation.
It is not impossible that an extension of the techniques of the current paper could be used to tackle some of the above questions. Our approach essentially requires bounds on the amount of time the random walk associated to the breadth-first queue process spends at small scales. Thus, implementing this for conditioned Galton-Watson trees would require, in particular, universal bounds on how much time a random walk conditioned to first visit [math] at time is likely to spend at a given scale.
Acknowledgements
Thank you, Igor Kortchemski and Yuval Peres, for useful discussions during the preparation of this paper.
This research was funded in part by an NSERC Discovery Grant. Part of the research was carried out at the Isaac Newton Institute for Mathematical Sciences during the programme “Random Geometry”, supported by EPSRC Grant Number EP/K032208/1. My interest in this problem was sparked during a workshop at McGill’s Bellairs research institute in Holetown, Barbados. Thanks are due to all the above institutions and agencies for their support.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Addario-Berry et al. [2013] Louigi Addario-Berry, Luc Devroye, and Svante Janson. Sub-Gaussian tail bounds for the width and height of conditioned Galton–Watson trees. Ann. Probab. , 41(2):1072–1087, 03 2013. doi: 10.1214/12-AOP 758 . URL http://dx.doi.org/10.1214/12-AOP 758 . · doi ↗
- 2Duquesne and Wang [2017+] T. Duquesne and M. Wang. Decomposition of Lévy trees along their diameter. Ann. Inst. H. Poincaré Probab. Statist, to appear, 2017+. URL https://arxiv.org/abs/1503.05069 .
- 3Esseen [1968] C. G. Esseen. On the concentration function of a sum of independent random variables. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete , 9:290–308, 1968. doi: 10.1007/BF 00531753 . URL http://dx.doi.org/10.1007/BF 00531753 . · doi ↗
- 4Haas [2003] Bénédicte Haas. Loss of mass in deterministic and random fragmentations. Stochastic Process. Appl. , 106(2):245–277, 2003. ISSN 0304-4149. doi: 10.1016/S 0304-4149(03)00045-0 .
- 5Janson [2012] Svante Janson. Simply generated trees, conditioned Galton-Watson trees, random allocations and condensation. Probab. Surv. , 9:103–252, 2012. ISSN 1549-5787. doi: 10.1214/11-PS 188 . URL http://dx.doi.org/10.1214/11-PS 188 . · doi ↗
- 6Kesten [1972] Harry Kesten. Sums of independent random variables—without moment conditions. Ann. Math. Statist. , 43:701–732, 1972. ISSN 0003-4851. doi: 10.1214/aoms/1177692541 . URL http://dx.doi.org/10.1214/aoms/1177692541 . · doi ↗
- 7Kolmogorov [1958] A.N. Kolmogorov. Sur les propriétés des fonctions de concentration de M.P. Lévy. Ann. Inst. H. Poincaré , 16(1):27–34, 1958. URL http://www.numdam.org/article/AIHP_1958__16_1_27_0.pdf .
- 8Kortchemski [2017+] I. Kortchemski. Sub-exponential tail bounds for conditioned stable Bienaymé–Galton–Watson trees. Probab. Theory Relat. Fields, to appear, 2017+. URL https://arxiv.org/abs/1504.04358 .
