Large deviation theorem for branches of the random binary tree in the Horton-Strahler analysis
Ken Yamamoto

TL;DR
This paper establishes a large deviation theorem for the distribution of branch orders in random binary trees using Horton-Strahler analysis, providing asymptotic rate functions for deviations.
Contribution
It introduces a large deviation theorem specific to branch order counts in random binary trees, with asymptotic analysis of the rate functions.
Findings
Large deviation theorem for branch counts in random binary trees
Asymptotic forms of the rate functions are derived
Provides a theoretical foundation for analyzing bifurcation complexity
Abstract
The Horton-Strahler analysis is a graph-theoretic method to measure the bifurcation complexity of branching patterns, by defining a number called the order to each branch. The main result of this paper is a large deviation theorem for the number of branches of each order in a random binary tree. The rate function associated with a large deviation cannot be derived in a closed form; instead, asymptotic forms of the rate function are given.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Large deviation theorem for branches of the random binary tree in the Horton-Strahler analysis
Ken Yamamoto Department of Physics and Earth Sciences, Faculty of Science, University of the Ryukyus, Senbaru, Okinawa, Japan (). [email protected]
Abstract
The Horton-Strahler analysis is a graph-theoretic method to measure the bifurcation complexity of branching patterns, by defining a number called the order to each branch. The main result of this paper is a large deviation theorem for the number of branches of each order in a random binary tree. The rate function associated with a large deviation cannot be derived in a closed form; instead, asymptotic forms of the rate function are given.
keywords:
large deviation, central limit theorem, binary tree
{AMS}
60F10, 05C05, 60F05
1 Introduction
The topological analysis of branching patterns or objects began with hydrological research on river networks. Horton proposed a systematic method to assign a number (called the order) to each stream based on the join of streams [8]. Horton’s law of stream numbers is an empirical relation stating that the number of streams of order decreases geometrically with . Horton’s method partially needs information about spatial configuration of the river network such as stream lengths and junction angles. Strahler refined Horton’s method so that the order is defined by a purely graph-theoretic way [14].
Strahler’s ordering method for a binary tree is composed of the following three rules.
The leaf nodes (degree-one nodes) are defined to have order 1. 2. 2.
A node whose children have different order and () has order . 3. 3.
A node whose two children have the same order has order .
We define a branch of order as a maximal connected path whose constituent node(s) all have the same order . For a binary tree having leaves, let denote the number of its order- branches. Further, based on Strahler’s ordering, Tokunaga [15] established a method, called the Tokunaga indexing, to describe the structure of side-branching. The Horton-Strahler analysis, based on the branch order, has been applied to a wide variety of branching patterns and structures, such as botanical trees [10] and blood vessels [16] in biology, register allocation in computer science [4], cracks in material engineering [5], and complex network analysis [7].
In this paper, we focus on rooted, planar, full binary trees [13]. This class of binary trees appears naturally in modeling a river network. A special node corresponding to the estuary is called the root; each stream has a flow direction towards it. A river network is embedded in the ground surface, so the corresponding tree is planar; formally, a planar binary tree is defined as a rooted binary tree with right and left directions assigned to each pair of children of the same parent. If we set only the junction points as internal nodes, each node has either zero or two children; this type of binary tree is called a full binary tree. We let denote the set of planar full binary trees having leaves. The number of distinct trees in is expressed as
[TABLE]
and this combinatorial quantity is known as the th Catalan number [13]. For example, consists of two binary trees, each of which has three branches of order 1 and one branch of order 2 (see Fig. 1 for reference). A probability space formed by introducing the uniform probability measure on is referred to as the random binary tree model (or random model for short), introduced by Shreve [12]. Note that is a random variable on the random model.
A striking feature common to some stochastic tree models is self-similarity, which means the invariance under the operation of pruning (cutting leaves) [11]. A stochastic tree model is said to be Horton self-similar if it satisfies Horton’s law, and self-similarity involving side-branching structure is called Tokunaga self-similarity. The random model, the Tokunaga model [15], and the random self-similar network [17] are well-known self-similar tree models. For the development and related topics of the self-similarity of random trees, see Kovchegov and Zaliapin [9].
The main subject of this paper is the asymptotic property of . For any function , is a real-valued random variable on the random model. By applying the pruning operation, a recursive relation
[TABLE]
between the averages of two adjoining orders and holds [20], where denotes the average on the random model. In this paper, we mainly study the case where is an exponential function.
Wang and Waymire [18] proved the central limit theorem for :
[TABLE]
where ‘’ denotes convergence in distribution, and is the normal distribution with mean and variance . Recently, Yamamoto [21] obtained two generalized forms of Eq. (2) as
[TABLE]
and
[TABLE]
for each These results are both reduced to Eq. (2) when . Equation (3) implies that converges in probability to as , which is compared with Horton’s law.
It is worth pointing out that the central limit theorem (4) can be derived easily by the pruning operation [1, 9]. Since pruning a binary tree times yields a binary tree having leaves, Eq. (2) immediately implies
[TABLE]
Considering along with this equation, we obtain the central limit theorem (4).
As for , the following large deviation theorem was demonstrated [18].
Theorem 1.1** (Large deviation theorem for ).**
For the random model,
[TABLE]
and
[TABLE]
where the rate function is given by
[TABLE]
For the proof of this theorem, the following general result on large deviation properties is important.
Theorem 1.2** (Cox and Griffeath [2]).**
Let be a sequence of random variables and let
[TABLE]
where is a sequence of positive numbers such that . Assume that on the interval ,
[TABLE]
where is strictly convex and on . If is convex on and , then
[TABLE]
and
[TABLE]
where , , , and is the Legendre transform of . In addition, the central limit theorem
[TABLE]
*holds.
For systematic treatment of large deviation theory, see Ellis [6] and Deuschel and Stroock [3] for example. Wang and Waymire [18] derived
[TABLE]
for any , which leads to the proof of Theorem 5 (with in Theorem 1.2). The rate function given in Eq. (5) is the Legendre transform of . Furthermore, owing to , Theorem 5 directly implies the central limit theorem (2) via Theorem 1.2.
Unlike Eq. (2), central limit theorems (3) and (4) were obtained by the asymptotic properties of the characteristic functions of and [21]. Thus, a natural problem is to establish the large deviation theorems corresponding to Eqs. (3) and (4). In this paper, a large deviation theorem connected to Eq. (3) is formulated and proved.
2 Main result
Lemma 2.1**.**
For and ,
[TABLE]
*where the function is introduced in Eq. (6).
This is a generalized result of Eq. (6). We give the proof of this lemma in the next section.
By Lemma 2.1, we can prove the following large deviation theorem for .
Theorem 2.2** (Large deviation theorem for ).**
For ,
[TABLE]
and
[TABLE]
*where the rate function is the Legendre transform of .
Note that this theorem includes Theorem 5 as a special case of .
Proof 2.3**.**
We can complete the proof by substituting in Lemma 2.1 for in Theorem 1.2. Since the function is strictly increasing, strictly convex and on , its composite also possesses these properties. Hence, for any . By the chain rule and , the derivative of is
[TABLE]
Owing to and , we obtain
[TABLE]
*Therefore, the proof is complete. *
Remark 1**.**
As a consequence of Theorem 2.2, the central limit theorem (3) holds straightforwardly from Lemma 2.1 and Theorem 1.2. By differentiating Eq. (7) again and applying the Leibniz rule, the second derivative of is
[TABLE]
Using , , , and (, we obtain
[TABLE]
*Thus, the central limit theorem (3) is derived. *
Lemma 2.1 and Theorem 2.2 indicate that the order appears in the composition . The author believes that this regularity implies self-similarity of trees from the perspective of large deviation theory.
A large deviation formalism of Eq. (4) is not studied in this paper, and is an open problem.
3 Proof of Lemma 2.1
This section is mainly devoted to the proof of Lemma 2.1.
First, we show Lemma 2.1 for (corresponding to ). This case (6) was already proved by Wang and Waymire [18], but we employ a formula different from theirs. Our method has the major advantage that we can easily extend to .
We need to estimate , which is the moment generating function of . By setting and in Eq. (1),
[TABLE]
This sum can be calculated exactly using the Gauss hypergeometric function [19], but here we perform an asymptotic analysis using a saddle-point method.
Letting () to replace the sum by integral about , and using Stirling’s approximation
[TABLE]
we get
[TABLE]
where
[TABLE]
and ‘’ denotes the asymptotic equality in the sense that the ratio between both hand sides tends to unity as . One can easily confirm that the function takes a maximum value at
[TABLE]
thereby
[TABLE]
Therefore,
[TABLE]
Remark 2**.**
Equation (6) was previously obtained [18] by using a saddle-point method to
[TABLE]
*However, as noted in [18], this procedure needs to treat the two cases where is positive and where is negative separately. Moreover, it seems to be difficult to extend their method to general . In this light, our method, starting with Eq. (1), is advantageous compared to the preceding one. *
Next, we proceed to general by induction on . Assume that
[TABLE]
where the coefficient satisfies
[TABLE]
and we show Eq. (10) for . By Eq. (1) and asymptotic approximation as above, we have
[TABLE]
The saddle-point estimation requires to maximize the same function as in Eq. (9), but in Eq. (9) is replaced by here. We also note that the coefficient does not affect the saddle-point method. Hence, for some coefficient , we have
[TABLE]
so that
[TABLE]
Thus, the statement holds for any .
4 Note on approximate forms of the rate function
Unfortunately, the rate function in Theorem 2.2 cannot be expressed exactly for . By the definition of the Legendre transformation, is given by
[TABLE]
where satisfies
[TABLE]
In short, is the inverse function of . The difficulty for is due to the fact that has a complicated form for and cannot be solved explicitly. Instead of the exact form of , we derive its approximate forms.
According to the general theory of rate functions [6], is convex and takes the minimum value 0 at . Moreover, the derivative of Eq. (11) yields
[TABLE]
Owing to and , a second-order Taylor expansion of around becomes
[TABLE]
In other words, the bottom of the curve of is approximated by a parabola, and this is equivalent to the central limit theorem (3). Differentiating Eq. (13), we have
[TABLE]
Figure 2 shows numerical results of and for , 2, and 3 by the solid curves. We used the Newton-Raphson method to solve . Approximate forms (13) and (14) corresponding to the central limit theorem are shown by dashed curves and lines. The dashed curves are close to the solid ones only in the vicinity of . In what follows, we calculate approximate forms of near (leftmost point) and (rightmost point).
Definition 4.1**.**
For simplicity of notation, we introduce
[TABLE]
*for . *
Proposition 3**.**
* and possess the following properties.*
** 2. 2.
** 3. 3.
**
Proof 4.2**.**
One can easily prove this by using the following relations:
[TABLE]
Proposition 4**.**
By using , the first and second derivatives of are respectively expressed as
[TABLE]
and
[TABLE]
Proof 4.3**.**
By Eq. (7) and Proposition 3, is written as
[TABLE]
*Next, we get Eq. (16) straightforwardly by Eq. (8) and Proposition 3. *
Using the above properties of , let us derive the expansion of .
Theorem 4.4** (Asymptotic forms of around and ).**
Around which is the leftmost point of ,
[TABLE] 2. 2.
Around which is the rightmost point of ,
[TABLE]
Proof 4.5**.**
By Eq. (12),
[TABLE]
In this proof, we use this formula as a differential equation to determine . Since and , we need to expand around , corresponding to and , by means of Proposition 16.
We expand on condition that is sufficiently small. By definition, , and similarly for . Putting into Eq. (15), we immediately obtain
[TABLE]
To estimate the sum in Eq. (16), the term of is which is the leading order, and the others are . Thus, by neglecting the terms other than , we have
[TABLE]
Hence, Eq. (17) becomes
[TABLE]
By integrating from to ,
[TABLE]
and the solution is
[TABLE] 2. 2.
Contrary to the above, tends to infinity as , so we need to expand when is sufficiently large. From the observation
[TABLE]
we reasonably set . The exponent satisfies and , so that
[TABLE]
Noting that
[TABLE]
we have
[TABLE]
In this case, the dominant term of the sum in Eq. (16) corresponds to , so that
[TABLE]
Finally, integrating Eq. (17) from to as above, we get
[TABLE]
By using in Theorem 4.4, we reach asymptotic forms of .
Theorem 4.6** (Asymptotic forms of the rate function around and ).**
Around ,
[TABLE] 2. 2.
Around ,
[TABLE]
Proof 4.7**.**
As in the proof of Proposition 3, we use and
[TABLE]
By integrating from [math] to ,
[TABLE]
The proof is completed by calculating as
[TABLE] 2. 2.
By integrating Eq. (17) from to , we have
[TABLE]
We need to be careful in the calculation of , because both and in Eq. (11) diverge as .
[TABLE]
Remark 5**.**
The same calculation applies to (). Note that is the fixed point of , namely , so we obtain
[TABLE]
*The Taylor expansion (13) of around is reproduced. *
By the exact form of in Eq. (5), is symmetric about . On the other hand, comparing the approximate forms of near and in Theorem 4.6, for is clearly asymmetric.
In Fig. 3, we show numerical results of and along with the asymptotic forms at (dashed curves) and (dot-dashed curves) from Theorems 4.4 and 4.6. (The solid curves are the same as in Fig. 2.)
Acknowledgments
The author is grateful to referees for instructing recent related articles. The idea that Eq. (4) is derived by the pruning operation is suggested by a referee. The present work was partially supported by a University of the Ryukyus Research Project Promotion Grant for Young Researchers (17SP04109), and Hayao Nakayama Foundation for Science & Technology and Culture (H29-B-41).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Burd, G. A., Waymire, E. C. and Winn, R. D. (2000). A self-similar invariance of critical binary Galton-Watson trees. Bernoulli 6, 1–21.
- 2[2] Cox, J. T. and Griffeath, D. (1984). Large deviations for Poisson systems of independent random walks. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 66, 543–558.
- 3[3] Deuschel, J. D. and Stroock, D. W. (1989). Large Deviations , Academic Press, Boston.
- 4[4] Devroye, L. and Kruszewski, P. (1994). A note on the Horton-Strahler number for random trees. Inform. Process. Lett. 52, 155–159.
- 5[5] Djordjevic, Z. V., Li, X. F., Shin, W. S., Wunder, S. L. and Baran, G. R. (1995). Fractal and topological characterization of branching patterns on the fracture surfaces of cross-linked dimethacrylate resins. J. Mater. Sci. 30, 2968–2980.
- 6[6] Ellis, R. S. (2006). Entropy, Large Deviations, and Statistical Mechanics , Springer, Berlin.
- 7[7] Guimerà, R., Danon, L., Díaz-Guilera, A., Giralt, F. and Arenas, A. (2003). Self-similar community structure in a network of human interactions. Phys. Rev. E 68, 065103(R).
- 8[8] Horton, R. E. (1945). Erosional development of streams and their drainage basins; hydrophysical approach to quantitative morphology. Geol. Soc. Am. Bull. 56, 275–370.
