Central limit theorem for the Horton-Strahler bifurcation ratio of general branch order
Ken Yamamoto

TL;DR
This paper proves a central limit theorem for the bifurcation ratio in the Horton-Strahler branching model, extending previous results to general branch orders and providing new mathematical relations.
Contribution
It generalizes the central limit theorem for bifurcation ratios to all branch orders, enhancing understanding of hierarchical branching structures.
Findings
Established the central limit theorem for general branch order bifurcation ratios.
Derived useful relations supporting the main theorems.
Extended previous results from lowest to all branch orders.
Abstract
The Horton-Strahler ordering method, originating in hydrology, formulates the hierarchical structure of branching patterns using a quantity called the bifurcation ratio. The main result of this paper is the central limit theorem for bifurcation ratio of general branch order. This is a generalized form of the central limit theorem for the lowest bifurcation ratio, which was previously proved. Some useful relations are also derived in the proofs of the main theorems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Central limit theorem for the Horton-Strahler bifurcation ratio of general branch order
Ken Yamamoto
Department of Physics and Earth Sciences, Faculty of Science, University of the Ryukyus, 1 Sembaru, Nishihara, Okinawa 903–0213, Japan
Abstract
**Abstract
** The Horton-Strahler ordering method, originating in hydrology, formulates the hierarchical structure of branching patterns using a quantity called the bifurcation ratio. The main result of this paper is the central limit theorem for bifurcation ratio of general branch order. This is a generalized form of the central limit theorem for the lowest bifurcation ratio, which was previously proved. Some useful relations are also derived in the proofs of the main theorems.
1 Introduction
Branching objects are found very widely Ball , ranging from natural patterns like river networks, plants, and dendritic crystals, to conceptual expressions like binary search trees in computer science Knuth and phylogenetic trees in taxonomy Archibald . The topological structure of a branching pattern is modeled by a binary tree if a segment bifurcates (does not trifurcate or more) at every branching point.
Let denote the set of the different binary trees having leaves. The number of leaves is called the magnitude in research of branching patterns. As known well Stanley , the number of the different binary trees of magnitude is given by
[TABLE]
which iscalled the st Catalan number. In Fig. 1, for , and 4 are schematically shown. Introducing the uniform probability measure on (so that each binary tree is assigned equal probability ), we obtain the probability space referred to as the random model Shreve . The formation of real-world branching patterns more or less involves stochastic effects, and the random model is a kind of mathematical simplification of such random factors.
In hydrology, methods for measuring the hierarchical structure of a river network have been proposed by Horton Horton , Strahler Strahler , Shreve Shreve , Tokunaga Tokunaga , and other researchers. Their methods define how to assign an integer number (called the order) to each stream. Among all, Strahler’s method is currently the most popular because of its simple computation rule. Strahler’s method is a refinement of Horton’s method, so it is sometimes called the Horton-Strahler ordering method. The Horton-Strahler method recursively defines the order of each node by the following rules. (i) The leaf nodes are defined to have order one. (ii) A node whose children have different order and () has order . (iii) A node whose two children have the same order has order . We define a branch of order as a maximal connected path made by nodes of equal order . (A branch here is called a stream in the analysis of river networks.) An example of Strahler’s ordering is shown in Fig. 2. For a binary tree , we let denote the number of branches of order in . By the definition of the order, and (). Note that if , because a node of order 2 is produced by the merge of two leaves. For the binary tree in Fig. 2, , , , and for . is a random variable on , and its stochastic property is of main interest in this study.
For any function , is a real-valued random variable on . According to Ref. Yamamoto2010 , the recursive relation between the averages of the th and st variables
[TABLE]
holds, where denotes the average on the random model. The coefficient
[TABLE]
represents the probability . In particular, putting in Eq. (1), we have
[TABLE]
Mathematical properties of have been investigated thoroughly. For instance, the average and variance are respectively given by Werner
[TABLE]
Moreover, from Eq. (2), the moment generating function of is given by
[TABLE]
and this summation can be expressed using the Gauss hypergeometric function Yamamoto2008 :
[TABLE]
The ratio is called the bifurcation ratio of order or simply the th bifurcation ratio. Hydrologists have empirically confirmed that the bifurcation ratios of an actual river network become almost constant for different orders, and this relation is referred to as Horton’s law of stream numbers. By definition, the bifurcation ratio is always smaller than or equal to . When , we reasonably define . The random variable is also called the the bifurcation ratio. The lowest bifurcation ratio is relatively easy to deal with, because it is similar to . The central limit theorem for has been shown by Wang and Waymire Wang :
Theorem 1** (Central limit theorem for the lowest bifurcation ratio).**
On the random model,
[TABLE]
where “” denotes convergence in distribution, and is the normal distribution with mean and variance .
It is a simple and natural idea that we extend Theorem 1 to general order . Compared with , however, higher-order branches for and the bifurcation ratio of order is difficult to handle and less studied. In this paper, we generalize Theorem 1 in two ways (Theorems 2 and 3 in §2), and further generalize them (Theorem 4 in §6). In §3–5, we give proofs of lemmas, which are necessary for the main theorems. In these proofs, Eq. (1) and its variant
[TABLE]
are very useful.
2 Main results
The following two theorems are the main results of the present paper.
Theorem 2** (Central limit theorem for the bifurcation ratio of general order).**
For any order , the th bifurcation ratio satisfies
[TABLE]
Theorem 3** (Central limit theorem for the number of branches of general order).**
For any order , the number of st branches satisfies
[TABLE]
Remark*.*
These two theorems are generalization of Theorem 1 to general order ; they are reduced to Theorem 1 by setting . Theorem 2 states the property of the bifurcation ratio , and Theorem 3 states the property of the number of branches . The limit variance in Theorem 2 becomes large as increases, whereas the limit variance in Theorem 3 becomes small as increases.
From Theorem 2, the following property, which can be regarded as Horton’s law of stream numbers, is easily derived.
Corollary 1** (Horton’s law of stream numbers for the random model).**
For any order , the th bifurcation ratio converges in probability to the common value :
[TABLE]
where “” denotes convergence in probability.
Let us introduce the asymptotic equality, since this study mainly focuses on the asymptotic behavior (the limit ) of .
Definition 1**.**
The average value is asymptotically equivalent to if
[TABLE]
and this is denoted by
[TABLE]
For example, from Eq. (3),
[TABLE]
Theorems 2 and 3 are easily proved by using the following Lemmas 1 and 2, respectively.
Lemma 1**.**
For and ,
[TABLE]
Lemma 2**.**
For and ,
[TABLE]
The odd-power result has a more complicated form than the even-power one.
Proof of Theorem 2.
We let denote the characteristic function of the left-hand side of Eq. (6), where the subscript and superscript respectively correspond to in the denominator and in the numerator in Eq. (6). By definition, is calculated as
[TABLE]
where . At the last equality, we have split the sum into even () and odd (). By Lemma 1, the terms of the first sum (even ) are , whereas the terms of the second sum (odd ) are . Hence, the second sum can be neglected in the limit , so that
[TABLE]
Recall that the characteristic function of is . Since converges pointwise to the characteristic function of , convergence in distribution in Theorem 2 is proved. (For the properties of a characteristic function, see Feller Feller for example.)
Keep in mind that the neglect of the odd-power terms is a crucial point also in the other central limit theorems in this paper. ∎
Proof of Theorem 3.
As with the above proof of Theorem 2, the characteristic function of the left-hand side of Eq. (7) is
[TABLE]
By Lemma 2, the second sum is neglected and the dominant terms are calculated to
[TABLE]
Therefore, the converges in distribution to is proved. ∎
We give the proofs of Lemmas 1 and 2 in the following three sections.
3 Starting point of Lemmas 1 and 2
We show Lemmas 1 and 2 by induction on . In this section, the case of in Lemmas 1 and 2 is proved (Cor. 3).
Proposition 1**.**
For a two-variable polynomial of finite degree,
[TABLE]
Here, and are taken over , whereas are over .
Proof.
Because of the linearity of , it is sufficient to check the case . The average is expressed using the moment generating function in Eq. (4).
[TABLE]
Using the derivative of the Gauss hypergeometric function Abramowitz
[TABLE]
and the symmetry , we have
[TABLE]
thereby
[TABLE]
∎
Here we comment on the utility of Prop. 1. We can calculate the moments recursively using Eq. (8):
[TABLE]
and so on. The first and second moments were individually calculated Werner (see Eq. (3)). Note, however, that Prop. 1 provides a systematic calculation method of the th moment of in a bottom-up way. Furthermore, we easily obtain for using Eq. (8).
Corollary 2**.**
Subtracting from Eq. (8), we have
[TABLE]
Lemma 3**.**
For ,
[TABLE]
That is to say, the asymptotic form of depends on whether is even or odd:
[TABLE]
for .
Remark*.*
In (), the leading terms are canceled because of . Similarly, in general , terms are successively canceled, so that the resultant leading order of becomes . This effect makes the estimation of difficult.
Proof.
The proof is by induction on . The statement is true for , and 2, because
[TABLE]
Assume that it is true up to , and we show it is true for . It follows from Cor. 2 that
[TABLE]
In the following, we separately investigate odd and even .
Case 1: is odd (). The dominant terms are and 1 in the summation, and the others can be neglected. Thus,
[TABLE]
Case 2: is even (). Picking out the terms up to carefully, we obtain
[TABLE]
We introduce the coefficient as
[TABLE]
and expand each term on the right-hand side of Eq. (11) up to using the induction hypothesis:
[TABLE]
Therefore,
[TABLE]
Note that terms and those including are all cancelled.
Therefore, the statement is true for any . ∎
Corollary 3**.**
Multiplying Eq. (10) by , we have
[TABLE]
Remark*.*
This result corresponds to the case in Lemmas 1 and 2.
Using this corollary, we can provide another proof of Theorem 1 as follows. The characteristic function for the lowest bifurcation ratio in Theorem 1 is calculated as
[TABLE]
so the convergence of to is proved.
4 Proof of Lemma 1
We first derive the asymptotic form of , which is needed in the proof of Lemma 1.
Proposition 2**.**
For , we have
[TABLE]
Remark*.*
Since for any , surely takes a finite value and is not divergent.
Proof.
Let us introduce the operator defined by
[TABLE]
where is integrable on any interval . Note that is the inverse of . Owing to the property
[TABLE]
the average of is expressed by
[TABLE]
Let us derive a relation between and as in Prop. 1.
[TABLE]
By using the derivative of the hypergeometric function in Eq. (9),
[TABLE]
whose asymptotic form is
[TABLE]
or
[TABLE]
Considering , we obtain
[TABLE]
∎
Proof of Lemma 1.
By induction on . For , the statement is equivalent to Cor. 3.
Assume that it is true up to , and we show it is true for . Using Eq. (5),
[TABLE]
Case 1: is even ().
[TABLE]
By using Eq. (5) again and Prop. 2,
[TABLE]
Case 2: is odd (). Using Eq. (5) and Prop. 2 as above,
[TABLE]
∎
5 Proof of Lemma 2
In the proof of Lemma 2, we use the following relations.
Lemma 4**.**
For ,
[TABLE]
Remark*.*
The complicated form of the odd-power result in Lemma 2 is actually due to the factor “”.
Proof.
By induction on . For , the statement is equivalent to Lemma 3.
Assume that it is true up to , and we show for . Using Prop. 1,
[TABLE]
Case 1: is even (). In the summation, only is dominant, so that
[TABLE]
By using the induction hypothesis,
[TABLE]
Case 2: is odd (). Note that and 1 in the summation are the dominant terms.
[TABLE]
∎
Proof of Lemma 2.
By induction on . For , the statement is equivalent to Lemma 3.
Assume that it is true up to , and we show it is true for . Using Eq. (5) to calculate
[TABLE]
We split the summation over according to the parity of , and use the induction hypothesis. The estimation of the summation is complex compared with the other proofs above.
Case 1: is even ().
[TABLE]
From Lemma 4, the first summation is , while the second summation is . Thus, we can neglect the second sum, so that
[TABLE]
Case 2: is odd ().
[TABLE]
Both two sums are , so we need to consider them. By using Lemma 4,
[TABLE]
where we have used
[TABLE]
∎
6 Further generalization
Theorems 2 and 3 are further generalized to the central limit theorem as follows.
Theorem 4**.**
For ,
[TABLE]
Remark*.*
This theorem is reduced to Theorem 2 when and to Theorem 3 when ; moreover, it is reduced to Theorem 1 when .
Lemma 5**.**
For and ,
[TABLE]
Remark*.*
In comparison with Cor. 3, the effect of appears in the form of the factor .
Proof.
By induction on . is equivalent to Cor 3.
Assume that the statement is true for , and we show that it is true for . Using Eq. (5) and Prop. 2,
[TABLE]
Similarly,
[TABLE]
∎
Proof of Theorem 4.
By using Lemma 5, the characteristic function of the left-hand side, , can be calculated as with Theorems 2 and 3:
[TABLE]
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) P. Ball, Branches (Oxford University Press, Oxford, 2011).
- 2(2) D. E. Knuth, The Art of Computer Programming, vol. 3 (Addison Wesley, Reading, 1973).
- 3(3) J. D. Archibald, Aristotle’s Ladder, Darwin’s Tree: The Evolution of Visual Metaphors for Biological Order (Columbia University Press, New York, 2014).
- 4(4) R. E. Horton, Geol. Soc. Am. Bull. 56, 275 (1945).
- 5(5) A. N. Strahler, Trans. Am. Geophys. Un. 38, 913 (1957).
- 6(6) R. L. Shreve, J. Geol. 75, 178 (1967).
- 7(7) E. Tokunaga, Geogr. Rep. Tokyo Metrop. Univ. 13, 1 (1987).
- 8(8) R. P. Stanley, Enumerative Combinatorics, vol. 2 (Cambridge University Press, Cambridge, 1999).
