Large deviation theorem for branches of the random binary tree in the   Horton-Strahler analysis

Ken Yamamoto

arXiv:1907.13346·math.PR·April 2, 2020·SIAM J. Discret. Math.

Large deviation theorem for branches of the random binary tree in the Horton-Strahler analysis

Ken Yamamoto

PDF

TL;DR

This paper establishes a large deviation theorem for the distribution of branch orders in random binary trees using Horton-Strahler analysis, providing asymptotic rate functions for deviations.

Contribution

It introduces a large deviation theorem specific to branch order counts in random binary trees, with asymptotic analysis of the rate functions.

Findings

01

Large deviation theorem for branch counts in random binary trees

02

Asymptotic forms of the rate functions are derived

03

Provides a theoretical foundation for analyzing bifurcation complexity

Abstract

The Horton-Strahler analysis is a graph-theoretic method to measure the bifurcation complexity of branching patterns, by defining a number called the order to each branch. The main result of this paper is a large deviation theorem for the number of branches of each order in a random binary tree. The rate function associated with a large deviation cannot be derived in a closed form; instead, asymptotic forms of the rate function are given.

Equations149

∣ Ω_{n} ∣ = \frac{( 2 n - 2 )!}{n ! ( n - 1 )!},

∣ Ω_{n} ∣ = \frac{( 2 n - 2 )!}{n ! ( n - 1 )!},

E [f (S_{r + 1, n})] = \frac{n ! ( n - 1 )! ( n - 2 )!}{( 2 n - 2 )!} m = 1 \sum ⌊ n /2 ⌋ \frac{2 ^{n - 2 m}}{( n - 2 m )! m ! ( m - 1 )!} E [f (S_{r, m})],

E [f (S_{r + 1, n})] = \frac{n ! ( n - 1 )! ( n - 2 )!}{( 2 n - 2 )!} m = 1 \sum ⌊ n /2 ⌋ \frac{2 ^{n - 2 m}}{( n - 2 m )! m ! ( m - 1 )!} E [f (S_{r, m})],

n (\frac{S _{2, n}}{n} - \frac{1}{4}) D N (0, \frac{1}{16}), n \to \infty,

n (\frac{S _{2, n}}{n} - \frac{1}{4}) D N (0, \frac{1}{16}), n \to \infty,

n (\frac{S _{r + 1, n}}{n} - \frac{1}{4 ^{r}}) D N (0, \frac{4 ^{r} - 1}{3 \cdot 1 6 ^{r}}), n \to \infty

n (\frac{S _{r + 1, n}}{n} - \frac{1}{4 ^{r}}) D N (0, \frac{4 ^{r} - 1}{3 \cdot 1 6 ^{r}}), n \to \infty

n (\frac{S _{r + 1, n}}{S _{r, n}} - \frac{1}{4}) D N (0, 4^{r - 3}), n \to \infty

n (\frac{S _{r + 1, n}}{S _{r, n}} - \frac{1}{4}) D N (0, 4^{r - 3}), n \to \infty

S_{r, n} (\frac{S _{r + 1, n}}{S _{r, n}} - \frac{1}{4}) D N (0, \frac{1}{16}), n \to \infty.

S_{r, n} (\frac{S _{r + 1, n}}{S _{r, n}} - \frac{1}{4}) D N (0, \frac{1}{16}), n \to \infty.

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{2, n}}{n} > y) = - I (y), y \in (\frac{1}{4}, \frac{1}{2})

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{2, n}}{n} > y) = - I (y), y \in (\frac{1}{4}, \frac{1}{2})

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{2, n}}{n} < y) = - I (y), y \in (0, \frac{1}{4}),

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{2, n}}{n} < y) = - I (y), y \in (0, \frac{1}{4}),

I (y) = (4 y - 1) tanh^{- 1} (4 y - 1) - lo g (cosh (tanh^{- 1} (4 y - 1))) .

I (y) = (4 y - 1) tanh^{- 1} (4 y - 1) - lo g (cosh (tanh^{- 1} (4 y - 1))) .

φ_{n} (ξ) = a_{n}^{- 1} lo g E [exp (ξ X_{n})],

φ_{n} (ξ) = a_{n}^{- 1} lo g E [exp (ξ X_{n})],

n \to \infty lim φ_{n} (ξ) = φ_{\infty} (ξ) < \infty,

n \to \infty lim φ_{n} (ξ) = φ_{\infty} (ξ) < \infty,

n \to \infty lim a_{n}^{- 1} lo g P (\frac{X _{n}}{a _{n}} > y) = - I (y), y \in (μ, α_{+})

n \to \infty lim a_{n}^{- 1} lo g P (\frac{X _{n}}{a _{n}} > y) = - I (y), y \in (μ, α_{+})

n \to \infty lim a_{n}^{- 1} lo g P (\frac{X _{n}}{a _{n}} < y) = - I (y), y \in (α_{-}, μ),

n \to \infty lim a_{n}^{- 1} lo g P (\frac{X _{n}}{a _{n}} < y) = - I (y), y \in (α_{-}, μ),

\frac{X _{n} - E [ X _{n} ]}{a _{n}} D N (0, σ^{2}), n \to \infty

\frac{X _{n} - E [ X _{n} ]}{a _{n}} D N (0, σ^{2}), n \to \infty

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{2, n})] = \frac{ξ}{4} + lo g (cosh \frac{ξ}{4}) =: φ (ξ),

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{2, n})] = \frac{ξ}{4} + lo g (cosh \frac{ξ}{4}) =: φ (ξ),

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{r + 1, n})] = φ \circ \dots \circ φ r (ξ) = φ^{r} (ξ),

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{r + 1, n})] = φ \circ \dots \circ φ r (ξ) = φ^{r} (ξ),

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{r + 1, n}}{n} > y) = - I_{r} (y), y \in (\frac{1}{4 ^{r}}, \frac{1}{2 ^{r}})

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{r + 1, n}}{n} > y) = - I_{r} (y), y \in (\frac{1}{4 ^{r}}, \frac{1}{2 ^{r}})

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{r + 1, n}}{n} < y) = - I_{r} (y), y \in (0, \frac{1}{4 ^{r}}),

n \to \infty lim \frac{1}{n} lo g P (\frac{S _{r + 1, n}}{n} < y) = - I_{r} (y), y \in (0, \frac{1}{4 ^{r}}),

(φ^{r})^{'} (ξ) = j = 0 \prod r - 1 φ^{'} (φ^{j} (ξ)) = j = 0 \prod r - 1 \frac{1 + tanh ( φ ^{j} ( ξ ) /4 )}{4} .

(φ^{r})^{'} (ξ) = j = 0 \prod r - 1 φ^{'} (φ^{j} (ξ)) = j = 0 \prod r - 1 \frac{1 + tanh ( φ ^{j} ( ξ ) /4 )}{4} .

μ = (φ^{r})^{'} (0) = \frac{1}{4 ^{r}}, α_{-} = (φ^{r})^{'} (- \infty) = 0, α_{+} = (φ^{r})^{'} (\infty) = \frac{1}{2 ^{r}} .

μ = (φ^{r})^{'} (0) = \frac{1}{4 ^{r}}, α_{-} = (φ^{r})^{'} (- \infty) = 0, α_{+} = (φ^{r})^{'} (\infty) = \frac{1}{2 ^{r}} .

(φ^{r})^{''} (ξ)

(φ^{r})^{''} (ξ)

= (φ^{r})^{'} (ξ) k = 0 \sum r - 1 \frac{φ ^{''} ( φ ^{k} ( ξ ))}{φ ^{'} ( φ ^{k} ( ξ ))} l = 0 \prod k - 1 φ^{'} (φ^{l} (ξ)) .

(φ^{r})^{''} (0) = \frac{4 ^{r} - 1}{3 \cdot 1 6 ^{r}} .

(φ^{r})^{''} (0) = \frac{4 ^{r} - 1}{3 \cdot 1 6 ^{r}} .

E [exp (ξ S_{2, n})] = \frac{n ! ( n - 1 )! ( n - 2 )!}{( 2 n - 2 )!} m = 1 \sum ⌊ n /2 ⌋ \frac{2 ^{n - 2 m}}{( n - 2 m )! m ! ( m - 1 )!} e^{ξ m} .

E [exp (ξ S_{2, n})] = \frac{n ! ( n - 1 )! ( n - 2 )!}{( 2 n - 2 )!} m = 1 \sum ⌊ n /2 ⌋ \frac{2 ^{n - 2 m}}{( n - 2 m )! m ! ( m - 1 )!} e^{ξ m} .

N! \sim 2 π N (\frac{N}{e})^{N},

N! \sim 2 π N (\frac{N}{e})^{N},

E [exp (ξ S_{2, n})]

E [exp (ξ S_{2, n})]

= \frac{2}{π n} \int_{0}^{1/2} \frac{1}{1 - 2 β} exp (n g (β; ξ)) d β,

g (β; ξ) := ξ β - (1 - 2 β) lo g (1 - 2 β) - 2 β lo g β - β lo g 4 - lo g 2,

g (β; ξ) := ξ β - (1 - 2 β) lo g (1 - 2 β) - 2 β lo g β - β lo g 4 - lo g 2,

β_{0} = \frac{e ^{ξ /4}}{4 cosh ( ξ /4 )},

β_{0} = \frac{e ^{ξ /4}}{4 cosh ( ξ /4 )},

E [exp (ξ S_{2, n})] \sim \frac{2}{π n} \frac{1}{1 - 2 β _{0}} exp (n g (β_{0}; ξ)) \frac{2 π}{- g ^{''} ( β _{0} ; ξ )} .

E [exp (ξ S_{2, n})] \sim \frac{2}{π n} \frac{1}{1 - 2 β _{0}} exp (n g (β_{0}; ξ)) \frac{2 π}{- g ^{''} ( β _{0} ; ξ )} .

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{2, n})] = g (β_{0}; ξ) = \frac{ξ}{4} + lo g (cosh \frac{ξ}{4}) .

n \to \infty lim \frac{1}{n} lo g E [exp (ξ S_{2, n})] = g (β_{0}; ξ) = \frac{ξ}{4} + lo g (cosh \frac{ξ}{4}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Large deviation theorem for branches of the random binary tree in the Horton-Strahler analysis

Ken Yamamoto Department of Physics and Earth Sciences, Faculty of Science, University of the Ryukyus, Senbaru, Okinawa, Japan (). [email protected]

Abstract

The Horton-Strahler analysis is a graph-theoretic method to measure the bifurcation complexity of branching patterns, by defining a number called the order to each branch. The main result of this paper is a large deviation theorem for the number of branches of each order in a random binary tree. The rate function associated with a large deviation cannot be derived in a closed form; instead, asymptotic forms of the rate function are given.

keywords:

large deviation, central limit theorem, binary tree

{AMS}

60F10, 05C05, 60F05

1 Introduction

The topological analysis of branching patterns or objects began with hydrological research on river networks. Horton proposed a systematic method to assign a number (called the order) to each stream based on the join of streams [8]. Horton’s law of stream numbers is an empirical relation stating that the number of streams of order $r$ decreases geometrically with $r$ . Horton’s method partially needs information about spatial configuration of the river network such as stream lengths and junction angles. Strahler refined Horton’s method so that the order is defined by a purely graph-theoretic way [14].

Strahler’s ordering method for a binary tree is composed of the following three rules.

The leaf nodes (degree-one nodes) are defined to have order 1. 2. 2.

A node whose children have different order $r_{1}$ and $r_{2}$ ( $r_{1}\neq r_{2}$ ) has order $\max\{r_{1},r_{2}\}$ . 3. 3.

A node whose two children have the same order $r$ has order $r+1$ .

We define a branch of order $r$ as a maximal connected path whose constituent node(s) all have the same order $r$ . For a binary tree $\tau$ having $n$ leaves, let $S_{r,n}(\tau)$ denote the number of its order- $r$ branches. Further, based on Strahler’s ordering, Tokunaga [15] established a method, called the Tokunaga indexing, to describe the structure of side-branching. The Horton-Strahler analysis, based on the branch order, has been applied to a wide variety of branching patterns and structures, such as botanical trees [10] and blood vessels [16] in biology, register allocation in computer science [4], cracks in material engineering [5], and complex network analysis [7].

In this paper, we focus on rooted, planar, full binary trees [13]. This class of binary trees appears naturally in modeling a river network. A special node corresponding to the estuary is called the root; each stream has a flow direction towards it. A river network is embedded in the ground surface, so the corresponding tree is planar; formally, a planar binary tree is defined as a rooted binary tree with right and left directions assigned to each pair of children of the same parent. If we set only the junction points as internal nodes, each node has either zero or two children; this type of binary tree is called a full binary tree. We let $\Omega_{n}$ denote the set of planar full binary trees having $n$ leaves. The number of distinct trees in $\Omega_{n}$ is expressed as

[TABLE]

and this combinatorial quantity is known as the $(n-1)$ th Catalan number [13]. For example, $\Omega_{3}$ consists of two binary trees, each of which has three branches of order 1 and one branch of order 2 (see Fig. 1 for reference). A probability space formed by introducing the uniform probability measure on $\Omega_{n}$ is referred to as the random binary tree model (or random model for short), introduced by Shreve [12]. Note that $S_{r,n}$ is a random variable on the random model.

A striking feature common to some stochastic tree models is self-similarity, which means the invariance under the operation of pruning (cutting leaves) [11]. A stochastic tree model is said to be Horton self-similar if it satisfies Horton’s law, and self-similarity involving side-branching structure is called Tokunaga self-similarity. The random model, the Tokunaga model [15], and the random self-similar network [17] are well-known self-similar tree models. For the development and related topics of the self-similarity of random trees, see Kovchegov and Zaliapin [9].

The main subject of this paper is the asymptotic property of $S_{r,n}$ . For any function $f:\mathbb{N}\cup\{0\}\to\mathbb{R}$ , $f(S_{r,n}(\cdot))$ is a real-valued random variable on the random model. By applying the pruning operation, a recursive relation

[TABLE]

between the averages of two adjoining orders $r$ and $r+1$ holds [20], where $E\left[\cdot\right]$ denotes the average on the random model. In this paper, we mainly study the case where $f$ is an exponential function.

Wang and Waymire [18] proved the central limit theorem for $S_{2,n}$ :

[TABLE]

where ‘ $\xrightarrow{D}$ ’ denotes convergence in distribution, and $N(\mu,\sigma^{2})$ is the normal distribution with mean $\mu$ and variance $\sigma^{2}$ . Recently, Yamamoto [21] obtained two generalized forms of Eq. (2) as

[TABLE]

and

[TABLE]

for each $r=1,2,\ldots$ These results are both reduced to Eq. (2) when $r=1$ . Equation (3) implies that $S_{r+1,n}/n$ converges in probability to $4^{-r}$ as $n\to\infty$ , which is compared with Horton’s law.

It is worth pointing out that the central limit theorem (4) can be derived easily by the pruning operation [1, 9]. Since pruning a binary tree $\tau\in\Omega_{n}$ $r-1$ times yields a binary tree having $S_{r,n}(\tau)$ leaves, Eq. (2) immediately implies

[TABLE]

Considering $S_{r,n}/n\to 4^{1-r}$ along with this equation, we obtain the central limit theorem (4).

As for $S_{2,n}$ , the following large deviation theorem was demonstrated [18].

Theorem 1.1 (Large deviation theorem for $S_{2,n}$ ).

For the random model,

[TABLE]

and

[TABLE]

where the rate function $I(y)$ is given by

[TABLE]

For the proof of this theorem, the following general result on large deviation properties is important.

Theorem 1.2 (Cox and Griffeath [2]).

Let $(X_{1},X_{2}.\ldots)$ be a sequence of random variables and let

[TABLE]

where $\{a_{n}\}$ is a sequence of positive numbers such that $a_{n}\to\infty$ . Assume that on the interval $(\xi_{-},\xi_{+})\ni 0$ ,

[TABLE]

where $\varphi_{\infty}(\xi)$ is strictly convex and $C^{2}$ on $(\xi_{-},\xi_{+})$ . If $\varphi^{\prime}_{n}$ is convex on $[0,\xi_{+})$ and $\lim_{n\to\infty}\varphi^{\prime\prime}_{n}(0)=\sigma^{2}=\varphi^{\prime\prime}_{\infty}(0)$ , then

[TABLE]

and

[TABLE]

where $\mu=\varphi^{\prime}_{\infty}(0)$ , $\alpha_{-}=\varphi^{\prime}_{\infty}(\xi_{-}+)$ , $\alpha_{+}=\varphi^{\prime}_{\infty}(\xi_{+}-)$ , and $I(y)$ is the Legendre transform of $\varphi_{\infty}(\xi)$ . In addition, the central limit theorem

[TABLE]

*holds.

For systematic treatment of large deviation theory, see Ellis [6] and Deuschel and Stroock [3] for example. Wang and Waymire [18] derived

[TABLE]

for any $\xi\in\mathbb{R}$ , which leads to the proof of Theorem 5 (with $a_{n}=n$ in Theorem 1.2). The rate function $I(y)$ given in Eq. (5) is the Legendre transform of $\varphi$ . Furthermore, owing to $\varphi^{\prime\prime}(0)=1/16$ , Theorem 5 directly implies the central limit theorem (2) via Theorem 1.2.

Unlike Eq. (2), central limit theorems (3) and (4) were obtained by the asymptotic properties of the characteristic functions of $S_{r+1,n}/n$ and $S_{r+1,n}/S_{r,n}$ [21]. Thus, a natural problem is to establish the large deviation theorems corresponding to Eqs. (3) and (4). In this paper, a large deviation theorem connected to Eq. (3) is formulated and proved.

2 Main result

Lemma 2.1.

For $r=1,2,\ldots$ and $\xi\in\mathbb{R}$ ,

[TABLE]

*where the function $\varphi$ is introduced in Eq. (6).

This is a generalized result of Eq. (6). We give the proof of this lemma in the next section.

By Lemma 2.1, we can prove the following large deviation theorem for $S_{r+1,n}$ .

Theorem 2.2 (Large deviation theorem for $S_{r+1,n}$ ).

For $r=1,2,\ldots$ ,

[TABLE]

and

[TABLE]

*where the rate function $I_{r}(y)$ is the Legendre transform of $\varphi^{r}(\xi)$ .

Note that this theorem includes Theorem 5 as a special case of $r=1$ .

Proof 2.3.

We can complete the proof by substituting $\varphi^{r}(\xi)$ in Lemma 2.1 for $\varphi_{\infty}(\xi)$ in Theorem 1.2. Since the function $\varphi(\xi)$ is strictly increasing, strictly convex and $C^{2}$ on $\mathbb{R}$ , its composite $\varphi^{r}(\xi)$ also possesses these properties. Hence, $\xi_{\pm}=\pm\infty$ for any $r$ . By the chain rule and $\varphi^{\prime}(\xi)=[1+\tanh(\xi/4)]/4$ , the derivative of $\varphi^{r}(\xi)$ is

[TABLE]

Owing to $\varphi(0)=0$ and $\varphi(\pm\infty)=\pm\infty$ , we obtain

[TABLE]

*Therefore, the proof is complete. *

Remark 1.

As a consequence of Theorem 2.2, the central limit theorem (3) holds straightforwardly from Lemma 2.1 and Theorem 1.2. By differentiating Eq. (7) again and applying the Leibniz rule, the second derivative of $\varphi^{r}$ is

[TABLE]

Using $\varphi(0)=0$ , $\varphi^{\prime}(0)=1/4$ , $\varphi^{\prime\prime}(0)=1/16$ , and ( $\varphi^{r})^{\prime}(0)=4^{-r}$ , we obtain

[TABLE]

*Thus, the central limit theorem (3) is derived. *

Lemma 2.1 and Theorem 2.2 indicate that the order $r$ appears in the composition $\varphi^{r}$ . The author believes that this regularity implies self-similarity of trees from the perspective of large deviation theory.

A large deviation formalism of Eq. (4) is not studied in this paper, and is an open problem.

3 Proof of Lemma 2.1

This section is mainly devoted to the proof of Lemma 2.1.

First, we show Lemma 2.1 for $r=1$ (corresponding to $S_{2,n}$ ). This case (6) was already proved by Wang and Waymire [18], but we employ a formula different from theirs. Our method has the major advantage that we can easily extend to $r\geq 2$ .

We need to estimate $E\left[\exp(\xi S_{2,n})\right]$ , which is the moment generating function of $S_{2,n}$ . By setting $r=1$ and $f(S_{r,n})=\exp(\xi S_{r,n})$ in Eq. (1),

[TABLE]

This sum can be calculated exactly using the Gauss hypergeometric function [19], but here we perform an asymptotic analysis using a saddle-point method.

Letting $m=\beta n$ ( $0<\beta<1/2$ ) to replace the sum by integral about $\beta$ , and using Stirling’s approximation

[TABLE]

we get

[TABLE]

where

[TABLE]

and ‘ $\sim$ ’ denotes the asymptotic equality in the sense that the ratio between both hand sides tends to unity as $n\to\infty$ . One can easily confirm that the function $g(\beta;\xi)$ takes a maximum value at

[TABLE]

thereby

[TABLE]

Therefore,

[TABLE]

Remark 2.

Equation (6) was previously obtained [18] by using a saddle-point method to

[TABLE]

*However, as noted in [18], this procedure needs to treat the two cases where $e^{\xi}-1$ is positive and where $e^{\xi}-1$ is negative separately. Moreover, it seems to be difficult to extend their method to general $S_{r+1,n}$ . In this light, our method, starting with Eq. (1), is advantageous compared to the preceding one. *

Next, we proceed to general $S_{r+1,n}$ by induction on $r$ . Assume that

[TABLE]

where the coefficient $C_{r,n}$ satisfies

[TABLE]

and we show Eq. (10) for $r+1$ . By Eq. (1) and asymptotic approximation as above, we have

[TABLE]

The saddle-point estimation requires to maximize the same function $g$ as in Eq. (9), but $\xi$ in Eq. (9) is replaced by $\varphi^{r-1}(\xi)$ here. We also note that the coefficient $C_{r,\beta n}$ does not affect the saddle-point method. Hence, for some coefficient $C_{r+1,n}$ , we have

[TABLE]

so that

[TABLE]

Thus, the statement holds for any $r$ .

4 Note on approximate forms of the rate function

Unfortunately, the rate function $I_{r}(y)$ in Theorem 2.2 cannot be expressed exactly for $r\geq 2$ . By the definition of the Legendre transformation, $I_{r}(y)$ is given by

[TABLE]

where $\xi_{r}^{\ast}(y)$ satisfies

[TABLE]

In short, $\xi_{r}^{\ast}$ is the inverse function of $(\varphi^{r})^{\prime}$ . The difficulty for $I_{r}(y)$ is due to the fact that $(\varphi^{r})^{\prime}$ has a complicated form for $r\geq 2$ and $\xi_{r}^{\ast}(y)$ cannot be solved explicitly. Instead of the exact form of $I_{r}(y)$ , we derive its approximate forms.

According to the general theory of rate functions [6], $I_{r}(y)$ is convex and takes the minimum value 0 at $y=(\varphi^{r})^{\prime}(0)=4^{-r}$ . Moreover, the derivative of Eq. (11) yields

[TABLE]

Owing to $I_{r}(4^{-r})=0$ and $I_{r}^{\prime}(4^{-r})=0$ , a second-order Taylor expansion of $I_{r}$ around $y=4^{-r}$ becomes

[TABLE]

In other words, the bottom of the curve of $I_{r}(y)$ is approximated by a parabola, and this is equivalent to the central limit theorem (3). Differentiating Eq. (13), we have

[TABLE]

Figure 2 shows numerical results of $I_{r}(y)$ and $\xi_{r}^{\ast}(y)$ for $r=1$ , 2, and 3 by the solid curves. We used the Newton-Raphson method to solve $\xi_{r}^{\ast}(y)$ . Approximate forms (13) and (14) corresponding to the central limit theorem are shown by dashed curves and lines. The dashed curves are close to the solid ones only in the vicinity of $y=4^{-r}$ . In what follows, we calculate approximate forms of $I_{r}(y)$ near $y=0$ (leftmost point) and $2^{-r}$ (rightmost point).

Definition 4.1.

For simplicity of notation, we introduce

[TABLE]

*for $X\in[0,\infty)$ . *

Proposition 3.

$\varphi$ * and $\Psi$ possess the following properties.*

$\varphi(\log X)=\log\Psi(X),\quad\varphi^{k}(\log X)=\log\Psi^{k}(X)$ ** 2. 2.

$\varphi^{\prime}(\log X)=\frac{\sqrt{X}}{4\Psi(X)}$ ** 3. 3.

$\varphi^{\prime\prime}(\log X)=\frac{\sqrt{X}}{16\Psi(X)^{2}}$ **

Proof 4.2.

One can easily prove this by using the following relations:

[TABLE]

Proposition 4.

By using $\Psi$ , the first and second derivatives of $\varphi^{r}$ are respectively expressed as

[TABLE]

and

[TABLE]

Proof 4.3.

By Eq. (7) and Proposition 3, $(\varphi^{r})^{\prime}$ is written as

[TABLE]

*Next, we get Eq. (16) straightforwardly by Eq. (8) and Proposition 3. *

Using the above properties of $\varphi^{r}$ , let us derive the expansion of $\xi_{r}^{\ast}(y)$ .

Theorem 4.4 (Asymptotic forms of $\xi_{r}^{\ast}(y)$ around $y=0$ and $y=2^{-r}$ ).

Around $y=0$ which is the leftmost point of $I_{r}(y)$ ,

[TABLE] 2. 2.

Around $y=2^{-r}$ which is the rightmost point of $I_{r}(y)$ ,

[TABLE]

Proof 4.5.

By Eq. (12),

[TABLE]

In this proof, we use this formula as a differential equation to determine $\xi_{r}^{\ast}$ . Since $\xi_{r}^{\ast}(0)=-\infty$ and $\xi_{r}^{\ast}(2^{-r})=\infty$ , we need to expand $(\varphi^{r})^{\prime\prime}(\xi)$ around $\xi=\mp\infty$ , corresponding to $y=0$ and $2^{-r}$ , by means of Proposition 16.

We expand $(\varphi^{r})^{\prime\prime}(\xi)$ on condition that $e^{\xi}$ is sufficiently small. By definition, $\Psi(e^{\xi})=(e^{\xi/2}+1)/2=\Psi(0)+O(e^{\xi/2})$ , and similarly $\Psi^{k}(e^{\xi})=\Psi^{k}(0)+O(e^{\xi/2})$ for $k\geq 1$ . Putting into Eq. (15), we immediately obtain

[TABLE]

To estimate the sum in Eq. (16), the term of $l=0$ is $O(e^{-\xi/2})$ which is the leading order, and the others are $O(1)$ . Thus, by neglecting the terms other than $l=0$ , we have

[TABLE]

Hence, Eq. (17) becomes

[TABLE]

By integrating from $y=0$ to $\eta$ ,

[TABLE]

and the solution $\xi_{r}^{\ast}$ is

[TABLE] 2. 2.

Contrary to the above, $\xi_{r}^{\ast}(y)$ tends to infinity as $y\nearrow 2^{-r}$ , so we need to expand $(\varphi^{r})^{\prime\prime}(\xi)$ when $e^{\xi}$ is sufficiently large. From the observation

[TABLE]

we reasonably set $\Psi^{k}(e^{\xi})=2^{-\rho_{k}}\exp(\xi/2^{k})+O(1)$ . The exponent $\rho_{k}$ satisfies $\rho_{1}=1$ and $\rho_{k+1}=\rho_{k}/2+1$ , so that

[TABLE]

Noting that

[TABLE]

we have

[TABLE]

In this case, the dominant term of the sum in Eq. (16) corresponds to $l=r-1$ , so that

[TABLE]

Finally, integrating Eq. (17) from $y=2^{-r}-\eta$ to $2^{-r}$ as above, we get

[TABLE]

By using $\xi_{r}^{\ast}(y)$ in Theorem 4.4, we reach asymptotic forms of $I_{r}(y)$ .

Theorem 4.6 (Asymptotic forms of the rate function $I_{r}(y)$ around $y=0$ and $2^{-r}$ ).

Around $y=0$ ,

[TABLE] 2. 2.

Around $y=2^{-r}$ ,

[TABLE]

Proof 4.7.

As in the proof of Proposition 3, we use $Q_{r}=4^{r}[\Psi^{r}(0)\prod_{k=1}^{r}\Psi^{k}(0)]^{1/2}$ and

[TABLE]

By integrating from [math] to $\eta$ ,

[TABLE]

The proof is completed by calculating $I_{r}(0)$ as

[TABLE] 2. 2.

By integrating Eq. (17) from $2^{-r}-\eta$ to $2^{-r}$ , we have

[TABLE]

We need to be careful in the calculation of $I_{r}(2^{-r})$ , because both $\xi_{r}^{\ast}(y)$ and $\varphi^{r}(\xi_{r}^{\ast}(y))$ in Eq. (11) diverge as $y\to 2^{-r}$ .

[TABLE]

Remark 5.

The same calculation applies to $y=4^{-r}$ ( $\xi_{r}^{\ast}(4^{-r})=0$ ). Note that $e^{0}=1$ is the fixed point of $\Psi$ , namely $\Psi(1)=1$ , so we obtain

[TABLE]

*The Taylor expansion (13) of $I_{r}(y)$ around $y=4^{-r}$ is reproduced. *

By the exact form of $I_{1}(y)=I(y)$ in Eq. (5), $I_{1}(y)$ is symmetric about $y=1/4$ . On the other hand, comparing the approximate forms of $I_{r}(y)$ near $y=0$ and $2^{-r}$ in Theorem 4.6, $I_{r}(y)$ for $r\geq 2$ is clearly asymmetric.

In Fig. 3, we show numerical results of $\xi_{r}^{\ast}(y)$ and $I_{r}(y)$ along with the asymptotic forms at $y\simeq 0$ (dashed curves) and $y\simeq 2^{-r}$ (dot-dashed curves) from Theorems 4.4 and 4.6. (The solid curves are the same as in Fig. 2.)

Acknowledgments

The author is grateful to referees for instructing recent related articles. The idea that Eq. (4) is derived by the pruning operation is suggested by a referee. The present work was partially supported by a University of the Ryukyus Research Project Promotion Grant for Young Researchers (17SP04109), and Hayao Nakayama Foundation for Science & Technology and Culture (H29-B-41).

Bibliography21

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Burd, G. A., Waymire, E. C. and Winn, R. D. (2000). A self-similar invariance of critical binary Galton-Watson trees. Bernoulli 6, 1–21.
2[2] Cox, J. T. and Griffeath, D. (1984). Large deviations for Poisson systems of independent random walks. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete 66, 543–558.
3[3] Deuschel, J. D. and Stroock, D. W. (1989). Large Deviations , Academic Press, Boston.
4[4] Devroye, L. and Kruszewski, P. (1994). A note on the Horton-Strahler number for random trees. Inform. Process. Lett. 52, 155–159.
5[5] Djordjevic, Z. V., Li, X. F., Shin, W. S., Wunder, S. L. and Baran, G. R. (1995). Fractal and topological characterization of branching patterns on the fracture surfaces of cross-linked dimethacrylate resins. J. Mater. Sci. 30, 2968–2980.
6[6] Ellis, R. S. (2006). Entropy, Large Deviations, and Statistical Mechanics , Springer, Berlin.
7[7] Guimerà, R., Danon, L., Díaz-Guilera, A., Giralt, F. and Arenas, A. (2003). Self-similar community structure in a network of human interactions. Phys. Rev. E 68, 065103(R).
8[8] Horton, R. E. (1945). Erosional development of streams and their drainage basins; hydrophysical approach to quantitative morphology. Geol. Soc. Am. Bull. 56, 275–370.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Large deviation theorem for branches of the random binary tree in the Horton-Strahler analysis

Abstract

keywords:

1 Introduction

Theorem 1.1** (Large deviation theorem for S2,nS_{2,n}S2,n​).**

Theorem 1.2** (Cox and Griffeath [2]).**

2 Main result

Lemma 2.1**.**

Theorem 2.2** (Large deviation theorem for Sr+1,nS_{r+1,n}Sr+1,n​).**

Proof 2.3**.**

Remark 1**.**

3 Proof of Lemma 2.1

Remark 2**.**

4 Note on approximate forms of the rate function

Definition 4.1**.**

Proposition 3**.**

Proof 4.2**.**

Proposition 4**.**

Proof 4.3**.**

Theorem 4.4** (Asymptotic forms of ξr∗(y)\xi_{r}^{\ast}(y)ξr∗​(y) around y=0y=0y=0 and y=2−ry=2^{-r}y=2−r).**

Proof 4.5**.**

Theorem 4.6** (Asymptotic forms of the rate function Ir(y)I_{r}(y)Ir​(y) around y=0y=0y=0 and 2−r2^{-r}2−r).**

Proof 4.7**.**

Remark 5**.**

Acknowledgments

Theorem 1.1 (Large deviation theorem for $S_{2,n}$ ).

Theorem 1.2 (Cox and Griffeath [2]).

Lemma 2.1.

Theorem 2.2 (Large deviation theorem for $S_{r+1,n}$ ).

Proof 2.3.

Remark 1.

Remark 2.

Definition 4.1.

Proposition 3.

Proof 4.2.

Proposition 4.

Proof 4.3.

Theorem 4.4 (Asymptotic forms of $\xi_{r}^{\ast}(y)$ around $y=0$ and $y=2^{-r}$ ).

Proof 4.5.

Theorem 4.6 (Asymptotic forms of the rate function $I_{r}(y)$ around $y=0$ and $2^{-r}$ ).

Proof 4.7.

Remark 5.