Large Degree Asymptotics and the Reconstruction Threshold of the Asymmetric Binary Channels

Wenjian Liu; Ning Ning

arXiv:1812.06039·math.PR·February 20, 2019


TL;DR

This paper investigates the reconstruction problem on noisy tree networks with asymmetric binary channels, providing precise thresholds and asymptotic behavior for large degrees through refined analytical methods.

Contribution

It extends previous work by rigorously determining the conditions under which the reconstruction threshold is tight for asymmetric binary channels on large-degree trees.

Findings

01

Established the exact reconstruction threshold for asymmetric binary channels.

02

Derived asymptotic behavior of the threshold as the degree grows large.

03

Provided refined analysis techniques for moment recursion and concentration.

Abstract

In this paper, we consider a broadcasting process in which information is propagated from a given root node on a noisy tree network, and answer the question of whether the symbols at the nth level of the tree contain non-vanishing information of the root as n goes to infinity. Although the reconstruction problem on the tree has been studied in numerous contexts including information theory, mathematical genetics and statistical physics, the existing literature with rigorous reconstruction thresholds established is very limited. In the remarkable work of Borgs, Chayes, Mossel and Roch (The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels), the exact threshold for the reconstruction problem for a binary asymmetric channel on the d-ary tree is established, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold…

Equations

$$P(\sigma_{v}=j\mid\sigma_{u}=i)=M_{ij},\quad i,j\in\mathcal{C}.$$

$$\mathbf{M}=\frac{1}{2}\left(\begin{array}{cc}1+\theta&1-\theta\\ 1-\theta&1+\theta\end{array}\right)+\frac{\Delta}{2}\left(\begin{array}{cc}-1&1\\ -1&1\end{array}\right),$$

$$\pi_{1}=\frac{1}{2}-\frac{\Delta}{2(1-\theta)}\quad\text{and}\quad\pi_{2}=\frac{1}{2}+\frac{\Delta}{2(1-\theta)},$$

$$H(\sigma)=-\sum_{\langle i,j\rangle}J_{ij}\sigma_{i}\sigma_{j}-\mu\sum_{j}h_{j}\sigma_{j},$$

$$\limsup_{n\to\infty}d_{TV}(\sigma^{i}(n),\sigma^{j}(n))>0.$$

$$\theta^{+}\leq d^{-1/2}\quad\text{and}\quad\theta^{-}\geq-d^{-1/2},$$

$$\lim_{d\to\infty}d(\theta^{\pm})^{2}=C_{\pi},$$

$$\theta^{+}=d^{-1/2}\quad\text{and}\quad\theta^{-}=-d^{-1/2}.$$

$$\lim_{n\to\infty}x_{n}=0.$$

$$x_{n+1}\approx d\theta^{2}x_{n}+\frac{1-6\pi_{1}\pi_{2}}{\pi_{1}\pi_{2}^{2}}\frac{d(d-1)}{2}\theta^{4}x_{n}^{2}.$$

$$f_{n}(i,A)=P(\sigma_{\rho}=i\mid\sigma(n)=A).$$

$$f_{n}(i,A)=P(\sigma_{u_{j}}=i\mid\sigma_{j}(n+1)=A).$$

$$X_{i}=X_{i}(n)=f_{n}(i,\sigma(n)),\quad X^{+}=X^{+}(n)=f_{n}(1,\sigma^{1}(n)),\quad X^{-}=X^{-}(n)=f_{n}(2,\sigma^{2}(n)),$$

$$Y_{j}=Y_{j}(n)=f_{n}(1,\sigma_{j}^{1}(n+1)),$$

$$X_{1}(n)+X_{2}(n)=1$$

$$E(X_{1})=\pi_{1},\quad E(X_{2})=\pi_{2}.$$

$$x_{n}=E(X^{+}(n)-\pi_{1})\quad\text{and}\quad z_{n}=E(X^{+}(n)-\pi_{1})^{2}.$$

$$x_{n}=\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}=E(X^{+}(n)-\pi_{1})^{2}+\frac{\pi_{2}}{\pi_{1}}E(X^{-}(n)-\pi_{2})^{2}\geq z_{n}\geq 0.$$

$$EX^{+}$$

$$EX^{-}=Ef_{n}(2,\sigma^{2}(n))=\frac{1}{\pi_{2}}E(X_{2}^{2}).$$

$$x_{n}=\frac{1}{\pi_{1}}\left(E(X_{1}^{2})-\pi_{1}^{2}\right)=\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}.$$

$$x_{n}=\frac{1}{\pi_{1}}E(X_{2}-\pi_{2})^{2}=\frac{\pi_{2}}{\pi_{1}}\left(EX^{-}(n)-\pi_{2}\right).$$

$$x_{n}$$

$$E(Y_{j}-\pi_{1})=\theta x_{n}\quad\text{and}\quad E(Y_{j}-\pi_{1})^{2}=\pi_{1}x_{n}+\theta(z_{n}-\pi_{1}x_{n}).$$

$$E(Y_{j}-\pi_{1})=P(\sigma_{u_{j}}^{1}=1)E(X^{+}(n)-\pi_{1})+P(\sigma_{u_{j}}^{1}=2)E(1-X^{-}(n)-\pi_{1})=M_{11}x_{n}-M_{12}\frac{\pi_{1}}{\pi_{2}}x_{n}=\theta x_{n}$$

$$E(Y_{j}-\pi_{1})^{2}$$

$$\lim_{n\to\infty}x_{n}=0.$$

$$\Delta_{n}=E\max\{X_{1}(n),X_{2}(n)\}.$$

$$\Delta_{n}\leq\pi_{1}+E\max\{X_{1}(n)-\pi_{1},X_{2}(n)-\pi_{2}\}=\pi_{1}+E|X_{1}(n)-\pi_{1}|$$


Full text

Wenjian Liu: Dept. of Mathematics and Computer Science, Queensborough Community College, City University of New York. Research supported by CUNY Community College Research Grant #1541.

Ning Ning (Corresponding Author): Dept. of Applied Mathematics, University of Washington, Seattle.

Abstract

In this paper, we consider a broadcasting process in which information is propagated from a given root node on a noisy tree network, and answer the question of whether the symbols at the $n$th level of the tree contain non-vanishing information of the root as $n$ goes to infinity. Although the reconstruction problem on the tree has been studied in numerous contexts including information theory, mathematical genetics and statistical physics, the existing literature with rigorous reconstruction thresholds established is very limited. In the remarkable work of Borgs, Chayes, Mossel and Roch (The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels. FOCS, IEEE Comput. Soc. (2006): 518–530. Berkeley, CA.), the exact threshold for the reconstruction problem for a binary asymmetric channel on the $d$-ary tree is established, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold obtained in roughly a decade. In this paper, by means of refined analyses of moment recursion on a weighted version of the magnetization, and concentration investigations, we rigorously give a complete answer to the question of how small the asymmetry needs to be to establish the tightness of the reconstruction threshold, and further determine its large-degree asymptotics.

Keywords: Kesten-Stigum reconstruction bound · Markov random fields on trees · Distributional recursion · Nonlinear dynamical system

MSC: 60K35 · 82B26 · 82B20

1 Introduction

1.1 Broadcasting Process and the Reconstruction Problem

We consider the following broadcasting process, which can be viewed as a communication tree network, as a model for the propagation of a genetic property, or as a tree-indexed Markov chain. In this paper, we restrict our attention to the regular $d$-ary tree, an infinite rooted tree in which every vertex has exactly $d$ offspring, denoted by $\mathbb{T}=(\mathbb{V},\mathbb{E},\rho)$ with nodes $\mathbb{V}$, edges $\mathbb{E}$ and root $\rho\in\mathbb{V}$. A configuration on $\mathbb{T}$ is an element of $\mathcal{C}^{\mathbb{T}}$, with $\mathcal{C}$ a finite character set, that is, an assignment of a state in $\mathcal{C}$ to each vertex. The state of the root $\rho$, denoted by $\sigma_{\rho}$, is chosen according to some initial distribution $\pi$ on $\mathcal{C}$. This symbol is then propagated in the tree according to the probability transition matrix $\mathbf{M}=(M_{ij})_{i,j\in\mathcal{C}}$, which functions as the noisy communication channel on each edge. That is, for each vertex $v$ having $u$ as its parent, the spin at $v$ is defined according to the probabilities

$$P(\sigma_{v}=j\mid\sigma_{u}=i)=M_{ij},\quad i,j\in\mathcal{C}.$$

The model under consideration is the asymmetric binary channel with the configuration set $\mathcal{C}=\{1,2\}$, whose transition matrix is of the form

$$\mathbf{M}=\frac{1}{2}\left(\begin{array}{cc}1+\theta&1-\theta\\ 1-\theta&1+\theta\end{array}\right)+\frac{\Delta}{2}\left(\begin{array}{cc}-1&1\\ -1&1\end{array}\right),$$

where $|\theta|+|\Delta|\leq 1$ and $\Delta$ describes the deviation of $\mathbf{M}$ from the symmetric channel. It is easy to see that $\mathbf{M}$ has two eigenvalues, $1$ and $\theta$. We then pick a state at the root according to the stationary distribution $\pi=(\pi_{1},\pi_{2})$ of $\mathbf{M}$, which is given by

$$\pi_{1}=\frac{1}{2}-\frac{\Delta}{2(1-\theta)}\quad\text{and}\quad\pi_{2}=\frac{1}{2}+\frac{\Delta}{2(1-\theta)},$$

and without loss of generality, it is convenient to assume that $\pi_{1}\geq\pi_{2}$.
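As a quick numerical sanity check of the statements above, the following minimal Python sketch builds $\mathbf{M}$ from $(\theta,\Delta)$, confirms that its eigenvalues are $1$ and $\theta$ (through the trace $1+\theta$ and the determinant $\theta$), and confirms that the stated $\pi$ is stationary. The function name `channel` and the parameter values are illustrative assumptions and do not come from the paper.

```python
def channel(theta, delta):
    """Transition matrix M and stationary distribution pi of the asymmetric
    binary channel; index 0 stands for state 1 and index 1 for state 2."""
    if abs(theta) + abs(delta) > 1 or theta == 1:
        raise ValueError("need |theta| + |delta| <= 1 and theta != 1")
    M = [[(1 + theta - delta) / 2, (1 - theta + delta) / 2],
         [(1 - theta - delta) / 2, (1 + theta + delta) / 2]]
    shift = delta / (2 * (1 - theta))
    pi = [0.5 - shift, 0.5 + shift]
    return M, pi


# Illustrative parameters (an assumption); delta <= 0 gives pi_1 >= pi_2.
theta, delta = 0.3, -0.2
M, pi = channel(theta, delta)

trace = M[0][0] + M[1][1]                    # sum of the eigenvalues: 1 + theta
det = M[0][0] * M[1][1] - M[0][1] * M[1][0]  # product of the eigenvalues: theta
pi_M = [pi[0] * M[0][j] + pi[1] * M[1][j] for j in range(2)]  # should equal pi

print("trace:", trace, "det:", det)
print("pi:", pi, "pi M:", pi_M)
```

Since the trace and determinant of a $2\times 2$ matrix determine its eigenvalues, the check avoids any numerical eigenvalue routine.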

Recall that the classical Ising model, a mathematical model of ferromagnetism in statistical mechanics, consists of discrete variables that represent magnetic dipole moments of atomic spins that can be in one of two states ($-1$ or $+1$). Consider a set of lattice sites $\Lambda$, each with a set of adjacent sites (e.g. a graph) forming a lattice, and for each $k\in\Lambda$, there is a discrete variable $\sigma_{k}\in\{-1,+1\}$ representing the site's spin. The energy of a configuration $\sigma$ is given by the Hamiltonian function

$$H(\sigma)=-\sum_{\langle i,j\rangle}J_{ij}\sigma_{i}\sigma_{j}-\mu\sum_{j}h_{j}\sigma_{j},$$

where the notation $\langle i,j\rangle$ indicates that sites $i$ and $j$ are nearest neighbors, $J_{ij}$ denotes the interaction between two adjacent sites $i,j\in\Lambda$ and $h_{j}$ models the external magnetic field interaction of site $j\in\Lambda$. In this terminology, the current model corresponds to the general Ising model with external field on the tree.

The problem of reconstruction is to analyze whether there exists non-vanishing information on the letter transmitted from the root, given all the symbols received at the vertices of the $n$th generation, as $n$ goes to infinity. We define the distance between probability measures in line with Evans et al. [2000]. Let $v_{+}$ and $v_{-}$ be two probability measures on the same space. Set $f=dv_{+}/dv$ and $g=dv_{-}/dv$, where $v:=(v_{+}+v_{-})/2$. Inferring the root spin $\sigma_{\rho}$ from the spin configurations on the finite vertex set is a basic problem of Bayesian hypothesis testing. The total variation distance, defined as $d_{TV}(v_{+},v_{-}):=\frac{1}{2}\int|f-g|dv$, can be interpreted as the difference between the probabilities of correct and erroneous inferences. Denote by $\sigma(n)$ the spins at distance $n$ from the root and by $\sigma^{i}(n)$ the configuration $\sigma(n)$ conditioned on $\sigma_{\rho}=i$. Then the reconstruction problem can be formulated mathematically as follows:

Definition 1.

The reconstruction problem for the infinite tree $\mathbb{T}$ is solvable if for some $i,j\in\mathcal{C}$,

$$\limsup_{n\to\infty}d_{TV}(\sigma^{i}(n),\sigma^{j}(n))>0.$$

When the $\limsup$ is $0$, we will say that the model has non-reconstruction on $\mathbb{T}$.

1.2 Background and Applications

The reconstruction problem arises naturally in statistical physics, where the reconstruction threshold is identified as the threshold for extremality of the infinite-volume Gibbs measure with free boundary conditions (see Georgii [2011]). In Berger et al. [2005], Martinelli et al. [2007], Tetali et al. [2012], the reconstruction bound is found to play a crucial role in determining the efficiency of the Glauber dynamics on trees and random graphs. The reconstruction threshold is also believed to play an important role in a variety of other contexts, including phylogenetic reconstruction in evolutionary biology (Mossel [2004a], Daskalakis et al. [2006], Roch [2006]), communication theory in the study of noisy computation (Evans et al. [2000]), the clustering problem in the setting of the stochastic block model (Mossel et al. [2012, 2013], Neeman and Netrapalli [2014]), and network tomography (Bhamidi et al. [2010]). For a detailed explanation of the reconstruction problem in mixing, phylogeny and replicas, we refer to Section 1.3 in Bernussou and Abatut [1977]. For other applications of reconstruction, we refer to Section 1.4 in Sly [2011] and Section 1.3 in Liu et al. [2018], as well as the references therein.

In this paper, we focus on analyzing the tightness of the reconstruction bound on the tree for asymmetric binary channels, which corresponds to the asymmetric Ising model on the tree in statistical physics terms. It is well known that the reconstruction problem is closely related to $\lambda$, the second largest eigenvalue in absolute value of the transition probability matrix, which is $\theta$ in the model under investigation. Kesten and Stigum [1966, 1967] showed that the reconstruction problem is solvable if $d\lambda^{2}>1$, which is known as the Kesten-Stigum bound. However, in the case of larger noise, i.e. $d\lambda^{2}<1$, one may wonder whether the reconstruction problem is still solvable, that is, whether information transmitted from the root can still be retrieved by collecting and analyzing the whole set of symbols received at the $n$th generation.

First consider the symmetric channel. It was shown in Bleher et al. [1995] that the reconstruction problem is solvable if and only if $d\lambda^{2}>1$ in the binary model. For all other models, it was also known and easy to prove that $d\lambda^{2}>1$ implies solvability, while proving non-reconstructibility turned out to be harder. Although coupling arguments easily yield non-reconstruction, these arguments are typically not rigorous. A natural approach to establish non-reconstructibility is to analyze recursions in terms of random variables, each of whose values is the expectation of the chain at a vertex, given the state at the leaves of the subtree below it, and the corresponding probabilities. Although the reconstruction problem on the tree has been studied in numerous contexts, the existing literature with rigorous reconstruction thresholds established is very limited. Sly [2011] proved the first exact reconstruction threshold in a nonbinary model by establishing the Kesten-Stigum bound for the $3$-state Potts model on regular trees of large degree, and further established that the Kesten-Stigum bound is not tight for the $q$-state Potts model when $q\geq 5$, which confirms much of the picture conjectured earlier by Mézard and Montanari [2006]. Liu et al. [2018] considered a $2q$-state symmetric model, with two categories of $q$ states each and three transition probabilities (the probability to remain in the same state, the probability to change states but remain in the same category, and the probability to change categories), and showed that the Kesten-Stigum reconstruction bound is not tight when $q\geq 4$.

Next let us turn to the existing results regarding the asymmetric channel. Mossel [2001, 2004b] showed that the Kesten-Stigum bound is not the bound for reconstruction in the binary asymmetric model with sufficiently large asymmetry or in the symmetric Potts model with sufficiently many characters, which shed light on exploring the tightness of the Kesten-Stigum bound. Furthermore, Proposition 12 in Mossel [2001] implies that for any asymmetric channel, given $d$ and $\pi$, the reconstructibility is monotone in $|\theta|$; that is, there exist thresholds $\theta^{-}<0<\theta^{+}$ such that there is non-reconstruction when $\theta\in(\theta^{-},\theta^{+})$, while there is reconstruction when $\theta<\theta^{-}$ or $\theta>\theta^{+}$. Therefore, the Kesten-Stigum bound mentioned above implies immediately

$$\theta^{+}\leq d^{-1/2}\quad\text{and}\quad\theta^{-}\geq-d^{-1/2},$$

but exact thresholds for non-solvability had not been known. The breakthrough result in Borgs et al. [2006] established the exact threshold for the reconstruction problem with the binary asymmetric channel on the $d$-ary tree, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold obtained in roughly a decade. However, this beautiful result only proves rigorously that some $\Delta$ satisfying the reconstruction criterion exists; it does not answer the question of how small the asymmetry needs to be. Rigorously estimating the range of asymmetry for which the Kesten-Stigum bound remains tight is therefore a natural question, which will be answered in the next section.

1.3 Main Results and Proof Sketch

In this section, we present a critical condition on the stationary initial distribution $\pi$ under which the Kesten-Stigum bound remains tight, by means of refined recursive equations of vector-valued distributions and concentration analyses. Furthermore, when the Kesten-Stigum bound is not tight, we provide a new reconstruction threshold $C_{\pi}\in(0,1)$ for sufficiently large $d$. Since $d\theta^{2}>1$ always implies reconstruction, it suffices to consider $d\theta^{2}\leq 1$ in what follows.

Theorem 1.1.

For every $d$ and $\pi$ such that $\pi_{1}\pi_{2}<\frac{1}{6}$ , the Kesten-Stigum bound is not tight. In other words, the reconstruction problem is solvable for some $\theta$ , even if $d\theta^{2}<1$ .

The proof of Theorem 1.1 above is given in Section 5. The proofs of Theorem 1.2 and Theorem 1.3 below are given in Section 6.3 and Section 6.4 respectively.

Theorem 1.2.

For every $\pi$ such that $\pi_{1}\pi_{2}<\frac{1}{6}$, the reconstruction threshold has the following large-degree asymptotics:

$$\lim_{d\to\infty}d(\theta^{\pm})^{2}=C_{\pi},$$

where $C_{\pi}$ is a constant taking values in $(0,1)$ and depending only on $\pi$.

Theorem 1.3.

For every $\pi$ such that $\pi_{1}\pi_{2}>\frac{1}{6}$, there exists a $D=D(\pi)>0$ such that for $d>D$ the Kesten-Stigum bound is sharp, that is,

$$\theta^{+}=d^{-1/2}\quad\text{and}\quad\theta^{-}=-d^{-1/2}.$$

Furthermore, there is non-reconstruction at the Kesten-Stigum bound, that is, when $\theta=\theta^{+}$ or $\theta=\theta^{-}$.
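The dichotomy in Theorems 1.1 to 1.3 is governed by the sign of $1-6\pi_{1}\pi_{2}$. Under the convention $\pi_{1}\geq\pi_{2}$, solving $\pi_{1}(1-\pi_{1})=\frac{1}{6}$ gives the critical value $\pi_{1}=\frac{1}{2}\left(1+\frac{1}{\sqrt{3}}\right)\approx 0.7887$. The small Python sketch below only restates this classification; the names `PI1_CRITICAL` and `ks_regime` are hypothetical, and the constant $D(\pi)$ of Theorem 1.3 is not computed.

```python
from math import sqrt

# Root of pi1 * (1 - pi1) = 1/6 with pi1 >= 1/2, i.e. of pi1^2 - pi1 + 1/6 = 0.
PI1_CRITICAL = (1 + 1 / sqrt(3)) / 2


def ks_regime(pi1):
    """Classify the stationary distribution (pi1, 1 - pi1) by pi1 * pi2 versus 1/6."""
    product = pi1 * (1 - pi1)
    if product < 1 / 6:
        return "KS bound not tight (Theorems 1.1 and 1.2)"
    if product > 1 / 6:
        return "KS bound tight for d > D(pi) (Theorem 1.3)"
    return "boundary case pi1 * pi2 = 1/6, not covered by the theorems"


print("critical pi1:", PI1_CRITICAL)
for pi1 in (0.5, 0.7, 0.8, 0.9):
    print(pi1, "->", ks_regime(pi1))
```

In particular, the symmetric channel ($\pi_{1}=\frac{1}{2}$) lies in the regime of Theorem 1.3, while strongly asymmetric channels with $\pi_{1}$ above roughly $0.7887$ fall under Theorems 1.1 and 1.2.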

The idea to establish Theorem 1.1, Theorem 1.2 and Theorem 1.3 is the following. One standard way to classify reconstruction and non-reconstruction is to analyze the quantity $x_{n}$: the probability of giving a correct guess of the root given the spins $\sigma(n)$ at distance $n$ from the root, minus the probability of guessing the root according to the stationary initial distribution. Non-reconstruction means that the mutual information between the root and the spins at distance $n$ goes to zero as $n$ tends to infinity. In Lemma 3, we rigorously show that $x_{n}$ is always positive and that non-reconstruction is equivalent to

$$\lim_{n\to\infty}x_{n}=0.$$

To analyze whether reconstruction holds, inspired by Chayes et al. [1986], Borgs et al. [2006] and Sly [2011], we establish the distributional recursion and moment recursion, and then the recursive relation between the $n$th and the $(n+1)$th generation's structure of the tree leads to a corresponding nonlinear dynamical system. In the meantime, we show that the interactions between spins become very weak if they are sufficiently far away from each other. Therefore, in this weakly interacting situation, i.e. when $x_{n}$ is sufficiently small, the concentration analysis can be carried out and an approximation to the dynamical system is established:

$$x_{n+1}\approx d\theta^{2}x_{n}+\frac{1-6\pi_{1}\pi_{2}}{\pi_{1}\pi_{2}^{2}}\frac{d(d-1)}{2}\theta^{4}x_{n}^{2}.$$

The sign of the coefficient of the quadratic term, which is determined by $1-6\pi_{1}\pi_{2}$, plays a crucial role in the asymptotic behavior of $x_{n}$. When $1-6\pi_{1}\pi_{2}>0$, equivalently $\Delta^{2}>(1-\theta)^{2}/3$, if $d\theta^{2}$ is sufficiently close to 1, then $x_{n}$ does not converge to $0$ and there is reconstruction beyond the Kesten-Stigum bound. Our focus is then to find this new reconstruction threshold, which is carried out in the following three steps: Step one, we rigorously show that, when the degree $d$ is large, the interactions between spins become very weak; Step two, using the Central Limit Theorem, we approximate the corresponding collection of small independent samples, to show that the reconstruction function can be asymptotically given by a new Gaussian approximation function $g(s)$, that is, $x_{n+1}\approx g(d\theta^{2}x_{n})$; Step three, we explore the first several major terms of the Maclaurin series of $g(s)$, and rigorously establish the reconstruction threshold by discussing the fixed point of $g(s)$. On the other hand, when $1-6\pi_{1}\pi_{2}<0$, the analysis of large degree asymptotics yields $g(s)<s$, which implies $\lim_{n\to\infty}x_{n}=0$, that is, there is non-reconstruction.
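The role of the sign of $1-6\pi_{1}\pi_{2}$ can be illustrated heuristically by iterating the truncated map $x\mapsto d\theta^{2}x+c\,x^{2}$, with $c=\frac{1-6\pi_{1}\pi_{2}}{\pi_{1}\pi_{2}^{2}}\frac{d(d-1)}{2}\theta^{4}$, exactly at $d\theta^{2}=1$. The Python sketch below is only an illustration of this leading-order approximation, not of the true sequence $x_{n}$: the function name, the starting values and the stopping rule (the iteration stops once $x$ leaves $(0,\pi_{2})$, since $x_{n}=EX^{+}(n)-\pi_{1}\leq\pi_{2}$) are assumptions made here.

```python
def iterate_truncated_recursion(pi1, d, theta, x0, steps):
    """Iterate x -> d*theta^2*x + c*x^2, the leading-order approximation of the
    recursion for x_n; stop early once x leaves (0, pi2), where the small-x
    approximation is no longer meaningful."""
    pi2 = 1 - pi1
    lin = d * theta ** 2
    quad = (1 - 6 * pi1 * pi2) / (pi1 * pi2 ** 2) * d * (d - 1) / 2 * theta ** 4
    xs = [x0]
    for _ in range(steps):
        x = lin * xs[-1] + quad * xs[-1] ** 2
        if not 0 < x < pi2:
            break
        xs.append(x)
    return xs


d = 100
theta = d ** -0.5  # at the Kesten-Stigum bound, d * theta^2 = 1

# 1 - 6*pi1*pi2 > 0: the quadratic term pushes the iterates away from 0.
grow = iterate_truncated_recursion(0.9, d, theta, 1e-3, 20)
# 1 - 6*pi1*pi2 < 0: the quadratic term pulls the iterates towards 0.
decay = iterate_truncated_recursion(0.5, d, theta, 1e-2, 200)

print("positive coefficient:", grow[0], "->", grow[-1])
print("negative coefficient:", decay[0], "->", decay[-1])
```

With a positive coefficient the iterates increase even though the linear factor $d\theta^{2}$ equals 1, while with a negative coefficient they decrease slowly towards $0$, in line with the heuristic picture described above.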

2 Preliminaries

2.1 Notations

Let $u_{1},\ldots,u_{d}$ be the children of $\rho$ and $\mathbb{T}_{v}$ be the subtree of descendants of $v\in\mathbb{T}$. Denote the $n$th level of the tree by $L(n)=\{v\in\mathbb{V}:d(\rho,v)=n\}$, where $d(\cdot,\cdot)$ is the graph distance on $\mathbb{T}$. With the notations above, let $\sigma(n)$ and $\sigma_{j}(n)$ be the spins on $L(n)$ and $L(n)\cap\mathbb{T}_{u_{j}}$ respectively. For a configuration $A$ on $L(n)$, define the posterior function $f_{n}$ by

$$f_{n}(i,A)=P(\sigma_{\rho}=i\mid\sigma(n)=A).$$

By the recursive nature of the tree, for a configuration $A$ on $L(n+1)\cap\mathbb{T}_{u_{j}}$, an equivalent form is given by

$$f_{n}(i,A)=P(\sigma_{u_{j}}=i\mid\sigma_{j}(n+1)=A).$$

Next, with $i=1,2$, define

$$X_{i}=X_{i}(n)=f_{n}(i,\sigma(n)),\quad X^{+}=X^{+}(n)=f_{n}(1,\sigma^{1}(n)),\quad X^{-}=X^{-}(n)=f_{n}(2,\sigma^{2}(n)),$$

and for $1\leq j\leq d$,

$$Y_{j}=Y_{j}(n)=f_{n}(1,\sigma_{j}^{1}(n+1)),$$

where it is clear that the random variables $\{Y_{j}\}_{1\leq j\leq d}$ are independent and identically distributed. It is apparent that

$$X_{1}(n)+X_{2}(n)=1$$

and

$$E(X_{1})=\pi_{1},\quad E(X_{2})=\pi_{2}.$$

We introduce the main quantities of interest in this paper:

$$x_{n}=E(X^{+}(n)-\pi_{1})\quad\text{and}\quad z_{n}=E(X^{+}(n)-\pi_{1})^{2}.$$
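To make these definitions concrete, the quantities $x_{n}=E(X^{+}(n)-\pi_{1})$ and $z_{n}=E(X^{+}(n)-\pi_{1})^{2}$ can be computed exactly on a small tree by enumerating every leaf configuration. The Python sketch below does this for small $d$ and $n$; it also returns $\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}$, which Lemma 1 shows to be equal to $x_{n}$. The helper names (`channel`, `leaf_laws`, `exact_moments`) and the parameter values are assumptions made for illustration, and the enumeration is only feasible for very small trees since there are $2^{d^{n}}$ configurations.

```python
from itertools import product
from math import prod


def channel(theta, delta):
    """Transition matrix and stationary distribution (index 0 = state 1)."""
    M = [[(1 + theta - delta) / 2, (1 - theta + delta) / 2],
         [(1 - theta - delta) / 2, (1 + theta + delta) / 2]]
    shift = delta / (2 * (1 - theta))
    return M, [0.5 - shift, 0.5 + shift]


def leaf_laws(n, d, M):
    """Map each configuration A on level n to (P(A | root=1), P(A | root=2))."""
    laws = {(0,): (1.0, 0.0), (1,): (0.0, 1.0)}  # level 0: the root itself
    for _ in range(n):
        # Law of the leaves below one child, given the state i of its parent.
        sub = {B: tuple(sum(M[i][k] * p[k] for k in range(2)) for i in range(2))
               for B, p in laws.items()}
        laws = {}
        for parts in product(sub.items(), repeat=d):
            A = sum((B for B, _ in parts), ())  # concatenate the d subtrees
            laws[A] = tuple(prod(q[i] for _, q in parts) for i in range(2))
    return laws


def exact_moments(n, d, theta, delta):
    """Return (x_n, z_n, E(X_1 - pi_1)^2 / pi_1) by exact enumeration."""
    M, pi = channel(theta, delta)
    x = z = var = 0.0
    for p1, p2 in leaf_laws(n, d, M).values():
        pA = pi[0] * p1 + pi[1] * p2       # P(sigma(n) = A)
        if pA == 0:
            continue
        f1 = pi[0] * p1 / pA               # posterior f_n(1, A)
        x += p1 * (f1 - pi[0])             # expectation given root = 1
        z += p1 * (f1 - pi[0]) ** 2
        var += pA * (f1 - pi[0]) ** 2      # unconditional expectation
    return x, z, var / pi[0]


x2, z2, v2 = exact_moments(n=2, d=2, theta=0.4, delta=-0.2)
print("x_2 =", x2, " z_2 =", z2, " E(X_1 - pi_1)^2 / pi_1 =", v2)
```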

2.2 Preparations

Before proceeding to the analysis, it is convenient to firstly derive some very useful identities concerning $x_{n}$ .

Lemma 1.

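For any $n\in\mathbb{N}\cup\{0\}$, we have

$$x_{n}=\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}=E(X^{+}(n)-\pi_{1})^{2}+\frac{\pi_{2}}{\pi_{1}}E(X^{-}(n)-\pi_{2})^{2}\geq z_{n}\geq 0.$$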

Proof.

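By Bayes' rule, we have

$$EX^{+}=Ef_{n}(1,\sigma^{1}(n))=\frac{1}{\pi_{1}}E(X_{1}^{2}),$$

and similarly,

$$EX^{-}=Ef_{n}(2,\sigma^{2}(n))=\frac{1}{\pi_{2}}E(X_{2}^{2}).$$

Then it follows from equation (2.2) that

$$x_{n}=\frac{1}{\pi_{1}}\left(E(X_{1}^{2})-\pi_{1}^{2}\right)=\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}.$$

Next by equation (2.1), one has

$$x_{n}=\frac{1}{\pi_{1}}E(X_{2}-\pi_{2})^{2}=\frac{\pi_{2}}{\pi_{1}}\left(EX^{-}(n)-\pi_{2}\right).$$

Therefore, the quantitative relation between $x_{n}$ and $z_{n}$ holds:

$$x_{n}=\frac{1}{\pi_{1}}E(X_{1}-\pi_{1})^{2}=E(X^{+}(n)-\pi_{1})^{2}+\frac{\pi_{2}}{\pi_{1}}E(X^{-}(n)-\pi_{2})^{2}\geq z_{n}\geq 0.$$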

∎
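For readers who want the Bayes step of this proof spelled out, the following is a sketch of the standard computation, written with the notation of Section 2.1 and using only $P(\sigma_{\rho}=1)=\pi_{1}$ and $X_{1}-\pi_{1}=-(X_{2}-\pi_{2})$. It is offered as an illustration consistent with the identities of Lemma 1, not as a transcription of the original display.

```latex
\begin{aligned}
E X^{+}(n)
  &= \sum_{A} P(\sigma(n)=A \mid \sigma_{\rho}=1)\, f_{n}(1,A)
   = \sum_{A} \frac{P(\sigma(n)=A)\, f_{n}(1,A)}{\pi_{1}}\, f_{n}(1,A)
   = \frac{1}{\pi_{1}}\, E\big(X_{1}^{2}\big), \\[4pt]
E (X_{1}-\pi_{1})^{2}
  &= \pi_{1}\, E\big[(X_{1}-\pi_{1})^{2} \mid \sigma_{\rho}=1\big]
   + \pi_{2}\, E\big[(X_{2}-\pi_{2})^{2} \mid \sigma_{\rho}=2\big] \\
  &= \pi_{1}\, E\big(X^{+}(n)-\pi_{1}\big)^{2}
   + \pi_{2}\, E\big(X^{-}(n)-\pi_{2}\big)^{2}.
\end{aligned}
```

Dividing the second identity by $\pi_{1}$ gives the decomposition $x_{n}=z_{n}+\frac{\pi_{2}}{\pi_{1}}E(X^{-}(n)-\pi_{2})^{2}$, from which $x_{n}\geq z_{n}\geq 0$ is immediate.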

With the preceding results, we calculate the means and variances of $Y_{j}$ .

Lemma 2.

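For each $1\leq j\leq d$, we have

$$E(Y_{j}-\pi_{1})=\theta x_{n}\quad\text{and}\quad E(Y_{j}-\pi_{1})^{2}=\pi_{1}x_{n}+\theta(z_{n}-\pi_{1}x_{n}).$$

Proof.

If $\sigma_{u_{j}}^{1}=1$, $Y_{j}$ is distributed according to $X^{+}(n)$, while if $\sigma_{u_{j}}^{1}=2$, $Y_{j}$ is distributed according to $1-X^{-}(n)$. Therefore we have

$$E(Y_{j}-\pi_{1})=P(\sigma_{u_{j}}^{1}=1)E(X^{+}(n)-\pi_{1})+P(\sigma_{u_{j}}^{1}=2)E(1-X^{-}(n)-\pi_{1})=M_{11}x_{n}-M_{12}\frac{\pi_{1}}{\pi_{2}}x_{n}=\theta x_{n}$$

and similarly we have

$$E(Y_{j}-\pi_{1})^{2}=M_{11}z_{n}+M_{12}\frac{\pi_{1}}{\pi_{2}}(x_{n}-z_{n})=\pi_{1}x_{n}+\theta(z_{n}-\pi_{1}x_{n}),$$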

as desired. ∎
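The algebra behind the two simplifications in this proof can be checked by hand. The sketch below uses only the form of $\mathbf{M}$ and $\pi$ from Section 1.1, which give $M_{11}=\pi_{1}+\pi_{2}\theta$ and $M_{12}=\pi_{2}(1-\theta)$, together with $E(X^{-}(n)-\pi_{2})^{2}=\frac{\pi_{1}}{\pi_{2}}(x_{n}-z_{n})$ from Lemma 1.

```latex
\begin{aligned}
M_{11} - M_{12}\,\frac{\pi_{1}}{\pi_{2}}
  &= \pi_{1} + \pi_{2}\theta - \pi_{1}(1-\theta)
   = (\pi_{1}+\pi_{2})\,\theta = \theta, \\[4pt]
M_{11}\, z_{n} + M_{12}\,\frac{\pi_{1}}{\pi_{2}}\,(x_{n}-z_{n})
  &= (\pi_{1}+\pi_{2}\theta)\, z_{n} + \pi_{1}(1-\theta)\,(x_{n}-z_{n}) \\
  &= \pi_{1} x_{n} + \theta\,\big(z_{n} - \pi_{1} x_{n}\big).
\end{aligned}
```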

2.3 An Equivalent Condition for Non-reconstruction

If the reconstruction problem is solvable, then $\sigma(n)$ contains significant information on the root variable, which may be formulated in several equivalent ways (see Mossel [2001], Proposition 14). As a result, in order to analyze the reconstruction, it suffices to investigate the asymptotic behavior of $x_{n}$ as $n$ goes to infinity.

Lemma 3.

Non-reconstruction is equivalent to

$$\lim_{n\to\infty}x_{n}=0.$$

Proof.

The maximum-likelihood algorithm, which is the optimal reconstruction algorithm of $\sigma_{\rho}$ given $\sigma(n)$, is successful with probability

$$\Delta_{n}=E\max\{X_{1}(n),X_{2}(n)\}.$$

Therefore, the inequality $x_{n}+\pi_{1}\leq\Delta_{n}$ holds, which is analogous to a result of Mézard and Montanari [2006], by noting that the algorithm that chooses $\sigma_{\rho}$ randomly according to the probabilities $X_{i}$ is correct with probability $x_{n}+\pi_{1}$. On the other hand, recalling the assumption that $\pi_{1}\geq\pi_{2}$, by the Cauchy-Schwarz inequality together with the identities in equations (2.1) and (2.3), one can conclude

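$$\Delta_{n}\leq\pi_{1}+E\max\{X_{1}(n)-\pi_{1},X_{2}(n)-\pi_{2}\}=\pi_{1}+E|X_{1}(n)-\pi_{1}|\leq\pi_{1}+\left(E(X_{1}(n)-\pi_{1})^{2}\right)^{1/2}=\pi_{1}+\pi_{1}^{1/2}x_{n}^{1/2}.$$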

Hence, one has

$$x_{n}\leq\Delta_{n}-\pi_{1}\leq\pi_{1}^{1/2}x_{n}^{1/2},$$

implying that $x_{n}$ converging to $0$ is equivalent to $\Delta_{n}$ converging to $\pi_{1}$, which is equivalent to non-reconstruction (see Mossel [2001]). ∎

3 Moment Recursion

3.1 Distributional Recursion

The last section shows that the asymptotic behavior of $x_{n}$ as $n$ goes to infinity plays a crucial role; however, obtaining an explicit expression for $x_{n}$ is both too difficult and unnecessary. In fact, we only need to investigate the recursive formula of $x_{n}$, from which it is possible to determine the trend of $x_{n}$ as $n$ goes to infinity. Thus the key idea is to analyze the recursive relation between $X^{+}(n)$ and $X^{+}(n+1)$ by the structure of the tree. Suppose that $A$ is a configuration on $L(n+1)$ and let $A_{j}$ be its restriction to $\mathbb{T}_{u_{j}}\cap L(n+1)$. Then from the Markov random field property, we have

[TABLE]

where

[TABLE]

and

[TABLE]

Next conditioning the root being $1$ and setting $A=\sigma^{1}(n+1)$ , we have

[TABLE]

where

[TABLE]

and

[TABLE]
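The recursion of this subsection can be turned into a short program. The Python sketch below assumes that the recursion takes the standard belief-propagation form $f_{n+1}(1,A)=\frac{\pi_{1}Z_{1}}{\pi_{1}Z_{1}+\pi_{2}Z_{2}}$ with $Z_{i}=\prod_{j=1}^{d}\sum_{k}\frac{M_{ik}}{\pi_{k}}f_{n}(k,A_{j})$; this form follows from Bayes' rule and the Markov property and is consistent with the quantities $\pi_{1}Z_{1}$ and $\pi_{1}Z_{1}+\pi_{2}Z_{2}$ used in Section 3.2, but the exact notation of the paper's displays is an assumption here. The names `channel` and `posterior` and the nested-tuple encoding of leaf data are hypothetical.

```python
from math import prod


def channel(theta, delta):
    """Transition matrix and stationary distribution (index 0 = state 1)."""
    M = [[(1 + theta - delta) / 2, (1 - theta + delta) / 2],
         [(1 - theta - delta) / 2, (1 + theta + delta) / 2]]
    shift = delta / (2 * (1 - theta))
    return M, [0.5 - shift, 0.5 + shift]


def posterior(A, M, pi):
    """Posterior (f(1, A), f(2, A)) of the spin at the top of a subtree.

    A is a nested tuple: a leaf is an int in {0, 1} (states 1 and 2 of the
    paper) and an internal vertex is the tuple of its d subtrees.
    """
    if isinstance(A, int):  # level 0: the spin itself is observed
        return (1.0, 0.0) if A == 0 else (0.0, 1.0)
    child_posts = [posterior(A_j, M, pi) for A_j in A]
    # Z[i] = prod_j sum_k (M[i][k] / pi[k]) * f(k, A_j)
    Z = [prod(sum(M[i][k] / pi[k] * f[k] for k in range(2)) for f in child_posts)
         for i in range(2)]
    norm = pi[0] * Z[0] + pi[1] * Z[1]
    return (pi[0] * Z[0] / norm, pi[1] * Z[1] / norm)


M, pi = channel(0.4, -0.2)
A = ((0, 0), (0, 1))  # a configuration on level 2 of the binary tree
print(posterior(A, M, pi))
```

When every child posterior equals $\pi$, each factor of $Z_{i}$ is $\sum_{k}M_{ik}=1$, so $Z_{1}=Z_{2}=1$ and the recursion returns $\pi$ again, which is why $\pi_{1}Z_{1}+\pi_{2}Z_{2}-1$ is a natural small parameter in the expansion that follows.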

3.2 Main Expansion of $x_{n+1}$

With the help of these preliminary results, we now derive the recursive relation for $x_{n+1}$, specifically its main expansion, which plays a crucial role in further discussions. First we take care of the approximate means and variances of $Z_{i}$ and the Taylor series approximations.

Lemma 4.

For each positive integer $k$ , there exists a $C=C(\pi,k)$ which only depends on $\pi$ and $k$ , such that for each $0\leq\ell,m\leq k$ ,

[TABLE]

where

[TABLE]

Proof.

Since $\{Y_{j}\}_{1\leq j\leq d}$ are independent and identically distributed, we have

[TABLE]

It follows from $0\leq Y_{1}\leq 1$ that $|Y_{1}(n)-\pi_{1}|\leq 1$ , and then when $i\geq 2$ , we have $|Y_{1}(n)-\pi_{1}|^{i}\leq(Y_{1}(n)-\pi_{1})^{2}$ . Therefore Lemma 2 implies that

[TABLE]

where $\{c_{i}\}_{i=1}^{\ell+m}$ and $c$ depend on $\pi$ and $k$ only. Consequently, we have $d|u|\leq cx_{n}$ by means of $d\theta^{2}\leq 1$. Using the binomial expansion and the Remainder Theorem, we have

[TABLE]

Taking $h=0,1,2$ respectively and $\displaystyle C=\max_{h\in\{0,1,2\}}\left\{e^{c}c^{h+1}\right\}$ completes the proof. ∎

Next we aim to figure out the recursive relation of $x_{n+1}$ by virtue of the following identity

[TABLE]

Specifically, plugging $a=\pi_{1}Z_{1}$ , $r=\pi_{1}Z_{1}+\pi_{2}Z_{2}-1$ and $s=1$ in equation (3.2) yields

[TABLE]

In the following, we analyze terms in equation (3.3), using the notation $O_{\pi}$ to emphasize that the constant associated with the $O$ -term depends on $\pi$ only

[TABLE]

and

[TABLE]

Then the preceding results yield

[TABLE]

Similarly, we have

[TABLE]

and then

[TABLE]

As a consequence, we have

[TABLE]

where

[TABLE]

with $C_{R}$ a constant depending only on $\pi$ , and

[TABLE]

which will be handled in the following concentration investigation.

4 Concentration Analysis

Noting that $Z_{1},Z_{2}\geq 0$ , we have $0\leq\frac{\pi_{1}Z_{1}}{\pi_{1}Z_{1}+\pi_{2}Z_{2}}\leq 1$ . It is concluded from equations (3.3) and (3.4) that

[TABLE]

where $C=C(\pi)$ depends only on $\pi$ . In equation (4.1), the first inequality follows from Lemma 1 which states that $0\leq z_{n}\leq x_{n}$ , and the last inequality holds if $x_{n}<\delta$ for $\delta=\delta(\pi,\varepsilon)$ small enough. The following lemma ensures that $x_{n}$ does not drop too fast.

Lemma 5.

For any $\varrho>0$ , there exists a constant $\gamma=\gamma(\pi,\varrho)>0$ , such that for all $n$ when $|\theta|>\varrho$ ,

[TABLE]

Proof.

For a configuration $A$ on $\mathbb{T}_{u_{1}}\cap L(n+1)$ , we have

[TABLE]

and then

[TABLE]

Therefore, it follows from equation (2.4) that

[TABLE]

namely,

[TABLE]

Next choosing $\varepsilon=\varrho^{2}$ , equation (4.1) indicates that there exists a $\delta=\delta(\pi,\varepsilon)>0$ , such that if $x_{n}<\delta$ then

[TABLE]

On the other hand, if $x_{n}\geq\delta$ , equation (4.2) becomes $x_{n+1}\geq\varrho^{4}\delta x_{n}$ . Finally taking $\gamma=\min\{\varrho^{2},\varrho^{4}\delta\}$ completes the proof. ∎

Actually, it can be seen from equation (3.5) that the estimates of $R$ and $S$ play a key role in the recursive expression of $x_{n+1}$; hence we will verify that $\frac{\pi_{1}Z_{1}}{\pi_{1}Z_{1}+\pi_{2}Z_{2}}$ and $\frac{z_{n}}{x_{n}}$ are both sufficiently concentrated around $\pi_{1}$, analogous to the concentration analysis in Sly [2011]. In the following lemma, we first establish a technical uniqueness result in which the set of vertices that can be conditioned on is limited to a set of $k$ vertices.

Lemma 6.

For any $\varepsilon>0$ and positive integer $k$ , there exists $M=M(\pi,\varepsilon,k)$ , such that for any collection of vertices $v_{1},\ldots,v_{k}\in L(M)$ ,

[TABLE]

Proof.

Denote the entries of the transition matrix at distance $s$ as

[TABLE]

and it is natural that $M_{1,2}^{s}=1-U_{s}$ and $M_{2,1}^{s}=1-V_{s}$ . As a result, it follows that

[TABLE]

which yields a second order recursive formula

[TABLE]

with the initial conditions $U_{0}=1$ and $U_{1}=M_{11}=\pi_{1}+\pi_{2}\theta$ . Then the general solutions are given by

[TABLE]

Consequently, under the condition of $d\theta^{2}\leq 1$ , we have

[TABLE]
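The behavior of $U_{s}$ is easy to check numerically. Since $\mathbf{M}$ has eigenvalues $1$ and $\theta$, a second order linear recursion with these characteristic roots reads $U_{s+1}=(1+\theta)U_{s}-\theta U_{s-1}$, and with the initial conditions $U_{0}=1$ and $U_{1}=\pi_{1}+\pi_{2}\theta$ its solution is $U_{s}=\pi_{1}+\pi_{2}\theta^{s}$. Both formulas are stated here as assumptions derived from the eigenvalues and initial conditions rather than as transcriptions of the displays above. The Python sketch below compares them with direct powers of $\mathbf{M}$; the parameter values are illustrative.

```python
def matmul2(A, B):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


theta, pi1 = 0.3, 0.7  # illustrative values (assumptions)
pi2 = 1 - pi1
# The channel written through pi: M_11 = pi1 + pi2*theta, M_22 = pi2 + pi1*theta.
M = [[pi1 + pi2 * theta, pi2 * (1 - theta)],
     [pi1 * (1 - theta), pi2 + pi1 * theta]]

U = []                       # U[s] = (M^s)_{11}
P = [[1.0, 0.0], [0.0, 1.0]]  # M^0
for s in range(12):
    U.append(P[0][0])
    P = matmul2(P, M)

closed = [pi1 + pi2 * theta ** s for s in range(12)]
max_gap = max(abs(u - c) for u, c in zip(U, closed))
recursion_gap = max(abs(U[s + 1] - ((1 + theta) * U[s] - theta * U[s - 1]))
                    for s in range(1, 11))
print("max |U_s - (pi1 + pi2*theta^s)| =", max_gap)
print("max recursion residual =", recursion_gap)
```

The closed form makes the decay explicit: $U_{s}-\pi_{1}=\pi_{2}\theta^{s}$, so under $d\theta^{2}\leq 1$ the dependence on the starting state is at most of order $d^{-s/2}$.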

For fixed $\pi$ , $d$ and $k$ , define

[TABLE]

and let $N=N(\pi,\varepsilon,k)$ be a sufficiently large integer such that

[TABLE]

where the last inequality holds by the fact that $d^{-s/2}\leq 2^{-s/2}\to 0$ as $s\to\infty$ which implies $B(s)\to 1$ uniformly for all $d$ .

Now fix an integer $M$ such that $M>kN$ and choose any $v_{1},\ldots,v_{k}\in L(M)$. For $0\leq\ell\leq M$, define $n(\ell)$ as the number of vertices at distance $\ell$ from the root with a descendant in the set $\{v_{1},\ldots,v_{k}\}$, that is

[TABLE]

Then according to the definition, it is trivial to see that $n(\ell)$ is a nondecreasing integer-valued function of $\ell$ from $n(0)=1$ to $n(M)=k$, which, by the pigeonhole principle, implies that there must exist some $\ell$ such that $n(\ell)=n(\ell+N)$. Next, denote by $\{w_{1},\ldots,w_{n(\ell)}\}$ and $\{\overline{w}_{1},\ldots,\overline{w}_{n(\ell)}\}$ the vertices in the sets $\{v\in L(\ell+N):|\mathbb{T}_{v}\cap\{v_{1},\ldots,v_{k}\}|>0\}$ and $\{v\in L(\ell):|\mathbb{T}_{v}\cap\{v_{1},\ldots,v_{k}\}|>0\}$ respectively, such that $w_{j}$ is the descendant of $\overline{w}_{j}$, and then

[TABLE]
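The pigeonhole step can be illustrated with a few lines of code. In the Python sketch below, a vertex of $L(M)$ is encoded as the tuple of child indices on the path from the root, so the ancestor at level $\ell$ is the prefix of length $\ell$ and $n(\ell)$ is the number of distinct prefixes. The encoding, the function names and the random choice of vertices are assumptions made for illustration.

```python
import random


def n_level(vertices, level):
    """n(level): number of vertices at this level with a descendant in `vertices`."""
    return len({v[:level] for v in vertices})


def find_stable_level(vertices, N):
    """Smallest level l with n(l) == n(l + N), or None if there is none."""
    M = len(vertices[0])
    for level in range(M - N + 1):
        if n_level(vertices, level) == n_level(vertices, level + N):
            return level
    return None


rng = random.Random(0)
d, k, N = 3, 4, 2
M = k * N + 1  # any M > k * N works
vertices = [tuple(rng.randrange(d) for _ in range(M)) for _ in range(k)]

counts = [n_level(vertices, level) for level in range(M + 1)]
level = find_stable_level(vertices, N)
print("n(0..M) =", counts)
print("stable level:", level)
```

Because $n(\ell)$ is nondecreasing with at most $k-1$ increases, at least one of the $k$ windows $[jN,(j+1)N]$, $0\leq j\leq k-1$, contains no increase, so `find_stable_level` always returns a level when $M>kN$.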

By Bayes’ Rule and the Markov random field property, for any $i_{1},\ldots,i_{n(\ell)}\in\mathcal{C}$ , we have

[TABLE]

which implies that

[TABLE]

Hence, for the reason that $\sigma_{\rho}$ is conditionally independent of the collection $\sigma_{v_{1}},\ldots,\sigma_{v_{k}}$ given $\sigma_{w_{1}},\ldots,\sigma_{w_{n(\ell)}}$ , one has

[TABLE]

∎

Lemma 7.

Assume $|\theta|>\varrho$ for some $\varrho>0$ . Given arbitrary $\varepsilon,\alpha>0$ , there exist constants $C=C(\pi,\varepsilon,\alpha,\varrho)>0$ and $N=N(\pi,\varepsilon,\alpha)$ , such that whenever $n\geq N$ ,

[TABLE]

Proof.

Fix an integer $k$ with $k>\alpha$. Choose $M$ so that Lemma 6 holds with bound $\varepsilon/2$. Let $v_{1},\ldots,v_{|L(M)|}$ denote the vertices in $L(M)$ and define

[TABLE]

where $\sigma_{v}^{1}(n+1)$ denotes the spins of vertices in $\mathbb{T}_{v}\cap L(n+1)$ . Then $W(v)$ would be distributed as

[TABLE]

The recursion formula in equation (3.1) together with the fact that $1-W(v)=f_{n+1-M}(2,\sigma_{v}^{1}(n+1))$ , yield a function

[TABLE]

where $W_{i}=W(v_{i})$ for $1\leq i\leq|L(M)|$ . There is no difficulty in finding that when all the entries $W_{i}$ are identically $\pi_{1}$ one has

[TABLE]

and $H$ is a continuous function of the vector $(W_{i})_{1\leq i\leq|L(M)|}$ . Therefore, by Lemma 6, if there are at most $k$ vertices in $L(M)$ such that $W(v)\neq\pi_{1}$ , then

[TABLE]

and there exists some $\delta=\delta(\pi,\varepsilon)>0$ such that if

[TABLE]

then

[TABLE]

Next, by Chebyshev’s inequality together with equation (4.6), the following result holds:

[TABLE]

Random variables $|W(v)-\pi_{1}|$ for distinct $v$ are conditionally independent given $\sigma(M)$ , so there exist suitable constants $C(\pi,\varepsilon,\alpha,\varrho)$ and $N(\pi,\varepsilon,\alpha)$ , such that when $n\geq N$ , one has

[TABLE]

where $\mathbf{B}\left(\cdot,\cdot\right)$ denotes the binomial distribution and the last inequality holds due to Lemma 5. ∎

Now, we are able to bound $S$ and $R$ in equation (3.7) using the preceding concentration results.

Proposition 1.

Assume $|\theta|>\varrho$ for some $\varrho>0$ . For any $\varepsilon>0$ , there exist $N=N(\pi,\varepsilon)$ and $\delta=\delta(\pi,\varepsilon,\varrho)>0$ , such that if $n\geq N$ and $x_{n}\leq\delta$ then $|S|\leq\varepsilon x_{n}^{2}$ .

Proof.

For any $\eta>0$, using the Cauchy-Schwarz inequality and Lemma 7, one has

[TABLE]

Also, it follows from equation (3.4) and Lemma 4 respectively that

[TABLE]

and

[TABLE]

Taking $\alpha=6$ in Lemma 7, there exist $C_{3}=C_{3}(\pi,\eta,\varrho)$ and $N=N(\pi,\eta)$ , such that if $n\geq N$ then

[TABLE]

Finally taking $\eta=\varepsilon/(2C_{1})$ and $\delta=\varepsilon/(2C_{2}C_{3})$ , we have that if $n\geq N$ and $x_{n}\leq\delta$ then

[TABLE]

∎

Proposition 2.

Assume $|\theta|>\varrho$ for some $\varrho>0$ . For any $\varepsilon>0$ , there exist $N=N(\pi,\varepsilon)$ and $\delta=\delta(\pi,\varepsilon,\varrho)$ , such that if $n\geq N$ and $x_{n}\leq\delta$ then

[TABLE]

Proof.

Plugging $a=(Z_{1}-Z_{2})^{2}$ , $r=(\pi_{1}Z_{1}+\pi_{2}Z_{2})^{2}-1$ and $s=1$ in the identity equation (3.2), we have

[TABLE]

Next we estimate these expectation terms one by one, where the $O_{\pi}$ constants depend only on $\pi$:

[TABLE]

where we used the fact that $\pi_{1}^{2}\pi_{2}^{2}(Z_{1}-Z_{2})^{2}/(\pi_{1}Z_{1}+\pi_{2}Z_{2})^{2}\leq 1$ in the last inequality. Therefore, the recursion formula of $z_{n+1}$ can be written as

[TABLE]

In the rest of the proof, we let $\{C_{i}\}_{i=1,2,3,4}$ be constants depending only on $\pi$. It follows from equation (4.1) that

[TABLE]

and in view of $d\theta^{2}\geq 1/2$ , there exists $\delta_{1}=\delta_{1}(\pi)>0$ , such that if $x_{n}\leq\delta_{1}$ then

[TABLE]

Consequently,

[TABLE]

For any $k\in\mathbb{N}$ , by equation (4.1), there exists a $\delta_{2}=\delta_{2}(\pi,k)$ , such that if $x_{n}\leq\delta_{2}$ then $x_{n+\ell}\leq 2\delta_{2}\leq\delta_{1}$ for any $1\leq\ell\leq k$ . Now iterating $k$ times the inequality (4.8) yields

[TABLE]

Therefore, noting that $|\theta|\leq d^{-1/2}\leq 2^{-1/2}$ , and taking $k=k(\varepsilon)$ large enough and $\delta_{3}=\delta_{3}(\pi,\varepsilon,k)=\delta_{3}(\pi,\varepsilon)$ sufficiently small, we obtain that if $x_{n}\leq\delta_{3}$ then

[TABLE]

where the first inequality relies on the fact that $|z_{n}/x_{n}-\pi_{1}|<1$ . At last, choosing $N=N(\pi,\varepsilon)>k$ and $\delta=\delta(\pi,\varepsilon,\varrho)=\gamma^{k}\delta_{3}$ , and noting that by Lemma 5 if $n\geq N$ and $x_{n}\leq\delta$ then $x_{n-k}\leq\gamma^{-k}x_{n}\leq\delta_{3}$ , the previous result in equation (4.9) completes the proof. ∎

5 Proof of Theorem 1.1

To complete the proof, it suffices to show that when $d\theta^{2}$ is close enough to $1$, $x_{n}$ does not converge to $0$. For convenience, we suppose that $d\theta^{2}\geq 1/2$. For any fixed $d$ and $\pi$, we then have $|\theta|\geq(2d)^{-1/2}$, and we take $\varrho=(2d)^{-1/2}$ in Lemma 5 to obtain $\gamma=\gamma(\pi,d)>0$. When $1-6\pi_{1}\pi_{2}>0$, by Proposition 1 and Proposition 2, there exist $N=N(\pi)$ and $\delta=\delta(\pi,d)>0$, such that if $n\geq N$ and $x_{n}\leq\delta$, then the remainders in equation (3.5) can be bounded respectively as

[TABLE]

and

[TABLE]

As a consequence,

[TABLE]

Furthermore, in light of $x_{0}=1-\pi_{1}=\pi_{2}$ and Lemma 5, for all $n$ we have

[TABLE]

Define $\varepsilon=\varepsilon(\pi,d)=\min\{\pi_{2}\gamma^{N},\delta\gamma\}>0$ . Then equation (5.4) implies that $x_{n}\geq\varepsilon$ when $n\leq N$ . Next, by choosing suitable $|\theta|<d^{-1/2}$ , we achieve

[TABLE]

for the reason that $\varepsilon$ is independent of $\theta$ . Now, suppose $x_{n}\geq\varepsilon$ for some $n\geq N$ . If $x_{n}\geq\gamma^{-1}\varepsilon$ , then Lemma 5 gives $x_{n+1}\geq\gamma x_{n}\geq\varepsilon$ . If $\varepsilon\leq x_{n}\leq\gamma^{-1}\varepsilon\leq\delta$ , then by equation (5.3) and equation (5.5), we have

[TABLE]

Hence it can be shown by induction that $x_{n}\geq\varepsilon$ for all $n$ , namely, the Kesten-Stigum bound is not tight.

6 Large Degree Asymptotics

6.1 Gaussian Approximation

For $1\leq j\leq d$ , define

[TABLE]

Lemma 8.

There exist positive constants $C=C(\pi)$ and $D=D(\pi)$ , such that when $d>D$ ,

[TABLE]

Proof.

Starting with the Taylor series expansion of $\log(1+w)$ , there exists a constant $W>0$ , such that when $|w|<W$ ,

[TABLE]

Taking $D=D(\pi)$ sufficiently large, when $d>D$ , we have that $|\theta|\leq d^{-\frac{1}{2}}$ is small enough to guarantee equation (6.1) for $w=\theta(Y_{j}-\pi_{1})/\pi_{1}$ and then

[TABLE]

for some constant $C=C(\pi)$, where the third inequality follows from $0\leq z_{n}\leq x_{n}\leq 1$. The remaining estimates follow similarly. ∎

Next, define a 2-dimensional vector $\mu=(\mu_{1},\mu_{2})$ with $\mu_{1}=\frac{1}{2\pi_{1}},\mu_{2}=-\frac{1+\pi_{2}}{2\pi_{2}^{2}}$ , and a $2\times 2$ -covariance matrix

[TABLE]

which is a positive semi-definite symmetric $2\times 2$ matrix. Let $(G_{1},G_{2})$ have the Gaussian distribution $\mathbf{N}(\mu,\Sigma)$. The following lemma can then be established by the Central Limit Theorem, the Gaussian approximation and the Portmanteau Theorem.

Lemma 9.

Let $\psi:\mathbb{R}^{2}\mapsto\mathbb{R}$ be a differentiable bounded function. For any $\varepsilon>0$ , there exists $D=D(\pi,\psi,\varepsilon)>0$ , such that if $d>D$ then

[TABLE]

Next, define

[TABLE]

and then

[TABLE]

If $(W_{1},W_{2})$ has the Gaussian distribution $\mathbf{N}(0,\Sigma)$ , then $(s\mu_{1}+\sqrt{s}W_{1},s\mu_{2}+\sqrt{s}W_{2})$ is distributed according to $\mathbf{N}(s\mu,s\Sigma)$ . At last, define

[TABLE]

Therefore, Lemma 9 implies the following approximation to $x_{n+1}$ .

Lemma 10.

For arbitrary $\varepsilon>0$ , there exists a $D=D(\pi,\varepsilon)>0$ , such that whenever $d>D$ ,

[TABLE]

6.2 Asymptotic Estimation of the Reconstruction Threshold

In order to estimate $x_{n+1}$ , it suffices to investigate the properties of $g(s)$ on the interval $[0,\pi_{2}]$ , considering that $0\leq x_{n}\leq\pi_{2}$ and $d\theta^{2}\leq 1$ .

Lemma 11.

The function $g(s)$ is continuously differentiable and increasing on the interval $(0,\pi_{2}]$ .

Proof.

When $s>0$ , it is concluded that

[TABLE]

by the fact that $\left|\frac{\pi_{2}}{\pi_{1}}e^{t}\Big/\left(1+\frac{\pi_{2}}{\pi_{1}}e^{t}\right)^{2}\right|\leq 1/4$ holds for any $t\in\mathbb{R}$. This establishes the differentiability with respect to $s$.
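The $1/4$ bound used here is the elementary inequality $x/(1+x)^{2}\leq 1/4$ for $x\geq 0$, applied with $x=\frac{\pi_{2}}{\pi_{1}}e^{t}$. It follows from $(1-x)^{2}\geq 0$, with equality at $x=1$. The sketch below checks it numerically on a grid. The value of $\pi_{1}$ and the grid are illustrative assumptions, and `logistic_kernel` is a hypothetical helper name.

```python
import math


def logistic_kernel(t: float, pi1: float) -> float:
    """Evaluate c*e^t / (1 + c*e^t)^2 with c = pi2/pi1 and pi2 = 1 - pi1."""
    x = (1.0 - pi1) / pi1 * math.exp(t)
    return x / (1.0 + x) ** 2


pi1 = 0.3  # illustrative value, not taken from the paper
grid = [i / 100.0 for i in range(-2000, 2001)]  # t in [-20, 20]
max_on_grid = max(logistic_kernel(t, pi1) for t in grid)

# 1/4 - x/(1+x)^2 = (1-x)^2 / (4(1+x)^2) >= 0, with equality at x = 1,
# that is, at t = log(pi1/pi2).
value_at_peak = logistic_kernel(math.log(pi1 / (1.0 - pi1)), pi1)
```

The bound is uniform in $t$ and in $\pi$, which is what makes it usable as a dominating constant when differentiating under the expectation.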

Now, let $(W_{1}^{\prime},W_{2}^{\prime})$ be an independent copy of $(W_{1},W_{2})$ . Thus if $0\leq s^{\prime}\leq s$ , it is feasible to construct equivalent distributions such as

[TABLE]

In view of $(W_{1},W_{2})\sim\mathbf{N}(0,\Sigma)$ , it follows that $\mathbf{E}(W_{2}-W_{1})=0$ and

[TABLE]

which implies that $W_{2}-W_{1}$ and $W_{2}^{\prime}-W_{1}^{\prime}$ are both distributed as $\mathbf{N}(0,a)$, with $a=1/(\pi_{1}\pi_{2}^{2})$.
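The constant $a$ also controls the drift of the Gaussian variable used in Lemma 12. Reading $a$ as $1/(\pi_{1}\pi_{2}^{2})$ and using $\mu_{1}=\frac{1}{2\pi_{1}}$, $\mu_{2}=-\frac{1+\pi_{2}}{2\pi_{2}^{2}}$ from Section 6.1 together with $\pi_{1}+\pi_{2}=1$, one gets $\mu_{2}-\mu_{1}=-a/2$. The following sketch verifies this identity in exact rational arithmetic. The sample values of $\pi_{1}$ and the helper name `drift_and_variance` are assumptions made for illustration.

```python
from fractions import Fraction


def drift_and_variance(pi1: Fraction):
    """Return (mu2 - mu1, a) for mu1 = 1/(2*pi1), mu2 = -(1+pi2)/(2*pi2^2),
    a = 1/(pi1*pi2^2), with pi2 = 1 - pi1."""
    pi2 = 1 - pi1
    mu1 = 1 / (2 * pi1)
    mu2 = -(1 + pi2) / (2 * pi2 ** 2)
    a = 1 / (pi1 * pi2 ** 2)
    return mu2 - mu1, a


# Exact rational check on a few illustrative values of pi1.
checks = [drift_and_variance(Fraction(p, 10)) for p in range(1, 10)]
all_match = all(drift == -a / 2 for drift, a in checks)
```

Exact fractions are used instead of floats so that the comparison is an identity check and not a tolerance check.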

Next, it is well known that if $W$ has the distribution $\mathbf{N}(\mu,\sigma^{2})$, the expectation of its exponential can be computed as

[TABLE]

based on which, we are able to estimate the conditional expectation given $W_{1}$ and $W_{2}$ :

[TABLE]

Then applying Jensen’s inequality, and considering that the function $(1+x)^{-1}$ is convex and

[TABLE]

we have

[TABLE]

as desired. ∎

It is necessary to discuss the Taylor expansion of $g(s)$ in a small neighborhood of $s=0$.

Lemma 12.

For small $s>0$ , we have

[TABLE]

Proof.

Define $W=s(\mu_{2}-\mu_{1})+\sqrt{s}(W_{2}-W_{1})$ . By the results in Lemma 11, it is apparent that $W\sim\mathbf{N}\left(-as/2,as\right)$ . Therefore by equation (6.2) the following moments can be calculated:

[TABLE]
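For orientation, the Gaussian moment generating function gives $\mathbf{E}e^{kW}=\exp\left(k(k-1)as/2\right)$ when $W\sim\mathbf{N}(-as/2,as)$, so in particular $\mathbf{E}e^{W}=1$. The sketch below compares this closed form with a plain trapezoidal quadrature. It is an illustration under assumed values of $a$ and $s$, and the display above may list its moments in a different form.

```python
import math


def exp_moment_closed_form(k: int, a: float, s: float) -> float:
    """E[exp(k*W)] for W ~ N(-a*s/2, a*s), from the Gaussian moment
    generating function exp(k*m + k^2*v/2) with m = -a*s/2, v = a*s."""
    return math.exp(k * (k - 1) * a * s / 2.0)


def exp_moment_quadrature(k: int, a: float, s: float, steps: int = 20000) -> float:
    """Approximate the same expectation with a trapezoidal rule over
    +-12 standard deviations around the tilted mean."""
    mean, var = -a * s / 2.0, a * s
    sd = math.sqrt(var)
    center = mean + k * var  # where exp(k*w) * density(w) peaks
    lo, hi = center - 12.0 * sd, center + 12.0 * sd
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        w = lo + i * h
        density = math.exp(-(w - mean) ** 2 / (2.0 * var)) / math.sqrt(2.0 * math.pi * var)
        weight = 0.5 if i in (0, steps) else 1.0
        total += weight * math.exp(k * w) * density
    return total * h


a, s = 27.0 / 4.0, 0.05  # illustrative: pi1 = 1/3 gives a = 27/4
moments = {
    k: (exp_moment_closed_form(k, a, s), exp_moment_quadrature(k, a, s))
    for k in (1, 2, 3)
}
```

Each entry of `moments` pairs the closed form with its numerical approximation, so the two columns can be compared directly for $k=1,2,3$.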

Next starting from the identity

[TABLE]

we obtain the power series of $g(s)$ as

[TABLE]

that is,

[TABLE]

∎

6.3 Proof of Theorem 1.2

In this section, we precisely rephrase Theorem 1.2 and give its rigorous proof.

Theorem 6.1.

When $\pi_{1}\pi_{2}<\frac{1}{6}$ , define

[TABLE]

Then $0<\omega^{*}<1$ , and for any $\delta>0$ there exists a $D=D(\pi,\delta)$ , such that if $d>D$ then the model has reconstruction when $d\theta^{2}\geq\omega^{*}+\delta$ , but does not have reconstruction when $d\theta^{2}\leq\omega^{*}-\delta$ . In other words,

[TABLE]

Proof.

It follows from Lemma 12 that when $1-6\pi_{1}\pi_{2}>0$, there exists $0<\bar{s}<\pi_{2}$ such that $g(\bar{s})>\bar{s}$. Moreover, noting that $g(0\cdot\bar{s})=g(0)=0<\bar{s}$, the Intermediate Value Theorem implies the existence of $0<\bar{\omega}<1$ such that $g(\bar{\omega}\bar{s})=\bar{s}$. Consequently, $\omega^{*}$ exists and $0\leq\omega^{*}\leq\bar{\omega}<1$. Furthermore, for any $\omega^{*}<\omega<1$, it follows from Lemma 11 that the set $\left\{0<s<\pi_{2}:g(\omega s)\geq s\right\}$ is a non-empty compact set bounded away from $0$. Then it is further established by the continuity of $g(s)$ that the set

[TABLE]

is non-empty and compact. Hence it implies immediately that $0<\omega^{*}<1$ .

Next, taking $s^{*}\in\{0<s<\pi_{2}:g(\omega^{*}s)=s\}$ and considering $d\theta^{2}=\omega^{*}+\delta$ , one has

[TABLE]

Define $\varepsilon=\varepsilon(\pi,\delta)=s^{*}-s^{*}\omega^{*}/(\omega^{*}+\delta)>0$ . By Lemma 10 there exists a $D=D(\pi,\varepsilon)=D(\pi,\delta)$ , such that if $d>D$ and $x_{n}\geq s^{*}\omega^{*}/(\omega^{*}+\delta)$ then

[TABLE]

where the second inequality follows from Lemma 11. Consequently, it is shown by induction, and noting the initial value $x_{0}=\pi_{2}>s^{*}$ , that $x_{n}\geq s^{*}\omega^{*}/(\omega^{*}+\delta)$ for all $n$ , which further establishes reconstruction. At last, Proposition 12 in Mossel [2001] implies that the reconstruction is solvable for any $d\theta^{2}\geq\omega^{*}+\delta$ .

On the other hand, when $d\theta^{2}=\omega^{*}-\delta$ , we have $g(d\theta^{2}s)\leq d\theta^{2}s/\omega^{*}$ . Taking $\eta=\left(1-\omega^{*}\right)/2>0$ in equation (4.1), there exists a constant $\zeta=\zeta(\pi)$ , such that if $x_{n}<\zeta$ then

[TABLE]

where the fact that $\left(1+\omega^{*}\right)/2<1$ implies that $\lim_{n\to\infty}x_{n}=0$, and hence non-reconstruction. So it suffices to find some $m$ such that $x_{m}<\zeta$, which can be accomplished by choosing

[TABLE]

in Lemma 10. Then, there exists a sufficiently large $D=D(\pi,\varepsilon)=D(\pi,\delta)$ , such that if $d>D$ and $x_{n}\geq\zeta$ then

[TABLE]

Then the fact that $\left(1+d\theta^{2}/\omega^{*}\right)/2<1$ guarantees the existence of $m$ satisfying $x_{m}<\zeta$ , as desired. Finally using Proposition 12 in Mossel [2001] again, one can conclude non-reconstruction for any $d\theta^{2}\leq\omega^{*}-\delta$ . ∎

6.4 Proof of Theorem 1.3

When $1-6\pi_{1}\pi_{2}<0$, the proof of Theorem 1.3 resembles that of Theorem 1.1 in establishing a recursive inequality similar to equation (5.3), under the condition that $x_{n}\leq\delta$ and $n\geq N$ for suitable $\delta=\delta(\pi,d)$ and $N=N(\pi)$. However, there is a crucial difference between the two proofs: Theorem 1.3 relies heavily on $d$ being large. Before we proceed, we first state the following lemma:

Lemma 13.

For any $0<\varepsilon<1$ and $\alpha>1$ , there exist $C=C(\pi,\varepsilon,\alpha)$ and $D=D(\pi,\varepsilon,\alpha)$ such that if $d>D$ then

[TABLE]

Furthermore, there exist $D=D(\pi,\varepsilon)$ and $\delta=\delta(\pi,\varepsilon)$ such that if $d>D$ and $x_{n}\leq\delta$ then

[TABLE]

Proof. For any $1\leq j\leq d$ , define

[TABLE]

From equation (6.1) and $|\theta|\leq d^{-\frac{1}{2}}$ , one can find a suitable $D=D(\pi,M)>0$ such that when $d\geq D$ we have $\theta<\pi_{1}$ , $U_{j}\geq w_{j}$ and $|w_{j}-\mathbf{E}w_{j}|\leq 2M$ . It is concluded from Lemma 2 that $|d\mathbf{E}w_{j}|\leq C_{1}x_{n}$ and

[TABLE]

where $C_{1}$ and $C_{2}$ denote constants depending only on $\pi$. In what follows, it is convenient to assume

[TABLE]

for the reason that equation (6.3) would be trivial otherwise. Therefore, it follows from equation (6.6) and Bennett’s inequality that

[TABLE]

where $C_{3}$ depends only on $\pi$ , $\varepsilon$ , $\alpha$ . Similarly one can show that $\mathbf{P}(Z_{1}\geq 1+\varepsilon)<C_{4}x_{n}^{\alpha}$ and then

[TABLE]

for some $C_{4}=C_{4}(\pi,\varepsilon,\alpha)$ . Similarly one can also show that $\mathbf{P}(|Z_{2}-1|>\varepsilon)\leq C_{5}x_{n}^{\alpha}$ . On the other hand, there exists $\eta=\eta(\pi,\varepsilon)>0$ such that if $|Z_{i}-1|\leq\eta$ for $i=1,2$ then

[TABLE]

Finally, we have

[TABLE]

where $C=C(\pi,\varepsilon,\alpha)$ . Then we can achieve equation (6.4) by modifying the proof of Proposition 1. ∎
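The tail estimate in the proof above rests on Bennett's inequality. In its standard form, for independent zero-mean variables with $|X_{i}|\leq b$ and total variance $\sigma^{2}$, it states $\mathbf{P}\left(\sum_{i}X_{i}\geq t\right)\leq\exp\left(-\frac{\sigma^{2}}{b^{2}}h\left(\frac{bt}{\sigma^{2}}\right)\right)$ with $h(u)=(1+u)\log(1+u)-u$. The sketch below evaluates this bound and compares it with an exact binomial tail. The summands here are centered Bernoulli variables chosen purely for illustration, not the $w_{j}$ of the proof, and the helper names are hypothetical.

```python
import math


def bennett_bound(variance_sum: float, bound: float, t: float) -> float:
    """Bennett's upper bound on P(S >= t), where S is a sum of independent
    zero-mean variables with |X_i| <= bound and total variance variance_sum."""
    u = bound * t / variance_sum
    h = (1.0 + u) * math.log1p(u) - u
    return math.exp(-variance_sum / bound ** 2 * h)


def binomial_upper_tail(n: int, p: float, m: int) -> float:
    """Exact P(Bin(n, p) >= m)."""
    return sum(math.comb(n, j) * p ** j * (1.0 - p) ** (n - j) for j in range(m, n + 1))


# Toy comparison (illustrative numbers): centered Bernoulli(p) summands,
# so |X_i| <= 1 and the total variance is n*p*(1-p).
n, p, t = 50, 0.1, 5.0
threshold = 10  # the event {S >= t} is {Bin(n, p) >= n*p + t} = {Bin(50, 0.1) >= 10}
exact = binomial_upper_tail(n, p, threshold)
bound = bennett_bound(n * p * (1.0 - p), 1.0, t)
```

Unlike Hoeffding-type bounds, the Bennett bound uses the variance of the summands, which is what allows a tail estimate that improves as the variance shrinks.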

Lemma 14.

When $1-6\pi_{1}\pi_{2}<0$ , for any $0<s\leq\pi_{2}$ we have

[TABLE]

Proof of Theorem 1.3. Similar to the proof of Theorem 1.1, we will analyze $R$ and $S$ in equation (3.5) respectively, under the condition that $1-6\pi_{1}\pi_{2}<0$ . Taking $D_{1}=C_{R}^{2}(6\pi_{1}\pi_{2}^{2})^{2}/(6\pi_{1}\pi_{2}-1)^{2}$ , if $d>D_{1}$ which implies $|\theta|\leq d^{-1/2}\leq D_{1}^{-1/2}$ , by equation (3.6) and the inequality that $\left|z_{n}/x_{n}-\pi_{1}\right|\leq 1$ , we obtain

[TABLE]

Moreover, according to Lemma 13, there exist $D_{2}=D_{2}(\pi)>D_{1}$ and $\delta=\delta(\pi)>0$ independent of $d$ , such that if $d>D_{2}$ and $x_{n}<\delta$ then an analogue of equation (5.2) holds as

[TABLE]

and then by equation (6.7) we have

[TABLE]

Next we claim that there is a positive integer $m$ such that $x_{m}<\delta$ . Define $\varepsilon=\varepsilon(\pi,\delta)=\varepsilon(\pi)=\frac{1}{2}\min_{s\geq\delta}(s-g(s))$ . Since the function $s-g(s)$ is continuous and positive on $[\delta,\pi_{2}]$ by Lemma 14, we have $\varepsilon>0$ . Then by Lemma 10, there exists a $D=D(\pi,\varepsilon)=D(\pi)>D_{2}>0$ , such that when $d>D$ ,

[TABLE]

thus if $x_{n}\geq\delta$ then

[TABLE]

where the second inequality follows from Lemma 11 which claims that $g(s)$ is increasing on $[0,\pi_{2}]$ . Therefore, there must exist an $m\in\mathbb{Z}^{+}$ , such that $x_{m}<\delta$ , as desired.

When $d>D$ , it can be shown by induction, equation (6.8) and equation (6.9) that when $n\geq m$ ,

[TABLE]

Therefore, the limit $L$ defined as $L=\lim_{n\to\infty}x_{n}\geq 0$ does exist, since the sequence $\{x_{n}\}_{n\geq m}$ is bounded and decreasing. Thus, passing to the limit on both sides of equation (6.10) gives

[TABLE]

which implies $L=0$ , hence non-reconstruction.

∎

Acknowledgement

We give special thanks to the journal editor and two anonymous reviewers who provided us with many constructive and helpful comments. We also give special thanks to Sebastien Roch for his inspiring discussions and reading of some proofs in the first version of this paper. We truly appreciate the warm encouragement on finishing this complete version and helpful discussions on this topic as well as future extensions, from colleagues at the 2017 & 2018 Columbia-Princeton Probability Day, 2017 Northeast Probability Seminar, 2018 Frontier Probability Days, 2017 & 2018 Finger Lakes Probability Seminar, and 2017 & 2018 Seminar on Stochastic Processes.

Bibliography

1. Berger et al. [2005] Noam Berger, Claire Kenyon, Elchanan Mossel, and Yuval Peres. Glauber dynamics on trees and hyperbolic graphs. Probability Theory and Related Fields, 131(3):311–340, 2005.
2. Bernussou and Abatut [1977] Jacques Bernussou and Jean-Louis Abatut. Point mapping stability. Pergamon, 1977.
3. Bhamidi et al. [2010] Shankar Bhamidi, Ram Rajagopal, and Sébastien Roch. Network delay inference from additive metrics. Random Structures & Algorithms, 37(2):176–203, 2010.
4. Bleher et al. [1995] Pavel M Bleher, Jean Ruiz, and Valentin A Zagrebnov. On the purity of the limiting Gibbs state for the Ising model on the Bethe lattice. Journal of Statistical Physics, 79(1-2):473–482, 1995.
5. Borgs et al. [2006] Christian Borgs, Jennifer Chayes, Elchanan Mossel, and Sébastien Roch. The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels. In Foundations of Computer Science, 2006. FOCS’06. 47th Annual IEEE Symposium on, pages 518–530. IEEE, 2006.
6. Chayes et al. [1986] JT Chayes, L Chayes, James P Sethna, and DJ Thouless. A mean field spin glass with short-range interactions. Communications in Mathematical Physics, 106(1):41–89, 1986.
7. Daskalakis et al. [2006] Constantinos Daskalakis, Elchanan Mossel, and Sébastien Roch. Optimal phylogenetic reconstruction. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing, pages 159–168. ACM, 2006.
8. Evans et al. [2000] William Evans, Claire Kenyon, Yuval Peres, and Leonard J Schulman. Broadcasting on trees and the Ising model. Annals of Applied Probability, pages 410–433, 2000.
