Large Degree Asymptotics and the Reconstruction Threshold of the Asymmetric Binary Channels
Wenjian Liu, Ning Ning

TL;DR
This paper investigates the reconstruction problem on noisy tree networks with asymmetric binary channels, providing precise thresholds and asymptotic behavior for large degrees through refined analytical methods.
Contribution
It extends previous work by rigorously determining the conditions under which the reconstruction threshold is tight for asymmetric binary channels on large-degree trees.
Findings
Established the exact reconstruction threshold for asymmetric binary channels.
Derived asymptotic behavior of the threshold as the degree grows large.
Provided refined analysis techniques for moment recursion and concentration.
Abstract
In this paper, we consider a broadcasting process in which information is propagated from a given root node on a noisy tree network, and answer the question that whether the symbols at the nth level of the tree contain non-vanishing information of the root as n goes to infinity. Although the reconstruction problem on the tree has been studied in numerous contexts including information theory, mathematical genetics and statistical physics, the existing literatures with rigorous reconstruction thresholds established are very limited. In the remarkable work of Borgs, Chayes, Mossel and Roch (The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels), the exact threshold for the reconstruction problem for a binary asymmetric channel on the d-ary tree is establish, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
∎
11institutetext: Wenjian Liu 22institutetext: Dept.of Mathematics and Computer Science, Queensborough Community College, City University of New York
Research supported by CUNY Community College Research Grant #1541
22email: [email protected] 33institutetext: Ning Ning (Corresponding Author)44institutetext: Dept. of Applied Mathematics, University of Washington, Seattle
44email: [email protected]
Large Degree Asymptotics and the Reconstruction Threshold of the Asymmetric Binary Channels
Wenjian Liu
Ning Ning
Abstract
In this paper, we consider a broadcasting process in which information is propagated from a given root node on a noisy tree network, and answer the question that whether the symbols at the th level of the tree contain non-vanishing information of the root as goes to infinity. Although the reconstruction problem on the tree has been studied in numerous contexts including information theory, mathematical genetics and statistical physics, the existing literatures with rigorous reconstruction thresholds established are very limited. In the remarkable work of Borgs, Chayes, Mossel and Roch (The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels. FOCS, IEEE Comput. Soc. (2006): 518–530. Berkeley, CA.), the exact threshold for the reconstruction problem for a binary asymmetric channel on the -ary tree is establish, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold obtained in roughly a decade. In this paper, by means of refined analyses of moment recursion on a weighted version of the magnetization, and concentration investigations, we rigorously give a complete answer to the question of how small it needs to be to establish the tightness of the reconstruction threshold and further determine its asymptotics of large degrees.
Keywords:
Kesten-Stigum reconstruction bound Markov random fields on trees Distributional recursion Nonlinear dynamical system
MSC:
60K35 82B26 82B20
1 Introduction
1.1 Broadcasting Process and the Reconstruction Problem
We consider the following broadcasting process that can be considered as a communication tree network, as a model for propagation of a genetic property or as a tree-indexed Markov chain. In this paper, we restrict our attention to the regular -ary trees, which is an infinite rooted tree where every vertex has exactly offspring, denoted by with nodes edges and root . A configuration on is an element of with being a finite characters set, that is, an assignment of a state in to each vertex. The state of the root , denoted by , is chosen according to some initial distribution on . This symbol is then propagated in the tree according to the probability transition matrix , which functions as the noisy communication channel on each edge. That is, for each vertex having as its parent, the spin at is defined according to the probabilities
[TABLE]
The objective model taken into account is the asymmetric binary channel with the configuration set , whose transition matrix is of the form
[TABLE]
where and is used to describe the deviation of from the symmetric channel. It is easy to see that has two eigenvalues, 1 and . Then we pick a state at the root according to the stationary distribution of , which is given by
[TABLE]
and without loss of generality, it is convenient to assume that .
Recall that the classical Ising model, a mathematical model of ferromagnetism in statistical mechanics, consists of discrete variables that represent magnetic dipole moments of atomic spins that can be in one of two states ( or ). Consider a set of lattice sites , each with a set of adjacent sites (e.g. a graph) forming a lattice, and for each , there is a discrete variable representing the site’s spin. The energy of a configuration is given by the Hamiltonian function
[TABLE]
where the notation indicates that sites and are the nearest neighbors, denotes the interaction between two adjacent sites and models the external magnetic field interaction of site . In this literature, the current model corresponds to the general Ising model with external field on the tree.
The problem of reconstruction is to analyze whether there exists non-vanishing information on the letter transmitted from the root, given all the symbols received at the vertices of the th generation, as goes to infinity. We define the distance between probability measures in line with Evans et al. [2000]. Let and be two probability measures on the same space. Set and where . Inferring the root spin from the spin configurations on the finite vertex set is a basic problem of Bayesian hypothesis testing. The total variation distance, defined as , can be interpreted as the difference between the probabilities of correct and erroneous inferences. Denote as the spins at distance from the root and as conditioned on . Then the reconstruction problem can be mathematical formulated as the following:
Definition 1.
The reconstruction problem for the infinite tree is solvable if for some ,
[TABLE]
When the is [math], we will say that the model has non-reconstruction on .
1.2 Background and Applications
The reconstruction problem arises naturally in statistical physics, where the reconstruction threshold is identified as the threshold for extremality of the infinite-volume Gibbs measure with free boundary conditions (see Georgii [2011]). In Berger et al. [2005], Martinelli et al. [2007], Tetali et al. [2012], the reconstruction bound is found to have a crucial determination effect on the efficiency of the Glauber dynamics on trees and random graphs. The reconstruction threshold is also believed to play an important role in a variety of other contexts, including phylogenetic reconstruction in evolutionary biology (Mossel [2004a], Daskalakis et al. [2006], Roch [2006]), communication theory in the study of noisy computation (Evans et al. [2000]), clustering problem in the setting of the stochastic block model (Mossel et al. [2012, 2013], Neeman and Netrapalli [2014]), and network tomography (Bhamidi et al. [2010]). For detailed explanation on the reconstruction problem in mixing, phylogeny and replicas, we refer to Section 1.3 in Bernussou and Abatut [1977]. For other applications of reconstruction, we refer to Section 1.4 in Sly [2011] and Section 1.3 in Liu et al. [2018], as well as the references therein.
In this paper, we focus on analyzing the tightness of the reconstruction bound on the tree for asymmetric binary channels, which corresponds to the asymmetric Ising model on the tree in statistical physics term. Well known, the reconstruction problem is closely related to , the second largest eigenvalue in absolute value of the transition probability matrix, which is in the current model under investigation. Kesten and Stigum [1966, 1967] showed that the reconstruction problem is solvable if , which is known as the Kesten-Stigum bound. However in the case of larger noise, i.e. , one may wonder whether reconstruction problem is still solvable, that is collecting and analyzing the whole set of symbols received at the th generation to retrieve information transmitted from the root.
First consider the symmetric channel. It was shown in Bleher et al. [1995] that the reconstruction problem is solvable if and only if in the binary model. For all other models, it was also known and easy to prove that implies solvability, while proving non-reconstructibility turned out to be harder. Although coupling arguments easily yield non-reconstruction, these arguments are typically not rigorous. A natural approach to establish non-reconstructibility is to analyze recursions in terms of random variables, each of whose values is the expectation of the chain at a vertex, given the state at the leaves of the subtree below it, and the corresponding probabilities. Although the reconstruction problem on the tree has been studied in numerous contexts, the existing literatures with rigorous reconstruction thresholds established are very limited. Sly [2011] proved the first exact reconstruction threshold in a nonbinary model by establishing the Kesten–Stigum bound for the -state Potts model on regular trees of large degree, and further established that the Kesten–Stigum bound is not tight for the -state Potts model when , which confirms much of the picture conjectured earlier by Mézard and Montanari [2006]. Liu et al. [2018] considered a -state symmetric model, with two categories of states in each category, and 3 transition probabilities (the probability to remain in the same state, the probability to change states but remain in the same category, and the probability to change categories) and showed that the Kesten-Stigum reconstruction bound is not tight when .
Next let us turn to the existing results regarding the asymmetric channel. Mossel [2001, 2004b] showed that the Kesten-Stigum bound is not the bound for reconstruction in the binary asymmetric model with sufficiently large asymmetry or in the symmetric Potts model with sufficiently many characters, which shed the light on exploring the tightness of the Kesten-Stigum bound. Furthermore, Proposition 12 in Mossel [2001] implies that for any asymmetric channel, given and , the reconstructibility is monotone in , say, there exist the thresholds such that, there is non-reconstruction when , while it is reconstructible when or . Therefore, the Kesten-Stigum bound mentioned above implies immediately
[TABLE]
but exact thresholds for non-solvability had not been known. The breakthrough result in Borgs et al. [2006] established the exact threshold for the reconstruction problem with the binary asymmetric channel on the -ary tree, provided that the asymmetry is sufficiently small, which is the first exact reconstruction threshold obtained in roughly a decade. However, this beautiful result only rigorously proved the existence of to satisfy the reconstruction criterion, does not answer the question that how small the asymmetry needs to be, therefore rigorously estimating the range of asymmetry to keep Kesten-Stigum bound tight is a natural question, which will be answered in the next section.
1.3 Main Results and Proof Sketch
In this section, we will present a critical condition of the stationary initial distribution to keep the tightness of the Kesten-Stigum bound, by means of refined recursive equations of vector-valued distributions and concentration analyses. Furthermore, when the Kesten-Stigum bound is not tight, we provide a new reconstruction threshold for sufficiently large . Since always implies reconstruction, it suffices to consider in the following context.
Theorem 1.1.
For every and such that , the Kesten-Stigum bound is not tight. In other words, the reconstruction problem is solvable for some , even if .
The proof to Theorem 1.1 above is given in Section 5. The proofs to Theorem 1.2 and Theorem 1.3 below are given in Section 6.3 and Section 6.4 respectively.
Theorem 1.2.
For every such that , there exists an asymptotic result of the reconstruction threshold, that is, when goes to infinity,
[TABLE]
where is a constant taking values in and depends only on .
Theorem 1.3.
For every such that , there exists a , such that for the Kesten-Stigum bound is sharp, that is
[TABLE]
Furthermore, there is non-reconstruction at the Kesten-Stigum bound, when or .
The idea to establish Theorem 1.1, Theorem 1.2 and Theorem 1.3 is the following. One standard way to classify reconstruction and non-reconstruction is to analyze the quantity : the probability of giving a correct guess of the root given the spins at distance from the root, minus the probability of guessing the root according to stationary initial distribution. Non-reconstruction means that the mutual information between the root and the spins at distance goes to zero as tends to infinity. In Lemma 3, we rigorously show that is always positive and the non-reconstruction is equivalent to
[TABLE]
To analyze whether the reconstruction holds, inspired by Chayes et al. [1986], Borgs et al. [2006] and Sly [2011], we establish the distributional recursion and moment recursion, and then the recursive relation between the th and the th generation’s structure of the tree leads to a corresponding nonlinear dynamical system. In the mean time, we show that the interactions between spins become very weak, if they are sufficiently far away from each other. Therefore, under this weak interacting situation, i.e. being sufficiently small, the concentration analysis is successfully developed and an approximation to the dynamical system is nicely established:
[TABLE]
The sign of coefficient of the quadratic term which is determined by , plays a crucial role in the asymptotic behavior of . When , equivalently , if is sufficiently close to 1, then does not converge to [math] and then there is reconstruction beyond the Kesten-Stigum bound. Then our focus is to find this new reconstruction threshold, which is executed in the following three steps: Step one, we rigorously show that, when degree is large, the interactions between spins become very weak; Step two, using the Central Limit Theorem, we approximate the corresponding collection of small independent samples, to show that the reconstruction function can be asymptotically given by a new Gaussian approximation function , that is, ; Step three, we explore the first several major terms of the Maclaurin series of , and rigorously establish the reconstruction threshold by discussing the fixed point of . On the other hand, when , the analysis of large degree asymptotics yields , which implies , that is, there is non-reconstruction.
2 Preliminaries
2.1 Notations
Let be the children of and be the subtree of descendants of . Denote the th level of the tree as , where is the graph distance on . With the notations above, let and be the spins on and respectively. For a configuration on , define the posterior function by
[TABLE]
By the recursive nature of the tree, for a configuration on , an equivalent form is given by
[TABLE]
Next, with , define
[TABLE]
and for ,
[TABLE]
where it is clear that the random variables are independent and identical in distribution. It is apparent that
[TABLE]
and
[TABLE]
We introduce the objective quantities in this paper:
[TABLE]
2.2 Preparations
Before proceeding to the analysis, it is convenient to firstly derive some very useful identities concerning .
Lemma 1.
For any , we have
[TABLE]
Proof.
By Bayes’ rule, we have
[TABLE]
and similarly,
[TABLE]
Then it follows from equation (2.2) that
[TABLE]
Next by equation (2.1), one has
[TABLE]
Therefore, the quantitative relation between and holds:
[TABLE]
∎
With the preceding results, we calculate the means and variances of .
Lemma 2.
For each , we have
[TABLE]
Proof.
If , is distributed according to , while if , is distributed according to . Therefore we have
[TABLE]
and similarly we have
[TABLE]
as desired. ∎
2.3 An Equivalent Condition for Non-reconstruction
If the reconstruction problem is solvable, then contains significant information on the root variable, which may be formulated in several equivalent ways (see Mossel [2001], Proposition 14). As a result, in order to analyze the reconstruction, it suffices to investigate the asymptotic behavior of as goes to infinity.
Lemma 3.
The non-reconstruction is equivalent to
[TABLE]
Proof.
The maximum-likelihood algorithm, which is the optimal reconstruction algorithm of given , is successful with probability
[TABLE]
Therefore, the inequality of holds, which is an analogous result to that of Mézard and Montanari [2006], by noting that the algorithm that chooses randomly according to probabilities is correct with probability . On the other hand, recalling the assumption that , by the Cauchy-Schwartz inequality together with the identities equation (2.1) and equation (2.3), one can conclude
[TABLE]
Hence, one has
[TABLE]
implying that converging to [math] is equivalent to converging to , which is equivalent to non-reconstruction (see Mossel [2001]). ∎
3 Moment Recursion
3.1 Distributional Recursion
In the last section, it is known that the asymptotic behavior of as goes to infinity plays a crucial role, however it is still too difficult and not necessary to get the explicit expression for . In fact, we only need to investigate the recursive formula of , from which it is possible to illustrate the trend of as goes to infinity. Thus the key idea is to analyze the recursive relation between and by the structure of the tree. Suppose that is a configuration on and let be its restriction on . Then from the Markov random field property, we have
[TABLE]
where
[TABLE]
and
[TABLE]
Next conditioning the root being and setting , we have
[TABLE]
where
[TABLE]
and
[TABLE]
3.2 Main Expansion of
With the help of those preliminary results, we are about to figure out the recursive relation regarding , specifically, its main expansion, which would play a crucial rule in further discussions. Firstly we take care of the approximating means and variances of and the Taylor series approximations.
Lemma 4.
For each positive integer , there exists a which only depends on and , such that for each ,
[TABLE]
where
[TABLE]
Proof.
Since are independent and identical in distribution, we have
[TABLE]
It follows from that , and then when , we have . Therefore Lemma 2 implies that
[TABLE]
where and depend on and only. Consequently, we have by means of . Using the binomial expansion and the Remainder Theorem, we have
[TABLE]
Taking respectively and complete the proof. ∎
Next we aim to figure out the recursive relation of by virtue of the following identity
[TABLE]
Specifically, plugging , and in equation (3.2) yields
[TABLE]
In the following, we analyze terms in equation (3.3), using the notation to emphasize that the constant associated with the -term depends on only
[TABLE]
[TABLE]
and
[TABLE]
Then the preceding results yield
[TABLE]
Similarly, we have
[TABLE]
[TABLE]
and then
[TABLE]
As a consequence, we have
[TABLE]
where
[TABLE]
with a constant depending only on , and
[TABLE]
which will be handled in the following concentration investigation.
4 Concentration Analysis
Noting that , we have . It is concluded from equations (3.3) and (3.4) that
[TABLE]
where depends only on . In equation (4.1), the first inequality follows from Lemma 1 which states that , and the last inequality holds if for small enough. The following lemma ensures that does not drop too fast.
Lemma 5.
For any , there exists a constant , such that for all when ,
[TABLE]
Proof.
For a configuration on , we have
[TABLE]
and then
[TABLE]
Therefore, it follows from equation (2.4) that
[TABLE]
namely,
[TABLE]
Next choosing , equation (4.1) indicates that there exists a , such that if then
[TABLE]
On the other hand, if , equation (4.2) becomes . Finally taking completes the proof. ∎
Actually, it can be seen from equation (3.5) that the estimates of and would play a key role in the recursive expression of , hence we will verify that and are both sufficiently around , analogous to the concentration analysis result in Sly [2011]. In the following lemma, we firstly establish a technical uniqueness result where the set of vertices which can be conditioned is limited to a set of vertices.
Lemma 6.
For any and positive integer , there exists , such that for any collection of vertices ,
[TABLE]
Proof.
Denote the entries of the transition matrix at distance as
[TABLE]
and it is natural that and . As a result, it follows that
[TABLE]
which yields a second order recursive formula
[TABLE]
with the initial conditions and . Then the general solutions are given by
[TABLE]
Consequently, under the condition of , we have
[TABLE]
[TABLE]
[TABLE]
[TABLE]
For fixed , and , define
[TABLE]
and let be a sufficiently large integer such that
[TABLE]
where the last inequality holds by the fact that as which implies uniformly for all .
Now fix an integer such that and choose any . For , define as the number of vertices of distance from the root with a decedent in the set , that is
[TABLE]
Then according to the definition, it is trivial to see that is an increasing integer valued function with respect to from to , which, by the pigeonhole principle, implies that there must exist some such that . Next, denote and as vertices in sets and respectively, such that is the descendent of , and then
[TABLE]
By Bayes’ Rule and the Markov random field property, for any , we have
[TABLE]
which implies that
[TABLE]
Hence, for the reason that is conditionally independent of the collection given , one has
[TABLE]
∎
Lemma 7.
Assume for some . Given arbitrary , there exist constants and , such that whenever ,
[TABLE]
Proof.
Fix an integer with . Choose to hold with bound in Lemma 6. Let denote the vertices in and define
[TABLE]
where denotes the spins of vertices in . Then would be distributed as
[TABLE]
The recursion formula in equation (3.1) together with the fact that , yield a function
[TABLE]
where for . There is no difficulty in finding that when all the entries are identically one has
[TABLE]
and is a continuous function of the vector . Therefore, by Lemma 6, if there are at most vertices in such that , then
[TABLE]
and there exists some such that if
[TABLE]
then
[TABLE]
Next, by the Chebyshev’s inequality together with equation (4.6), the following result holds:
[TABLE]
Random variables for distinct are conditionally independent given , so there exist suitable constants and , such that when , one has
[TABLE]
where denotes the binomial distribution and the last inequality holds due to Lemma 5. ∎
Now, we are able to bound and in equation (3.7) using the preceding concentration results.
Proposition 1.
Assume for some . For any , there exist and , such that if and then .
Proof.
For any , using the Cauchy-Schwartz inequality and by Lemma 7, one has
[TABLE]
Also, it follows from equation (3.4) and Lemma 4 respectively that
[TABLE]
and
[TABLE]
Taking in Lemma 7, there exist and , such that if then
[TABLE]
Finally taking and , we have that if and then
[TABLE]
∎
Proposition 2.
Assume for some . For any , there exist and , such that if and then
[TABLE]
Proof.
Plugging , and in the identity equation (3.2), we have
[TABLE]
Next we will estimate these expectation terms one by one with the -constants depend only on :
[TABLE]
[TABLE]
where we used the fact that in the last inequality. Therefore, the recursion formula of can be written as
[TABLE]
In the rest of the proof, we let be constants depend only on . It follows from equation (4.1) that
[TABLE]
and in view of , there exists , such that if then
[TABLE]
Consequently,
[TABLE]
For any , by equation (4.1), there exists a , such that if then for any . Now iterating times the inequality (4.8) yields
[TABLE]
Therefore, noting that , and taking large enough and sufficiently small, we obtain that if then
[TABLE]
where the first inequality relies on the fact that . At last, choosing and , and noting that by Lemma 5 if and then , the previous result in equation (4.9) completes the proof. ∎
5 Proof of Theorem 1.1
To accomplish the proof, it suffices to show that when is close enough to , does not converge to [math]. For convenience, we suppose that . For any fixed and , there is , and we take in Lemma 5 to generate . When , by Proposition 1 and Proposition 2, there exist and , such that if and , then the remainders in equation (3.5) could be evaluated respectively as
[TABLE]
and
[TABLE]
As a consequence,
[TABLE]
Furthermore, in light of and Lemma 5, for all we have
[TABLE]
Define . Then equation (5.4) implies that when . Next, by choosing suitable , we achieve
[TABLE]
for the reason that is independent of . Now, suppose for some . If , then Lemma 5 gives . If , then by equation (5.3) and equation (5.5), we have
[TABLE]
Hence it can be shown by induction that for all , namely, the Kesten-Stigum bound is not tight.
6 Large Degree Asymptotics
6.1 Gaussian Approximation
For , define
[TABLE]
Lemma 8.
There exist positive constants and , such that when ,
[TABLE]
[TABLE]
[TABLE]
Proof.
Starting with the Taylor series expansion of , there exists a constant , such that when ,
[TABLE]
Taking sufficiently large, when , we have that is small enough to guarantee equation (6.1) for and then
[TABLE]
for some constant , where the third inequality follows from . The rest estimates follow similarly. ∎
Next, define a 2-dimensional vector with , and a -covariance matrix
[TABLE]
which is a positive semi-definite symmetric -matrix. Let possess the Gaussian distribution , then the following lemma can be established by the Central Limit Theorem, the Gaussian approximation and the Portmanteau Theorem.
Lemma 9.
Let be a differentiable bounded function. For any , there exists , such that if then
[TABLE]
Next, define
[TABLE]
and then
[TABLE]
If has the Gaussian distribution , then is distributed according to . At last, define
[TABLE]
Therefore, Lemma 9 implies the following approximation to .
Lemma 10.
For arbitrary , there exists a , such that whenever ,
[TABLE]
6.2 Asymptotic Estimation of the Reconstruction Threshold
In order to estimate , it suffices to investigate the properties of on the interval , considering that and .
Lemma 11.
The function is continuously differentiable and increasing on the interval .
Proof.
When , it is concluded that
[TABLE]
by the fact that \left|\frac{\pi_{2}}{\pi_{1}}e^{t}\bigg{/}\left(1+\frac{\pi_{2}}{\pi_{1}}e^{t}\right)^{2}\right|\leq 1/4 holds for any . Then we establish the differentiability with respect to .
Now, let be an independent copy of . Thus if , it is feasible to construct equivalent distributions such as
[TABLE]
In view of , it follows that and
[TABLE]
which implies that and are both distributed as , with .
Next, it is well known that if has the distribution , the expectation of the exponential random variable could be estimated as
[TABLE]
based on which, we are able to estimate the conditional expectation given and :
[TABLE]
Then applying Jensen’s inequality, and considering that the function is convex and
[TABLE]
we have
[TABLE]
as desired. ∎
It is necessary to discuss the Taylor expansion of in the small neighborhoods of .
Lemma 12.
For small , we have
[TABLE]
Proof.
Define . By the results in Lemma 11, it is apparent that . Therefore by equation (6.2) the following moments can be calculated:
[TABLE]
Next starting from the identity
[TABLE]
we obtain the power series of as
[TABLE]
that is,
[TABLE]
∎
6.3 Proof of Theorem 1.2
In this section, we precisely rephrase Theorem 1.2 and give its rigorous proof.
Theorem 6.1.
When , define
[TABLE]
Then , and for any there exists a , such that if then the model has reconstruction when , but does not have reconstruction when . In other words,
[TABLE]
Proof.
It follows from Lemma 12 that when , there exists such that . Moreover, noting that , the Intermediate Value Theorem implies the existence of such that . Consequently, does exist and . Furthermore, for any , it follows from Lemma 11 that the set is a non-empty compact set bounded away from [math]. Then it is further established by the continuity of that the set
[TABLE]
is non-empty and compact. Hence it implies immediately that .
Next, taking and considering , one has
[TABLE]
Define . By Lemma 10 there exists a , such that if and then
[TABLE]
where the second inequality follows from Lemma 11. Consequently, it is shown by induction, and noting the initial value , that for all , which further establishes reconstruction. At last, Proposition 12 in Mossel [2001] implies that the reconstruction is solvable for any .
On the other hand, when , we have . Taking in equation (4.1), there exists a constant , such that if then
[TABLE]
where the fact that implies that and then there is non-reconstruction. So, it suffices to find some , such that , which could be accomplished by choosing
[TABLE]
in Lemma 10. Then, there exists a sufficiently large , such that if and then
[TABLE]
Then the fact that guarantees the existence of satisfying , as desired. Finally using Proposition 12 in Mossel [2001] again, one can conclude non-reconstruction for any . ∎
6.4 Proof of Theorem 1.3
When , the proof of Theorem 1.3 would resemble Theorem 1.1 in establishing a similar recursive inequality as equation (5.3), under the condition that and for suitable and . However, there still exists a crucial discrepancy between these two proofs, that is, Theorem 1.3 relies heavily on large . Before we proceed, let us firstly give the following lemma:
Lemma 13.
For any and , there exist and such that if then
[TABLE]
Furthermore, there exist and such that if and then
[TABLE]
Proof. For any , define
[TABLE]
From equation (6.1) and , one can find a suitable such that when we have , and . It is concluded from Lemma 2 that and
[TABLE]
where and denote the constants depending only on . In the following context, it is convenient to presume
[TABLE]
for the reason that equation (6.3) would be trivial otherwise. Therefore, it follows from equation (6.6) and the Bennet’s inequality that
[TABLE]
where depends only on , , . Similarly one can show that and then
[TABLE]
for some . Similarly one can also show that . On the other hand, there exists such that if for then
[TABLE]
Finally, we have
[TABLE]
where . Then we can achieve equation (6.4) by modifying the proof of Proposition 1. ∎
Lemma 14.
When , for any we have
[TABLE]
Proof of Theorem 1.3. Similar to the proof of Theorem 1.1, we will analyze and in equation (3.5) respectively, under the condition that . Taking , if which implies , by equation (3.6) and the inequality that , we obtain
[TABLE]
Moreover, according to Lemma 13, there exist and independent of , such that if and then an analogue of equation (5.2) holds as
[TABLE]
and then by equation (6.7) we have
[TABLE]
Next we claim that there is a positive integer such that . Define . Since the function is continuous and positive on by Lemma 14, we have . Then by Lemma 10, there exists a , such that when ,
[TABLE]
thus if then
[TABLE]
where the second inequality follows from Lemma 11 which claims that is increasing on . Therefore, there must exist an , such that , as desired.
When , it can be shown by induction, equation (6.8) and equation (6.9) that when ,
[TABLE]
Therefore, the limit defined as does exist, since the sequence is bounded and decreasing. Thus, passing to the limit on both sides of equation (6.10) gives
[TABLE]
which implies , hence non-reconstruction.
∎
Acknowledgement
We give special thanks to the journal editor and two anonymous reviewers who provided us with many constructive and helpful comments. We also give special thanks to Sebastien Roch for his inspiring discussions and reading of some proofs in the first version of this paper. We truly appreciate the warm encouragements on finishing this complete version and helpful discussions on this topic as well as future extensions, from colleagues at the 2017 & 2018 Columbia-Princeton Probability Day, 2017 Northeast Probability Seminar, 2018 Frontier Probability Days, 2017 & 2018 Finger Lakes Probability Seminar, and 2017 & 2018 Seminar on Stochastic Processes.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Berger et al. [2005] Noam Berger, Claire Kenyon, Elchanan Mossel, and Yuval Peres. Glauber dynamics on trees and hyperbolic graphs. Probability Theory and Related Fields , 131(3):311–340, 2005.
- 2Bernussou and Abatut [1977] Jacques Bernussou and Jean-Louis Abatut. Point mapping stability . Pergamon, 1977.
- 3Bhamidi et al. [2010] Shankar Bhamidi, Ram Rajagopal, and Sébastien Roch. Network delay inference from additive metrics. Random Structures & Algorithms , 37(2):176–203, 2010.
- 4Bleher et al. [1995] Pavel M Bleher, Jean Ruiz, and Valentin A Zagrebnov. On the purity of the limiting Gibbs state for the Ising model on the bethe lattice. Journal of Statistical Physics , 79(1-2):473–482, 1995.
- 5Borgs et al. [2006] Christian Borgs, Jennifer Chayes, Elchanan Mossel, and Sébastien Roch. The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels. In Foundations of Computer Science, 2006. FOCS’06. 47th Annual IEEE Symposium on , pages 518–530. IEEE, 2006.
- 6Chayes et al. [1986] JT Chayes, L Chayes, James P Sethna, and DJ Thouless. A mean field spin glass with short-range interactions. Communications in Mathematical Physics , 106(1):41–89, 1986.
- 7Daskalakis et al. [2006] Constantinos Daskalakis, Elchanan Mossel, and Sébastien Roch. Optimal phylogenetic reconstruction. In Proceedings of the thirty-eighth annual ACM symposium on Theory of computing , pages 159–168. ACM, 2006.
- 8Evans et al. [2000] William Evans, Claire Kenyon, Yuval Peres, and Leonard J Schulman. Broadcasting on trees and the Ising model. Annals of Applied Probability , pages 410–433, 2000.
