Broadcasting with Random Matrices
Charilaos Efthymiou, Kostas Zampetakis

TL;DR
This paper investigates the reconstruction problem for broadcasting models with random matrices on trees and sparse graphs, establishing thresholds related to spin-glasses and revealing phase transition coincidences.
Contribution
It introduces new thresholds for broadcasting with random matrices, extends analysis to Galton-Watson trees and random graphs, and develops novel estimators for complex spin-glass models.
Findings
Reconstruction thresholds extend Kesten-Stigum bounds.
Revealed phase transition coincidence in spin-glass models.
Developed new estimators for complex broadcasting models.
Abstract
Motivated by the theory of spin-glasses in physics, we study the so-called reconstruction problem for the related distributions on the tree, and on the sparse random graph . Both cases, reduce naturally to studying broadcasting models on the tree, where each edge has its own broadcasting matrix, and this matrix is drawn independently from a predefined distribution. In this context, we study the effect of the configuration at the root to that of the vertices at distance , as . We establish the reconstruction threshold for the cases where the broadcasting matrices give rise to symmetric, 2-spin Gibbs distributions. This threshold seems to be a natural extension of the well-known Kesten-Stigum bound which arises in the classic version of the reconstruction problem. Our results imply, as a special case, the reconstruction threshold for the well-known…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Broadcasting with Random Matrices
Charilaos Efthymiou∗ and Kostas Zampetakis∗
Charilaos Efthymiou, [email protected], The University of Warwick, Coventry, CV4 7AL, UK.
Kostas Zampetakis, [email protected], The University of Warwick, Coventry, CV4 7AL, UK.
Abstract.
Motivated by the theory of spin-glasses in physics, we study the so-called reconstruction problem on the tree, and on the sparse random graph . Both cases reduce naturally to analysing broadcasting models, where each edge has its own broadcasting matrix, and this matrix is drawn independently from a predefined distribution.
We establish the reconstruction threshold for the cases where the broadcasting matrices give rise to symmetric, 2-spin Gibbs distributions. This threshold seems to be a natural extension of the well-known Kesten-Stigum bound that manifests in the classic version of the reconstruction problem. Our results determine, as a special case, the reconstruction threshold for the prominent Edwards–Anderson model of spin-glasses, on the tree.
Also, we extend our analysis to the setting of the Galton-Watson random tree, and the (sparse) random graph , where we establish the corresponding thresholds. Interestingly, for the Edwards–Anderson model on the random graph, we show that the replica symmetry breaking phase transition, established by Guerra and and Toninelli in [21], coincides with the reconstruction threshold.
Compared to classical Gibbs distributions, spin-glasses have several unique features. In that respect, their study calls for new ideas, e.g. we introduce novel estimators for the reconstruction problem. The main technical challenge in the analysis of such systems, is the presence of (too) many levels of randomness, which we manage to circumvent by utilising recently proposed tools coming from the analysis of Markov chains.
∗Research supported by EPSRC New Investigator Award, grant EP/V050842/1, and Centre of Discrete Mathematics and Applications (DIMAP), University of Warwick, UK
1. Introduction
Motivated by the theory of spin-glasses in physics, we study the so-called reconstruction problem with respect to the related distributions, on the tree, and on the sparse random graph .
Spin-glasses are disordered magnetic materials that are studied by physicists (not necessarily the theoretical ones). It has been noted that even though they are a type of magnet, actually, “they are not very good at being magnets”. Metallic spin-glasses are “unremarkable conductors”, and the insulating spin-glasses are “fairly useless as practical insulators …”, e.g. see [30].
However, the research on spin-glasses has provided tools to analyse some exciting, and extremely challenging, problems in mathematics, physics, but also real world ones. Through their study, we have garnered a deep understanding of the nature of complex systems. A case in point is the pioneering work of Giorgio Parisi in ‘70s on the so-called Sherrington-Kirkpatrick spin-glass, which introduces the formulation of the renowned replica symmetry breaking [27]. Parisi’s ideas were highly influential in physics community, and later, in mathematics, and computer science. The theory of replica symmetry breaking was among the groundbreaking ideas which got Parisi the Nobel Prize in Physics in 2021.
Perhaps one of the most successful, and extensively studied spin-glass models, is the famous Edwards-Anderson model (EA-model for short), introduced back in ‘70s by Sam Edwards and Philip Anderson in [16]. Few months after the work of Edwards and Anderson, David Sherrington and Scott Kirkpatrick, in [28], introduced their own model of spin-glasses, the well-known in computer science literature, Sherrington-Kirkpatrick model (SK-model for short). As it turns out, the SK-model corresponds to the mean field version of the EA-model.
Given a fixed graph , the Edwards-Anderson model with inverse temperature , is the random Gibbs distribution on the configuration space defined as follows: let be independent identically distributed (i.i.d.) standard Gaussians. Then each configuration receives probability mass , defined by
[TABLE]
where stands for “proportional to”. We usually refer to as the coupling parameters. Let us comment here that, alternatively, the Gibbs distribution is defined by replacing the indicator in (1), with the product . However, the two formulations are equivalent, as a simple transformation converts one to the other (see Appendix A). We also note that there is a simpler version of the Edwards-Anderson model, in which coupling parameters take independently values, uniformly at random.
Apart from its mathematical elegance, and theoretical importance, the Edwards-Anderson model, and the related spin-glass distributions, arise also in applications such as neural networks (e.g. the so-called Hopfield model), protein folding, and conformational dynamics. We refer the interested reader to [30], and references therein.
In this work, we largely study the Edwards-Anderson model on trees, and the (locally tree-like) random graph with constant expected degree . This is the random graph on vertices, such that each edge appears independently with probability . Since the Edwards-Anderson model on shares essential features with random Constraint Satisfaction Problems (-CSPs for short), it is not surprising that has been studied extensively in terms of phase transitions, in physics, e.g. [19, 25], mathematics, e.g. [21, 12], but also in computer science, e.g. for sampling algorithms [17, 2].
In contrast to the standard Gibbs distributions on trees, e.g. the Ising model, the Hard-core model, and the Potts model, the Edwards-Anderson model, despite being the most basic distribution for spin-glasses, has not been sufficiently studied. As a result, several fundamental questions about it still remain open. Here, we consider the tree reconstruction problem for the Edwards-Anderson model (and some natural extensions).
The reconstruction problem studies the effect of the configuration at a vertex , on that of the vertices at distance from , as . Specifically, we want to distinguish the region of parameters where the effect is vanishing, from that where the effect is non-vanishing. Typically, the two regions are specified in terms of a sharp threshold, i.e., we have an abrupt transition from one region to the other as we vary the parameters of the model. We usually call this phenomenon reconstruction threshold, and it has been the subject of intense study, e.g. [26, 1, 22, 7, 29, 10]. In the context of r-CSPs, the onset of reconstruction has been linked to an abrupt deterioration of the performance of algorithms (both searching and counting), e.g. see [1].
In this work, among other results, we establish precisely the reconstruction threshold for the Edwards-Anderson model on the -ary tree, the Galton-Watson tree with general offspring distribution, and the random graph . Furthermore, as far as the Edwards-Anderson model on is concerned, we combine our results with [21, 12], to conclude that the reconstruction threshold coincides with the so-called Replica Symmetry Breaking phase transition.
Interestingly, for the -ary tree, we establish the reconstruction threshold, not only for the Edwards-Anderson model, but also for the general version of the Gibbs distribution defined in (1). That is, the coupling parameters are i.i.d. following a general distribution, not necessary the standard Normal.
It turns out that the corresponding reconstruction problems on the Galton-Watson tree with offspring, and on the sparse random graph , are not too different from each other. Connections have been established between these two Gibbs distributions, e.g. see [4, 15, 11, 14]. We relate the two reconstruction results, i.e., for the tree and the graph, by exploiting the idea of planted-model (Teacher-Student model [31]) and the notion of contiguity [12]. In that respect, our basic analysis involves the complete -tree, and the Galton-Watson tree, while, subsequently, we extend these results to the random graph .
We study the reconstruction problem on trees by means of the broadcasting models. These are abstractions of noisy transmitted information over the edges of the tree, i.e., the edges act as noisy channels. To our knowledge, the study of the broadcasting models, and the closely related reconstruction problem, dates back to ‘60s with the seminal work of Kesten and Stigum [24].
Establishing the reconstruction threshold for the Edwards-Anderson model on the -ary tree, as well as the generalisation of this distribution, turns out to be a challenging problem. The difficulty of these models stems from the manifestation of local frustration phenomena, i.e., mixed ferromagnetic and antiferromagnetic interaction in the same neighbourhood, but also from the “many levels of randomness” we need to deal with in their analysis.
To this end, we make an extensive use of various potentials in order to simplify the analysis. To establish non-reconstruction, we employ some newly introduced techniques in the area of Markov chains and Spectral Independence [3, 9], that combine potential functions to analyse tree recursions. To establish reconstruction, we use a carefully crafted potential as an estimator for the root configuration. We call this estimator flip-majority vote.
1.1. Broadcasting, Reconstruction and the Kesten-Stigum bound
Consider the -ary tree , of height . Let be the root of the tree . Broadcasting on , is a stochastic process which abstracts noisy transmission of information over the edges of the tree.
There is a finite set of spins , and an stochastic matrix , which we call the broadcasting matrix, or transition matrix. With the broadcasting we obtain a configuration by working recursively as follows: assume that the configuration at the root is obtained according to some predefined distribution over . If for the non-leaf vertex in we have , then for each vertex , child of , we have with probability , independently of the other children, i.e.,
[TABLE]
Here we assume that is distributed uniformly at random in .
A natural problem to study in this setting is the so-called reconstruction problem. Suppose that is the marginal distribution of the configuration of the vertices at distance from the root. The reconstruction problem amounts to studying the influence of the configuration at the root of the tree to the marginal . Specifically, we want to compare the two distributions , and for different , i.e., conditional on the configuration at the root being and , respectively. The comparison is by means of the total variation distance, i.e.,
[TABLE]
Typically, we focus on the behaviour of the quantity above, as grows.
Definition 1.1**.**
We say that the distribution exhibits reconstruction if there exist spins such that
[TABLE]
On the other hand, if for all the above limit is zero, then we have non-reconstruction.
The broadcasting process we describe above gives rise to well-known Gibbs distributions on such as the Ising model, the Potts model etc. In terms of the Gibbs distributions on the tree, the reconstruction problem can be formulated as to whether the free-measure on the tree is extremal, or not. The extremality here is considered with respect to whether the Gibbs distribution can be expressed as a convex combination of two, or more measures, e.g. see [20]. It is interesting to compare the extremality condition with various spatial mixing conditions of the Gibbs distribution. Perhaps the most interesting case is to compare it with the Gibbs tree uniqueness. Then, it is standard to show that the extremality is a weaker condition than uniqueness.
The reconstruction problem has been studied since 1960s. Perhaps the most general result in the area is the so-called Kesten-Stigum bound [24], or KS-bound for short. Let be such that
[TABLE]
where is the second largest, in magnitude, eigenvalue of the transition matrix . The result of [24] implies that if , then we have reconstruction.
In light of the above, a natural question is whether the condition implies that we have non-reconstruction. In general, the answer to this question is no, e.g. see [5, 29]. However, for several important distributions, including the Ising model, the KS-bound is tight, in the sense that the condition indeed implies non-reconstruction, see [7, 18, 22].
1.2. Broadcasting with random matrices
Here, we consider the natural problem of broadcasting on a tree, where the transition matrix is random. In this setting, as before, we consider the -ary tree , of height , rooted at . Also, we have a finite set of spins . Rather than using the same matrix for every edge of the tree, each edge has its own matrix, which is an independent sample from a predefined distribution .
More formally, every stochastic matrix can be viewed as a point in the Euclidean space. We endow the set of all stochastic matrices with the -algebra induced by the Borel algebra. Then, is a distribution over the set of these matrices.
Once we have a matrix for each edge of , the broadcasting proceeds with the same rules as in the deterministic case. If for the non-leaf vertex in we have , then the vertex , child of , gets with probability , independently of the other children of , i.e.,
[TABLE]
where .
The above setting gives rise to a random probability measure on the set of configurations which we denote as . Hence, the configuration we get from the broadcasting, consists of two-levels of randomness. The first level is due to the fact that the measure is induced by the random instances of the broadcasting matrices . Once these matrices have been fixed, the second level of randomness emerges from the random choices of the broadcasting process. The above formulation gives rise to well-studied Gibbs distributions, such as the Edwards–Anderson model of spin-glasses, by choosing appropriately the distribution .
In this new setting, we study the reconstruction problem. Here, the definition of reconstruction differs slightly from Definition 1.1 above. Denote with the marginal of on the vertices at distance from the root of the tree . Then, the reconstruction problem is defined as follows:
Definition 1.2**.**
For a distribution on stochastic matrices , we say that the random measure exhibits reconstruction if there exist spins such that
[TABLE]
where the expectation is with respect to the randomness of .
On the other hand, if for all the above limit is zero, then we have non-reconstruction.
We consider the reconstruction problem in terms of the KS-bound, i.e., we examine whether it is tight, or not. Before addressing this question, we need to specify what the parameter might be in this setting.
It turns out that a natural candidate for can be defined as follows: Let be a matrix sampled from the distribution , and define
[TABLE]
i.e., the matrix is the expectation of the tensor product of the matrix with itself. Let denote the vector whose entries are all equal to one. Also, write
[TABLE]
where is the standard inner product operation. Then, we define to be such that
[TABLE]
The above quantity, , arises in the study of phases transitions in random CSPs [12]. Specifically, it signifies an upper bound on the density of the so-called Replica Symmetric phase, of symmetric Gibbs distributions. The value is derived in [12] by means of a stability analysis of the so-called free-energy functional. Note that the above definition for applies to any set of spins , and any distribution on matrices.
Here, we prove that the above is indeed the analogue of KS-bound for symmetric, 2-spin distributions (including the EA model). That is, for any distribution over the broadcasting matrices whose support is comprised of symmetric matrices, we prove that the -ary tree exhibits reconstruction when , while we have non-reconstruction when .
Furthermore, we go beyond the basic case of the -ary tree. Firstly, we extend our results to the cases where the underlying graph is the Galton-Watson random tree with general offspring distribution. Secondly, we exploit the notion of contiguity of measures to derive non-reconstruction results for the Edwards-Anderson model on the random graph .
2. Results
We start the presentation of our results on the 2-spin, symmetric distributions, by considering the -ary tree. Specifically, for integers and , let be the -ary tree of height , rooted at vertex . We let be the set of spins.
Suppose that we have a broadcasting process on , while assume that each edge of the tree is equipped with its own broadcasting matrix, each matrix drawn independently from the distribution induced by the following experiment: We have two parameters, a real number , and a distribution on the real numbers , i.e., we have the probability space where is the -algebra induced by the Borel algebra. We generate a matrix following the two steps below:
**Step 1: **
Draw from the distribution .
**Step 2: **
Generate the matrix such that
[TABLE]
Note that our broadcasting matrices are always symmetric.
The above broadcasting process gives rise to configurations in following the Gibbs distribution specified as follows: Let be independent, identically distributed (i.i.d.) random variables such that each one of them is distributed as in (this is the same distribution used to generate matrix ). Each is assigned probability mass defined by
[TABLE]
where stands for “proportional to”.
At this point, it is immediate that by choosing to be the standard Gaussian distribution, we retrieve the Edwards-Anderson model in (1). Note however, that (9) above generates a whole family of “spin-glass” distributions with the EA-model being a special case.
The definition of the distribution of the broadcasting matrix in (8) allows us to derive an explicit formula for the quantity in (4). Specifically, for distributed according to , it is not hard to prove (see Appendix B) that
[TABLE]
where the expectation is with respect to the random variable . In light of the above, we prove the following result for the general Gibbs distribution.
Theorem 2.1**.**
For a real number , and a distribution on the real numbers let be defined as in (10).
For any integer , the Gibbs distribution , defined as in (9), on the -ary tree exhibits reconstruction. On the other hand, if the distribution exhibits non-reconstruction.
The proof of Theorem 2.1 appears in Section 5. Let us state the implications of Theorem 2.1 for the Edwards-Anderson model on the -ary tree.
Corollary 2.2**.**
For and the standard Gaussian , let
[TABLE]
where the expectation is with respect to .
For any integer , the distribution , the Edwards-Anderson model with inverse temperature on the -ary tree, exhibits reconstruction. On the other hand, if the distribution exhibits non-reconstruction.
2.1. The case of the Galton-Watson tree
As a further step, we study the reconstruction problem on the Galton-Watson tree. Even though this is a very interesting problem on its own, we make use of our results for the Galton-Watson tree to derive subsequent results for , see Section 2.2.
Let be a distribution over the non-negative integers. Then, the rooted tree is a Galton-Watson tree with offspring distribution , if the number of children for each vertex in is distributed according to , independently from the other vertices.
Note that broadcasting with random matrices over the Galton-Watson tree , gives rise to configurations that consist of three levels of randomness. One of the challenges we circumvent with our analysis, is to disentangle all of three levels of randomness, and make clear the contribution of each one of them. Before getting there, we need to clarify what we mean by (non-)reconstruction in the current setting.
Definition 2.3**.**
Consider the distributions over and over , and a real number . Let the Galton-Watson tree with offspring distribution , while let the measure be defined as in (9), on the tree . We say that exhibits reconstruction if
[TABLE]
On the other hand, if the above limit is zero, then we have non-reconstruction.
For the above, recall that is the marginal of on the set of vertices at distance from the root. Note that if has no vertex at level , then the total variation distance above is, degenerately, equal to zero. We use the double expectation in Definition 2.3 for the sake of clarity: we can just replace it by a single expectation with respect to both the random tree , and the random measure .
As far as the reconstruction problem on the Galton-Watson trees is concerned, we have the following result, which we prove in Section 8.
Theorem 2.4**.**
For any real numbers , for any distribution on , for any distribution on with expectation and bounded second moment, let be the Galton-Watson tree with offspring distribution . Let also be the Gibbs distribution defined as in (9), on the tree . Finally, let be defined as in (10).
The distribution exhibits reconstruction if . On the other hand, if , the distribution exhibits non-reconstruction.
Let us now state the implications of Theorem 2.4 for the Edwards-Anderson model on the Galton-Watson tree.
Corollary 2.5**.**
For , consider the quantity defined in Corollary 2.2. For any real number , and any distribution with expectation , and bounded second moment, let be the Galton-Watson tree with offspring distribution .
Then, for the Edwards-Anderson model with inverse temperature , on the tree , the following is true. The distribution exhibits reconstruction if . On the other hand, if , the distribution exhibits non-reconstruction.
2.2. The Edwards-Anderson model on
For integer , and real , let be the random graph on , whose edge set is obtained by including each edge with probability, independently.
The Edwards-Anderson model on at inverse temperature , is defined as follows: for a family of independent standard Gaussians, we let
[TABLE]
where
[TABLE]
Here we assume that , where is a fixed number. Typically, we study this distribution as . The natural question we ask here is how does the model change as we vary . According to the physics predictions, for any there exists a condensation threshold, denoted as , where the function
[TABLE]
is non-analytic [19]. This conjecture was proved by Guerra and Toninelli [21]. The regime is called the replica symmetric phase. This region has several interesting properties; here we consider one that seems to be most relevant to our discussion. For any the distribution satisfies the following property: for distributed as in , for two randomly chosen vertices and , the configurations and are asymptotically independent. Formally, the above can be expressed as follows: for and any , we have that
[TABLE]
where denotes expectation with respect to the Gibbs distribution . Note that the above holds not only for pairs of vertices, but also for sets of vertices, for any fixed integer . Using our notation, the work by Guerra and Toninelli [21] implies the following result.
Theorem 2.6** ([21]).**
For any , for the distribution defined as in (11), we have that
[TABLE]
where is a standard Gaussian random variable.
Interestingly, one obtains the above by combining our Theorem 2.4 and using standard results from [12, 13]. Our main focus is on the reconstruction threshold for the Edwards-Anderson model on . The reconstruction for is defined in a slightly different way than what we have for the random tree.
Definition 2.7**.**
For , for , consider the Gibbs distribution as this is defined in (11). We say that the measure exhibits reconstruction if
[TABLE]
where denote the Gibbs marginal at the vertices at distance from vertex . On the other hand, if the above limit is zero, then we have non-reconstruction.
Perhaps, it is interesting to notice the order with which we take the double limit in the above definition.
Furthermore, we let the reconstruction threshold, denoted as , to be the infimum over such that
[TABLE]
The region of values of such that is called the non-reconstruction phase. It is immediate from Definition 2.7 that, for any , we have that non-reconstruction.
In the following result, we prove that the replica symmetric phase coincides with the non-reconstruction phase of the Edwards-Anderson model on .
Theorem 2.8**.**
For any , for the distribution defined as in (11), we have that .
The above follows from Theorems 2.6, 2.5 and [12, Corollary 1.5].
Notation
For the graph and the Gibbs distribution on the set of configurations . For a configuration , we let denote the configuration that specifies on the set of vertices . We let denote the marginal of at the set . We let , denote the distribution conditional on the configuration at being . Also, we interpret the conditional marginal , for , in the natural way.
3. Approach
A major challenge in our setting is that we have to deal with multiple levels of randomness, i.e., we have two levels of randomness in the case of the -ary tree, while the levels increase with the Galton-Watson trees or . To circumvent this problem, we follow an analysis that allows us to disentangle the different sources of randomness in our models. In this section, we provide a high-level description of our approach. We restrict our discussion on the -ary tree.
Non-reconstruction
Consider the -ary tree rooted at . Suppose that we have a distribution as in (9) on , while assume that each edge has its own coupling parameter . Assume, for the moment, that the coupling parameters at the edges are fixed, e.g. the reader may assume that are arbitrary real numbers. That is, each can be either positive, or negative. Hence, one might consider the aforementioned distribution as a non-homogenous Ising model which involves both ferromagnetic and anti-ferromagnetic interactions. Let us focus on non-reconstruction. We derive an upper bound on
[TABLE]
which is expressed in terms of the influence between neighbouring vertices. The notion of influence between vertices is the same as the one developed in the context of Spectral Independence technique for establishing rapid mixing of Glauber dynamics [3, 9]. These influences are used in the context of the so-called down-up coupling to establish non-reconstruction. This is a coupling approach from [6], which also relies on ideas in [29].
Let us be more specific. For the probability measure we consider, let be the ratio of Gibbs marginals at the root defined by
[TABLE]
Recall that denotes the marginal of the Gibbs distribution at the root .
For a vertex , we let be the subtree of that includes , and all its descendants. Also, we let be the ratio of marginals at vertex , where the Gibbs distribution is, now, with respect to the subtree .
Suppose that the vertices are the children of the root . Our focus is on expressing recursively, as a function of . Note that we study the logarithm of the ratios involved, which can be viewed as applying the potential function to the tree recursions. We have that where
[TABLE]
Note that is the coupling parameter that corresponds to the edge between the root with its child .
All the above extends naturally in the case where we impose boundary conditions. That is, for a region , and , we define the ratio of marginals at the root, where now the ratio is between the conditional marginals and . The recursive function for the conditional ratios is exactly the same as the one above.
Our interest is on the gradient of the function . Specifically, for every , we let
[TABLE]
It turns out that, in our case, has a simple form
[TABLE]
Utilising the idea of down-up coupling from [6], we prove the following:
[TABLE]
where denotes the set of vertices at distance from the root . Note that the above provides a bound for the total variation distance of the the marginals for fixed, i.e., non-random, couplings . Inequality (15), extends naturally when we study reconstruction for the distribution defined in (9), i.e., when the coupling parameters are i.i.d. samples from a distribution . Indeed, averaging yields
[TABLE]
where we have , for each . Note that the above holds, since each depends only on , while the coupling parameters are assumed to be independent with each other.
At this point, and since the ’s are identically distributed, we further observe that for any , we have that
[TABLE]
Since the underlying tree is -ary, it is immediate to see that for , the r.h.s. of (16) tends to zero as . From this point on, it is standard to prove non-reconstruction.
Our analysis allows to deal with the randomness of the spin-glass measure by utilising the bound in (15). That is, the upper bound on the total variation distance has a nice product form of the quantities , which, in turn, expresses the dependence of the total variation distance on the edge couplings . This product form of the bound, behaves rather nicely when we need to take averages over the randomness of the coupling parameters of the the spin-glass measure .
Reconstruction
In the reconstruction regime, the configuration at the root has a non-vanishing effect on the configuration of the vertices at distance , regardless of the height . Specifically, the corresponding leaf configurations from the measure conditioned on root’s spin being , and , are so different with each other, that any discrepancies cannot be attributed to random fluctuations. Therefore, a question that naturally arises is how can we take advantage of the discrepancies so that we infer the spin of the root.
For the standard ferromagnetic Ising, several approaches have been developed to establish reconstruction (see [18], [8], [23]). Here, we build on an elegant argument in [18]. The authors in this work, show that a simple majority vote of the leaf spins, conveys information sufficient to reconstruct root’s spin, The majority vote on the leaves is defined by
[TABLE]
The estimation rule is to infer that the spin at the root is , i.e., the sign of . Impressively, it turns out that this estimator is optimal, i.e., it coincides with the maximum likelihood one. For the -ary tree, one establishes reconstruction for the ferromagnetic Ising model by employing a second moment argument on the estimator .
For the distributions we consider here, the above estimator is far from sufficient. This is due to various facts. Firstly, we allow for mixed couplings on the edges, i.e., certain edges can be ferromagnetic, and others can be anti-ferromagnetic. Secondly, the strength of the interaction, i.e., the magnitude of ’s, is expected to vary from one edge to the other. To this end, we introduce a new estimator, and we establish reconstruction by building on the second moment argument from [18]. The starting point towards deriving this estimator, comes from just considering the standard anti-antiferromagnetic Ising. The statistic from (17), clearly does not work for this distribution. However, there is an easy remedy, by taking into account the parity of the height , i.e., if is an even, or an odd number. We infer that the spin at the root is equal to , where
[TABLE]
For the spin-glass distributions we consider here, we need to get the above idea even further. Firstly, in order to accommodate the mixed ferromagnetic and anti-ferromagnetic couplings on the edges of the tree. It seems meaningful to use the estimator for the root configuration, where
[TABLE]
with denoting the set of edges along the unique path connecting to . So that in , for each leaf we essentially examine the parity of the number of antiferromagnetic couplings along the path that connects it to the root. Unfortunately, for the above estimator, our second moment argument does not seem to work all that well.
The estimator we end up using, is a reweighted version of , which we call the “flip majority” vote, and is defined by
[TABLE]
Note that the absolute value of the weight for the edge , above, coincides with the quantity in (16). Naturally, the estimation rule is to infer that the root spin is .
4. Tree recursions and Influences
What follows applies to any kind of tree. For the sake of simplicity, in this section, we consider the –ary tree rooted at . Suppose that we are given the number , while each edge has its own coupling parameter, . Assume, for the moment, that the coupling parameters at the edges are fixed, i.e., they are arbitrary real numbers. Given and , we consider the Gibbs distribution similarly to the one we have in (9). That is, every gets a probability mass defined by
[TABLE]
For a region and , we consider the ratio of marginals at the root such that
[TABLE]
Recall that denotes the marginal of the Gibbs distribution at the root . Also, note that the above allows for , when .
For a vertex , we let be the subtree of that includes , and all its descendants. We always assume that the root of is the vertex . With a slight abuse of notation, we let denote the ratio of marginals at the root for the subtree , where the Gibbs distribution is, now, with respect to .
Suppose that the root is of degree , while let the vertices be its children. We express it terms of ’s by having , for
[TABLE]
For the analysis that follows, we get cleaner results by equivalently working with log-ratios rather than ratios of Gibbs marginals. Let , which means that is such that
[TABLE]
From the above, it is elementary to verify that . The above transformation is standard in the literature, and can be viewed as applying the potential function in the tree recursion. For every , we let
[TABLE]
The quantities arise naturally in various settings in our analysis. Specifically, we use the theorem below, which follows as corollary from results in [3, 9].
Theorem 4.1**.**
For , consider the tree and for fixed . Let the Gibbs distribution on , defined as in (18).
For any two vertices , for any , and any the following holds:
[TABLE]
where is the set of edges along the path from to in , while ’s are defined in (20).
Specifically, Theorem 4.1 is a direct consequence of Lemma B.2 in [3], and Lemma 15 in [9]. For the distributions we consider in this work, it turns out, that the quantities have a simple form which, somehow, is a reminiscent of the quantity in (10).
Claim 4.2**.**
For , consider the quantity defined in (20). We have that
[TABLE]
Proof of Claim 4.2.
The derivations below are standard and we present them for the sake of our work being self-contained. For and , let be the function
[TABLE]
It is easy to verify that . It is also straightforward to see that for any real function we have
[TABLE]
so that
[TABLE]
Now let also , so that
[TABLE]
and notice that we want to show
[TABLE]
First, if , then (23) gives
[TABLE]
Assume now . Differentiating gives
[TABLE]
Since , and , we observe that vanishes only at , and in particular, must be the only sign alternation point of . Finally, it is elementary to check that
[TABLE]
Therefore, [math] and must be the global optima of . Hence, (23) yields
[TABLE]
as desired. ∎
5. Theorem 2.1 - Proof of non-reconstruction.
In order to prove Theorem 2.1, first consider the distribution we define in (18), in Section 4. That is, for a tree rooted at , assume that we are given the parameters and , such that . Note that are fixed real constants, i.e., they are not random numbers.
We define the Gibbs distribution on the tree such that each is assigned probability measure such that
[TABLE]
For two vertices in , write for the set of edges in the unique path from to . Building on Theorem 4.1, for the aforementioned distribution we have the following result:
Theorem 5.1**.**
For integer , , and such that , let be an arbitrary tree of height , rooted at vertex , and let the Gibbs distribution on be defined as in (24).
We have that
[TABLE]
where is the influence of edge defined in (22), and is the set of vertices at distance from the root.
For the above, recall that is the marginal of on the set of vertices at distance from the root, i.e., the set . In light of Theorem 5.1, the non-reconstruction part of Theorem 2.1 follows as a corollary.
Proof of Theorem 2.1 - Non-Reconstruction.
Consider the Gibbs distribution on the -ary tree , and let be defined as in (10). We need to show that for we have
[TABLE]
Given the -algebra generated by the coupling parameters , from Theorem 5.1, we have that
[TABLE]
where recall that is the set of vertices at distance from the root , while for every we have that
[TABLE]
For the sake of brevity, we let
[TABLE]
Then, from (27) we have that
[TABLE]
where the expectation is with respect to random variable . We derive the r.h.s. of the equation above using the observation that each depends only on , and the coupling parameters , are assumed to be independent with each other.
Furthermore, our assumption that , corresponds to having that . Hence, there exists such that
[TABLE]
Using the above, (29), and the fact that is -ary, and hence, the size of is , we get that
[TABLE]
Invoking Markov’s inequality we further get that
[TABLE]
or, since , we equivalently have that
[TABLE]
Furthermore, since and , we have
[TABLE]
The above implies (26), and concludes the non-reconstruction part of Theorem 2.1. ∎
6. Proof of Theorem 5.1
Recall that we are dealing with the Gibbs distribution on the tree of height . With respect to and every edge of the tree, we obtain the influence in the standard way.
To prove Theorem 5.1, we use the idea of down-up coupling from [6], which also relies on ideas in [29]. To this end, let us introduce a few notions. For , we let be the distribution on the configuration at the root of the tree that is induced by the following experiment. Recall that is the set of vertices at distance from the root. First, we obtain the configuration on the tree from the measure , where
[TABLE]
Next, we erase all the assignments apart from those at the vertices in . Then, we obtain a new configuration, , from the distribution , i.e., the distribution conditional on the configuration of set be as in . With the measure we denote the distribution of , i.e., the assingment of at the root .
Recall now that for , we write for the marginal of at the vertices at distance from the root, conditioned on . The following lemma was essentially proved for standard Gibbs distributions in [6]. For the sake of completeness, we present our own proof for the spin-glasses in the Appendix C.
Lemma 6.1** ([6] ).**
For integer , let be an arbitrary tree of height , rooted at vertex . For any , for any with , let the Gibbs distribution on be defined as in (24). Then,
[TABLE]
We prove the upper bound in (25) be means of the Lemma 6.1, i.e., by bounding appropriately the quantity on the r.h.s. of (30). Specifically, we use the bound obtained in the following proposition.
Proposition 6.2**.**
For integer , let be an arbitrary tree of height , rooted at vertex , and write for the set of vertices at distance from the root. For each , let be the influence of edge , given by (22). Then,
[TABLE]
where , are as in Lemma 6.1.
The proof of Proposition 6.2 appears in Section 7. Now, Theorem 5.1 follows by plugging (31) into (30), i.e., we have that
[TABLE]
7. Proof of Proposition 6.2
For , recall that be the marginal of on , conditional on the configuration at being . For any configuration , also recall that is the marginal of at the root , conditional on the configuration at being . In order to prove Proposition 6.2 we use the following result.
Lemma 7.1**.**
For integer , let be an arbitrary tree of height rooted at vertex , and write for the vertices at distance from the root. For any , for any such that , let the Gibbs distribution on be defined as in (24).
Then, for any distribution , coupling of the marginals and , we have
[TABLE]
Proof.
We have that
[TABLE]
The second derivation is due the triangle inequality, while last equality holds since is a coupling of and , and thus for any we have that
[TABLE]
Furthermore, note that for any , we have that
[TABLE]
The above follows from the definition of , and the law of total probability. Plugging (34) into (33), we get that
[TABLE]
The above concludes the proof of Lemma 7.1. ∎
Lemma 7.1 implies the following technical result, which we prove in Subsection 7.1 below.
Proposition 7.2**.**
For any distribution , coupling of the marginals and , the following is true:
[TABLE]
where is obtained by by changing the configuration at , from to its opposite.
Proposition 6.2 follows by bounding appropriately the r.h.s. of the inequality above. Specifically, from Theorem 4.1 we have that
[TABLE]
where recall that denotes the set of edges on the path from the root to the vertex .
Moreover, we show that for any we have
[TABLE]
It is immediate that Proposition 6.2 follows from Proposition 7.2 and (36), (37).
We prove (37) by explicitly describing a coupling that achieves the aforementioned bound. We call this coupling the “Down Coupling”.
Down Coupling
Recall that we want to couple the distributions and . Instead, we couple and , i.e., rather than coupling the conditional Gibbs marginals at , we couple the conditional measure. The coupling of and , trivially, specifies a coupling for their marginals at .
Write for the coupling of and we wish to define. We specify by describing how we generate two configurations which are distributed as in . We generate the two configurations inductively. In order to specify the configurations for the vertices at level of the tree, we use the configurations at level . Suppose that we need to decide the configuration for vertex , while we already have the configurations for vertex , the parent of , i.e., we have both and . Then, we use maximal coupling for the configuration at vertex , i.e., couple the distributions and so that is minimized. This implies that
[TABLE]
With the above coupling, we need to find an upper bound for , where . Ideally, we would like to get the one in (37).
For two vertices in the tree, and , such that is the parent of , we have the following: Given , then in the above coupling we have that
[TABLE]
whereas,
[TABLE]
where the last equality is due to Claim 4.2. All the above imply that if the coupling generates a disagreement at vertex , i.e., , then the disagreement propagates at with probability . From this point on, it is elementary to verify that (37) is true, concluding the proof of Proposition 6.2.
7.1. Proof of Proposition 7.2
Recall that is the set of vertices at distance from the root. Consider an enumeration of the vertices in , e.g., we have , where . For any two configurations , and any , we define the interpolating sequence as follows: for any we have such that
[TABLE]
Note that , while . Also note that any two and may be equal.
Lemma 7.3**.**
For any distribution , coupling of the marginals and , the following is true:
Consider distributed as in , and consider also the interpolating sequence
[TABLE]
that is induced by , . We have that
[TABLE]
where the expectation is with respect to and of the interpolating sequence .
Proof.
From Lemma 7.1, we have that
[TABLE]
The last inequality above follows from triangle inequality. Furthermore, the last inequality is equivalent to the following one:
[TABLE]
The lemma follows by applying the linearity of expectation on the inequality above. ∎
From Lemma 7.3 we get the following:
[TABLE]
The last derivation follows by noting that , if and only if, we have . All the above conclude the proof of Proposition 7.2.
8. Proof of Theorem 2.4 - Proof of Non-Reconstruction for Galton-Watson
First, let us briefly recall what we want to prove. For any real numbers , and , for any distribution on let be defined as in (10). For any offspring distribution (on ), with expectation and bounded second moment, let be the Galton-Watson tree with offspring distribution , and let the Gibbs distribution be defined as in (9) , on the tree .
We want to show that if , then exhibits non-reconstruction i.e.,
[TABLE]
Using Theorem 5.1, which holds for arbitrary trees and taking expectations we have that
[TABLE]
where denotes the set of vertices at distance from the root. Working out the r.h.s. of (40), using the law of total expectation, while conditioning on , we get
[TABLE]
since for fixed , the influences are independent, and identically distributed. Recalling now that , (notice that does not depend on ), and that the offspring distributions of the vertices of is with expectation , we can rewrite (41) as
[TABLE]
Per our assumption , there exists such that . Combining the above with (40) we get that
[TABLE]
Invoking Markov’s inequality, similarly to the proof of the non-reconstruction claim of Theorem 2.1 in Section 5, we get that
[TABLE]
so that taking limits as goes to infinity, gives the desired result.
9. Theorem 2.1 - Proof of reconstruction.
Here we prove the reconstruction part of Theorem 2.1. Before we delve into the proof, let us recall our setup. For an integer , let be the -ary tree rooted at . Let also be a distribution on , and let be i.i.d. random variables, each distributed as in . For a real number , recall that the probability measure on is defined by
[TABLE]
In this setting, for distributed as in we define
[TABLE]
where the expectation is over the random variable . For an integer , we write for the set of vertices at distance from the root . We also write , and for the marginal of on the set conditioned on root being and , respectively. We want to prove that if , then exhibits reconstruction, i.e.,
[TABLE]
To distinguish between the two layers of randomness considered here (spin configurations are random having distribution , and itself is random as is a random variable), we use to denote expectation with respect to the measure , and reserve for expectations taken with respect to the random variable of the couplings .
In the same spirit as in [18], we show that in order to establish (44), it is sufficient to find a real function on whose expected values with respect to measures , and , differ significantly, while its second moment with respect to is not much larger than the square of the first moment. In particular, we show the following technical result.
Theorem 9.1**.**
For integer , let be an arbitrary tree of height , rooted at vertex , and let be the set of vertices at distance from . For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
Then, for any real function defined on spin configurations of , we have that
[TABLE]
where is with respect to the couplings on the edges of , induced the measure .
The proof of Theorem 9.1 appears in Section 10. We now wish to define a real function on spin configurations of , whose ratio in the r.h.s. of (45) is bounded away from zero. To this end, we define the “signed influence” of an edge to be
[TABLE]
Observe that due to Claim 4.2, we have that the relationship between , defined in above, and , defined by (14) in Section 4, is simply
[TABLE]
Definition 9.2**.**
For integer , let be an arbitrary tree, rooted at vertex , and let be the set of vertices of at distance from the root. For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
The flipped majority vote is the function with
[TABLE]
where be defined as in (46).
The following proposition expresses the enumerator and denominator of ratio in the r.h.s. of (45) for the flipped majority vote, , defined above, in terms of the edge influences . For two vertices of , write for the set of edges along the unique path between and , and write for the common ancestor of and farthest from the root .
Proposition 9.3**.**
For integer , let be an arbitrary tree of height , rooted at vertex , and let be the set of vertices at distance from the root of . For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
Then,
[TABLE]
and
[TABLE]
where is the influence of edge defined in (22).
The proof of Proposition 9.3 appears in Section 11. Finally, for the case of -ary tree, and using Proposition 9.3, we have the following lemma
Lemma 9.4**.**
For integers , let be the -ary tree of height , rooted at vertex , and let be the set of vertices at distance from the root of . For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
Let also be defined as in (43), and let be the flipped majority vote defined in (48). Suppose that . Then, we have that
[TABLE]
where is defined by .
We prove Lemma 9.4 in Section 12. The reconstruction claim of Theorem 2.1, follows now readily from Theorem 9.1 and Lemma 9.4.
Proof of Theorem 2.1 - Reconstruction..
Consider the Gibbs distribution defined as in (42), on the -ary tree . We need to show that for , where is defined as in (43), we have that
[TABLE]
where is taken with respect to the random variables . Let now be the flipped majority vote defined as in (48). Applying Theorem 9.1, which holds for any real function on , on , gives
[TABLE]
Since , there exist a such that . Applying Lemma 9.4 on the r.h.s. of the above gives further that
[TABLE]
Taking limits, yields trivially
[TABLE]
as desired, concluding the proof of the reconstruction claim of Theorem 2.1. ∎
10. Proof Of Theorem 9.1
Let us first introduce some additional notation. For integer , let be an arbitrary tree of height rooted at vertex , and let be the set of vertices at distance from the root of . Given a function defined on the spin configurations of , write for the range of . For , and , let us define
[TABLE]
where we used the notation .
Recall also that denotes the marginal of at the root , while and denote the marginals of at conditioned on and , respectively.
Lemma 10.1**.**
For integer , let be arbitrary tree of height rooted at vertex , and let be the set of vertices at distance from the root of . For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
Then, for any , we have that
[TABLE]
Proof.
First, we observe that the l.h.s. of (54) can be expressed as total variation distance. In particular, we have that
[TABLE]
where recall that is the marginal of at the root , while and denote the measure conditional on and , respectively. Indeed, we have
[TABLE]
where (56) and (57) follow from Bayes’ rule, and the fact that .
Recall now that the total variation distance of two measures and , defined on the same probability space , can be equivalently defined as
[TABLE]
Given a subalgebra , let and denote the restrictions of and on , respectively. Then
[TABLE]
Observe now that and are precisely the restrictions of and on the -algebra generated by the function , respectively. Hence, per the above and (55), we have that
[TABLE]
The above concludes the proof of Lemma 10.1. ∎
As usual, it is easier to handle squares than absolute values. Observing that
[TABLE]
we have that
[TABLE]
which further implies
[TABLE]
Finally, we need to prove the following lemma.
Lemma 10.2**.**
For integer , let be arbitrary tree of height rooted at vertex , and let be the set of vertices at distance from the root of . For any , for any distribution on , let the Gibbs distribution on be defined as in (42).
Then, for any , we have that
[TABLE]
Recall that the expectation is taken w.r.t. the coupling parameters in .
Proof.
Expanding the enumerator of the fraction in the right hand side of (59) gives
[TABLE]
where the last equality follows from Bayes’ rule. Applying now the Cauchy-Schwartz inequality with factors
[TABLE]
we further get that
[TABLE]
The above concludes the proof of Lemma 10.2. ∎
Theorem 9.1 now follows from (58) and Lemma 10.2.
11. Proof of Proposition 9.3
Let be an arbitrary tree rooted at . Let also be a distribution on , and let be i.i.d. random variables, each distributed as in . For a real number , recall that the probability measure on is defined by
[TABLE]
Recall also that for each we have defined
[TABLE]
and we use to denote the set of edges along the unique path between and . Finally, recall that denotes the measure conditional on , for . We now have the following lemma.
Lemma 11.1**.**
Let be an arbitrary tree, and let be the Gibbs measure on defined as in (61), and be as in (62). Then, for any two vertices of , and we have
[TABLE]
Proof.
Let be the unique path from to , and write for the set of vertices along that path, apart from . We now see that for any we have that
[TABLE]
We now prove (63) by induction on the distance between and . For the base case, corresponds to and being adjacent vertices. Then, for any , equation (64) becomes
[TABLE]
as desired.
Assume now (63) holds for any pair of vertices whose distance is at most . Let , be a pair of vertices of distance . In particular, let be the (unique) path from to , and write , and . From (64) we have that for
[TABLE]
where the last equality follows from the Markov property of the model. Pushing now forward the sum over the configurations of we further get
[TABLE]
where denotes the measure conditional on , and we get the last equality from (64). Expanding now the sum over , we further get
[TABLE]
where (65) follows from the inductive hypothesis applied on vertices and , while (66) follows from the definition of in (62). ∎
Using Lemma 11.1, we now prove the following lemma about pairwise spin correlations.
Lemma 11.2**.**
Let be any finite tree, and let be the Gibbs measure on defined as in (61), and be as in (62). Then, for any two vertices of , we have
[TABLE]
Proof.
Indeed,
[TABLE]
where , denote measure conditional on being and , respectively. We get (67) by the law of total probability, i.e., we condition on the spin of , and use the fact that . Also, (68) follows from Lemma 11.1. All the above conclude the proof of Lemma 11.2. ∎
Recall that , and denote the marginals of on , conditioned on , and , respectively. Finally, let be the signed influence of edge , defined as in (62), and be the flipped majority vote introduced in Definition 9.2.
We start by applying Lemmas 11.1, and 11.2, to calculate the first moments of with respect to the measures , and . We have that
[TABLE]
where the first equality follows from linearity of expectation, the second equality by applying Lemma 11.1, and the last equality is due to (47). Similarly,
[TABLE]
It is now easy to derive (49) as
[TABLE]
where the first equality follows from (69) and (70). To get the last equality, we use the linearity of expectation, and the fact that the couplings , (and thus, also ), are independent.
We now use Lemma 11.2 to calculate the second moment of with respect to . Expanding , we have that
[TABLE]
where the first equality follows from the linearity of expectation, while the second equality follows from Lemma 11.2. Recalling that denotes the common ancestor of and farthest from the root , we can rewrite the above as
[TABLE]
We are now ready to prove (50). Per (71) we have that
[TABLE]
where the first equality follows from the linearity of expectation, and the second from the fact that the couplings , (and thus, also ), are independent. This concludes the proof of Proposition 9.3.
12. Proof of Lemma 9.4
For integers , let now be the -ary tree rooted at , and be the Gibbs measure on defined as in (61). Let also be the set of vertices at distance from the root . By Proposition 9.3 we have that
[TABLE]
Recalling that , we have that
[TABLE]
where to get the first equality of (72) we observe that . Writing now for the vertices of at distance from the root , and reorganising the sum in the denominator of (72) with respect to the common ancestor , we get that
[TABLE]
which, due to our assumption that , for some , further simplifies to
[TABLE]
Plugging now the above into (72) we finally get
[TABLE]
All the above complete the proof of Lemma 9.4.
13. Proof of Theorem 2.4 - reconstruction for the Galton-Watson Tree
Let us briefly recall our setup. For any real number , for , for any distribution on , and any offspring distribution with expectation and bounded second moment, let be defined as in (10). Let be the Galton-Watson tree with offspring distribution , while let the Gibbs distribution , defined as in (9) , on the tree . For an integer , write for the set of vertices at distance from the root of (notice that is a random variable here). We also write and for the marginal of measure on the set , conditioned on the root of being and , respectively.
We want to show that if , the distribution exhibits reconstruction, i.e.,
[TABLE]
We start by noticing that Theorem 9.1 can be extended to Galton-Watson trees. That is, we have that for any real function defined on spin configurations of , we have that
[TABLE]
In fact, the proof (73) is almost identical to that of Theorem 9.1, the only difference being that at the very last step of the proof, we apply Cauchy-Schwartz to an expression with an additional sum (due to ). Hence, all it remains to do is to lower bound the rhs of (73) away from zero.
From Proposition 9.3 and conditioning over the random tree , we get that
[TABLE]
Given a random tree , the influences, , are independent. Moreover, recalling that , (notice that does not depend on ), we can further simplify the above as follows:
[TABLE]
Since from this point on we are left only with expectations with respect to , we drop the subscript in . Reorganising the sum of in the denominator in the last equation above, similarly to the proof of Lemma 9.4, and writing for the descendants of at distance ( without subscript refers to descendants of the root ), we get that
[TABLE]
Let us now focus on the expectation in the r.h.s. of the above equation. In particular, we invoke the law of total expectation conditioning on the set , comprised of the vertices at distance from the root
[TABLE]
where (75) follows from the linearity of expectation, and the fact that the offsprings of each vertex are identically distributed (and hence, the random variable coincides with , for all vertices ). We now estimate the inner expectation of (75) conditioning on .
[TABLE]
where (76) follows from the linearity of expectation, and the fact that the offsprings of each vertex are independent (and thus, the inner expectation of products becomes the product of the corresponding expectations), and identically distributed (and hence, ).
Putting them all together, we have that (74), (75),(76), and (77) yield
[TABLE]
Due to the fact that the offsprings of vertices in are i.i.d., we observe that for any , we have that , and thus, we can further simplify the above as follows
[TABLE]
Per our hypothesis, , and thus, , for some bounded number . Moreover, we have that , and thus, there exist a , such that . With that in mind, we further bound the above as
[TABLE]
This concludes the proof of the reconstruction claim of Theorem 2.4.
Appendix A Equivalence of Indicator and Product Gibbs distribution
Let be a graph, and let be arbitrary couplings over the edges of . For , let us write , and for the Gibbs distributions over , defined by the indicator, and product formulation, respectively. That is, for every , we have
[TABLE]
We will prove that , for every . Indeed, let be arbitrary, then
[TABLE]
Since , , are probability measures, we conclude that , as desired.
Appendix B KS-Bound Derivation
First, note that since is symmetric, must be symmetric as well. In particular, we have that
[TABLE]
It is also easy to check that the for any matrix with the same pattern on its entries we have the following
Observation B.1**.**
The spectrum of every matrix, , of the following form
[TABLE]
is precisely . In particular, every eigenvalue of is a linear combination of its elements.
Note that both , and , are of the form (79). In the following lemma we show that Observation B.1 allows us to change the order of averaging and taking eigenvalues of .
Lemma B.2**.**
Let , be as in Observation B.1. Then, for every we have that
[TABLE]
Proof.
Since both , and , are of the form (79), each is a linear combination of their entries, and thus, the result follows by the linearity of expectation. ∎
Let us now recall that equation (10) defins as follows
[TABLE]
where , and . Since is of the form (79), and in particular symmetric, it is easy to argue, e.g. using Courant-Fisher theorem, that the solution to the maximisation in (10) must be
[TABLE]
Using now Lemma B.2 we get that . Substituting the entries of from (78), yields , as desired.
Appendix C Proof of Lemma 6.1
Proof.
First, let us recall that we denote with the set of vertices at distance from the root . Also, for and , we write , and , for the marginal of on the root, conditioned on , and the marginal of on the the set , conditioned on , respectively. We now have that
[TABLE]
where (82) follows from Bayes’ rule, and (83) is due to the fact that . Next, we observe that
[TABLE]
Using the above, we also get that
[TABLE]
where (85) follows from (34), while (86) is due to the Bayes’ rule, and the fact that . Finally, we get (87) from the observation (84). The result now follows from the Cauchy-Schwartz inequality. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Dimitris Achlioptas and Amin Coja-Oghlan. Algorithmic barriers from phase transitions. In 2008 49th Annual IEEE Symposium on Foundations of Computer Science , pages 793–802. IEEE, 2008.
- 2[2] Ahmed El Alaoui, Andrea Montanari, and Mark Sellke. Sampling from the Sherrington-Kirkpatrick Gibbs measure via algorithmic stochastic localization. In 63rd IEEE Annual Symposium on Foundations of Computer Science, FOCS 2022, Denver, CO, USA, October 31 - November 3, 2022 , pages 323–334. IEEE, 2022. doi:10.1109/FOCS 54457.2022.00038 . · doi ↗
- 3[3] Nima Anari, Kuikui Liu, and Shayan Oveis Gharan. Spectral independence in high-dimensional expanders and applications to the hardcore model. SIAM Journal on Computing , 0(0):FOCS 20–1–FOCS 20–37, 2021. doi:10.1137/20M 1367696 . · doi ↗
- 4[4] Victor Bapst, Amin Coja-Oghlan, and Charilaos Efthymiou. Planting colourings silently. Combinatorics, probability and computing , 26(3):338–366, 2017.
- 5[5] Nayantara Bhatnagar, Allan Sly, and Prasad Tetali. Reconstruction threshold for the hardcore model. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques: 13th International Workshop, APPROX 2010, and 14th International Workshop, RANDOM 2010, Barcelona, Spain, September 1-3, 2010. Proceedings , pages 434–447. Springer, 2010.
- 6[6] Nayantara Bhatnagar, Juan Vera, Eric Vigoda, and Dror Weitz. Reconstruction for colorings on trees. SIAM Journal on Discrete Mathematics , 25(2):809–826, 2011.
- 7[7] Pavel M Bleher, Jean Ruiz, and Valentin A Zagrebnov. On the purity of the limiting Gibbs state for the Ising model on the Bethe lattice. Journal of Statistical Physics , 79:473–482, 1995.
- 8[8] Christian Borgs, Jennifer Chayes, Elchanan Mossel, and Sébastien Roch. The Kesten-Stigum reconstruction bound is tight for roughly symmetric binary channels. In 2006 47th Annual IEEE Symposium on Foundations of Computer Science (FOCS’06) , pages 518–530. IEEE, 2006.
