The Lov\'asz Theta Function for Random Regular Graphs and Community Detection in the Hard Regime
Jess Banks, Robert Kleinberg, Cristopher Moore

TL;DR
This paper investigates the limitations of the Lovász theta function and sum-of-squares proofs in refuting k-colorability in random regular graphs, revealing computational hardness in certain regimes and providing bounds related to graph girth.
Contribution
It establishes bounds on the degree for which the Lovász theta function can refute k-colorability, showing failure above the phase transition and linking to community detection hardness.
Findings
Refutation fails above the k-colorability transition.
Refutation fails below the Kesten-Stigum threshold.
Provides explicit bounds on theta for regular graphs with given girth.
Abstract
We derive upper and lower bounds on the degree for which the Lov\'asz function, or equivalently sum-of-squares proofs with degree two, can refute the existence of a -coloring in random regular graphs . We show that this type of refutation fails well above the -colorability transition, and in particular everywhere below the Kesten-Stigum threshold. This is consistent with the conjecture that refuting -colorability, or distinguishing from the planted coloring model, is hard in this region. Our results also apply to the disassortative case of the stochastic block model, adding evidence to the conjecture that there is a regime where community detection is computationally hard even though it is information-theoretically possible. Using orthogonal polynomials, we also provide explicit upper bounds on for regular graphs of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The Lovász Theta Function for Random Regular Graphs
and Community Detection in the Hard Regime
Jess Banks
,
Robert Kleinberg
and
Cristopher Moore
Abstract.
We derive upper and lower bounds on the degree for which the Lovász function, or equivalently sum-of-squares proofs with degree two, can refute the existence of a -coloring in random regular graphs . We show that this type of refutation fails well above the -colorability transition, and in particular everywhere below the Kesten-Stigum threshold. This is consistent with the conjecture that refuting -colorability, or distinguishing from the planted coloring model, is hard in this region. Our results also apply to the disassortative case of the stochastic block model, adding evidence to the conjecture that there is a regime where community detection is computationally hard even though it is information-theoretically possible. Using orthogonal polynomials, we also provide explicit upper bounds on for regular graphs of a given girth, which may be of independent interest.
Dept. of Mathematics, University of California-Berkeley, Berkeley CA
Dept. of Computer Science, Cornell University, Ithaca NY
Santa Fe Institute, Santa Fe NM
1. Introduction
Many constraint satisfaction problems have phase transitions in the random case: as the ratio between the number of constraints and the number of variables increases, there is a critical value at which the probability that a solution exists, in the limit , suddenly drops from one to zero. Above this transition, most instances are too constrained and hence unsatisfiable. But how many constraints do we need before it becomes easy to prove that a typical instance is unsatisfiable? When is there likely to be a short refutation, which we can find in polynomial time, proving that no solution exists?
For a closely related problem, suppose that a constraint satisfaction problem is generated randomly, but with a particular solution “planted” in it. Given the instance, can we recover the planted solution, at least approximately? For that matter, can we tell whether the instance was generated from this planted model, as opposed to an un-planted model with no built-in solution? We can think of this as a statistical inference problem. If there is an underlying pattern in a dataset (the planted solution) but also some noise (the probabilistic process by which the instance is generated) the question is how much data (how many constraints) we need before we can find the pattern, or confirm that one exists.
Here we focus on the -colorability of random graphs, and more generally the community detection problem. Let denote the Erdős-Rényi graph with vertices and average degree . A simple first moment argument shows that with high probability is is not -colorable if
[TABLE]
(We say that an event on graphs of size holds with high probability if , and with positive probability if .) Sophisticated uses of the second moment method [8, 24] show that this is essentially tight, and that the -colorability transition occurs at
[TABLE]
Now consider the planted coloring model, where we choose a coloring uniformly at random and condition on the event that is proper. If , then is probably not -colorable, while graphs drawn from the planted model are -colorable by construction. Thus, above the -colorability transition, we can tell with high probability whether was drawn from the planted or un-planted model by checking to see if is -colorable. However, searching exhaustively for -colorings would take exponential time.
A similar situation holds for the stochastic block model, a model of graphs with community structure also known as the planted partition problem (see [47, 2] for reviews). For our purposes, we will define it as follows: fix a constant , and say a partition of the vertices into groups is “good” if a fraction of the edges connect vertices within groups. Equivalently, if has edges, is a multiway cut with edges crossing between groups. Generalizing the planted coloring model where , the block model chooses uniformly, and conditions on the event that is good. The cases and , where vertices are more or less likely to be connected to others in the same group, are called assortative (or ferromagnetic) and disassortative (or antiferromagnetic) respectively.
Two natural problems related to the block model are detection, i.e., telling with high probability whether was drawn from the block model or from , and reconstruction, finding a partition which is significantly correlated with the planted partition . (This is sometimes called weak reconstruction to distinguish it from finding exactly, which becomes possible when [16, 1, 3, 30, 31, 9, 50].) Both problems become information-theoretically possible at a point called the condensation transition [39, 22, 19], and the first and second moment methods [12] show that this scales as
[TABLE]
where hides a multiplicative constant. As in -coloring this is roughly the first-moment bound above which, with high probability, no good partitions exist in . However, the obvious algorithms for detection and reconstruction, such as searching exhaustively for good partitions or sampling from an appropriate Gibbs distribution [6, 4], require exponential time.
In fact, conjectures from statistical physics [40, 25, 26] suggest this exponential difficulty is sometimes unavoidable. Specifically, these conjectures state that polynomial-time algorithms for detection and reconstruction exist if and only if is above the Kesten-Stigum threshold [34, 35],
[TABLE]
Several polynomial-time algorithms are now known to succeed whenever , including variants of belief propagation [49, 5] and spectral algorithms based on non-backtracking walks [48, 38, 43, 17]. Moreover, for we know that the information-theoretic and Kesten-Stigum thresholds coincide [51]. Comparing (2) and (3) we see that for any we have for sufficiently large , and in fact this occurs for some when and more generally when [6, 4, 12].
Thus in the regime , detection and reconstruction are information-theoretically possible, but are conjectured to be computationally hard. In particular, this conjecture implies that there is no way to refute the existence of a coloring, or of a good partition, whenever , even when is large enough so that a coloring or partition probably does not exist. Our goal in this paper is to rule out spectral refutations based on the Lovász theta function, or equivalently sum-of-squares proofs of degree two.
For technical reasons, we focus on random -regular graphs, which we denote . A series of papers applying the first and second moment methods in this setting [46, 7, 33, 21] have determined the likely chromatic number of for almost all , showing that the critical for -colorability is just as for . (There are a few values of and where could be -colorable with probability strictly between [math] and , so this transition might not be completely sharp.)
We define the -regular block model by choosing a planted partition uniformly at random and conditioning on the event that is good. Equivalently, we choose uniformly from all -regular graphs such that a fraction of their edges connect vertices within groups. We claim that our results also apply to the regular block model proposed in [51] where -regular graphs are chosen with probability proportional to : in that case, the fraction of within-group edges fluctuates, but is with high probability.111These models are not to be confused with a stricter model, where for some constants each vertex in group has exactly neighbors in group [18, 23, 53, 15]. Our model only constrains the total number of edges within or between groups. We again conjecture that refuting the existence of a coloring or a good partition is exponentially hard below the Kesten-Stigum bound. Since the branching ratio of a -regular tree is , in the regular case this becomes
[TABLE]
Main results. The Lovász function, which we review below, gives a lower bound on the chromatic number which can be computed in polynomial time. In particular, if , this provides a polynomial-time refutation of ’s -colorability. We first prove that this type of refutation exactly corresponds to sum-of-squares proofs of degree two in a natural encoding of -colorability as a system of polynomials; this is intuitive, but it does not seem to have appeared in the literature. We then show the following bounds on the likely value of when is a random -regular graph.
Theorem 1**.**
Let be constant. For any constant , with high probability
[TABLE]
As a consequence, the Lovász function cannot refute -colorability with high probability if
[TABLE]
and in particular if is below the Kesten-Stigum threshold.
Rearranging, no refutation of this kind can exist when
[TABLE]
Our lower bound on follows easily from Friedman’s theorem [29] on the spectrum of . For the upper bound, we first use orthogonal polynomials to derive explicit bounds on for arbitrary regular graphs of a given girth—which may be of independent interest—and then employ a concentration argument for .
We also relate the Lovász function to the existence of a good partition in the disassortative case of the block model, giving
Theorem 2**.**
Fix and say a partition is good if a fraction of its edges connect endpoints in the same group. Then sum-of-squares proofs of degree two cannot refute the existence of a good partition in if
[TABLE]
Thus degree-two sum-of-squares cannot distinguish the regular stochastic block model from until is roughly a factor of above the Kesten-Stigum threshold.
Related work. The distribution of for the Erdős-Rényi graph and the random -regular graph were studied in [20]. In particular, that work showed that when is sufficiently large, with high probability for a constant . Our results tighten this lower bound, making the constant explicit, and provide a nearly-matching upper bound.
Our results on the power of degree-two sum-of-squares refutations for -colorability contribute to a recent line of work on refutations of random CSPs, which we briefly survey. If we define the density of a CSP as the ratio of constraints to variables—which for coloring equals half the average degree of the graph—then the conjectured hard regime for -coloring corresponds to a range of densities bounded below and above by constants (i.e., depending on but not ). For CSPs such as -SAT and -XOR, there is again a satisfiability transition at constant density, but with high probability sum-of-squares refutations with constant degree do not exist unless the density is much higher, namely [55], a result which was recently extended to general CSPs whose constraint predicate supports a -wise uniform distribution [36]. Conversely, if a predicate does not support a -wise uniform distribution, then [10] shows that there is an efficient sum-of-squares refutation when the density is . For coloring, this gives refutations at roughly constant density; our contribution makes this a nearly-precise constant in the special case of degree-two sum-of-squares on random regular graphs.
The hidden clique problem also has a conjectured hard regime. It is well known that the random graph has no cliques larger than [28] but it is conjectured to be computationally hard to distinguish from a graph with a planted clique of size . A sequence of progressively stronger sum-of-squares lower bounds for this problem [27, 32, 45] have culminated in the theorem that with high probability the degree- sum-of-squares proof system cannot refute the existence of a clique of size in for some constant [13].
In contrast to the aforementioned work on refuting random -CSPs and planted cliques, our result pertains to a much more specific pair of problems, namely -coloring and the stochastic block model, and only to degree-two sum-of-squares refutations; but it attains a sharp bound, within an additive constant, on the density at which these refutations become possible. We conjecture that sum-of-squares refutations of any constant degree do not exist below the Kesten-Stigum threshold, but it seems difficult to extend our current techniques to degree higher than two.
2. Colorings, Partitions, and the Lovász Function
2.1. Background on sum-of-squares
One type of refutation which has gained a great deal of interest recently is sum-of-squares proofs: see [14] for a review. Suppose we encode our variables and constraints as a system of polynomial equations on variables, for all . One way to prove that no solution exists—in algebraic terms, that this variety is empty—is to find a linear combination of the which is greater than zero for all . Moreover, the positivstellensatz of Krivine [37] and Stengle [57] shows that a polynomial is nonnegative over if and only if it is a sum of squares of polynomials. Thus we need polynomials and and a constant (which we can always scale to if we like) such that
[TABLE]
This proof technique is complete as well as sound. That is, there is such a set of polynomials and if and only if no solution exists.
Even when the are of low degree, the polynomials and might be of high degree, making them difficult to find. However, we can ask when a refutation exists where both sides of (5) have degree or less. As we take we obtain the SOS hierarchy. The case is typically equivalent to a familiar semidefinite relaxation of the problem. More generally, a degree- refutation exists if and only if a certain semidefinite program on variables is feasible: thus we can find degree- refutations, or confirm that they do not exist, in time [56, 52, 54, 41]. To see why, note that if we write a polynomial as a bilinear form on monomials of degree ,
[TABLE]
then is a sum of squares of degree polynomials if and only if the matrix is positive semidefinite, or equivalently if is the sum of positive symmetric rank-one matrices. These are outer products of vectors with themselves, so there are vectors such that and where . Finally, the constraint that for some and some corresponds to a set of linear inequalities on the entries of .
The dual object to a degree- refutation is a pseudoexpectation. This is a linear operator on polynomials of degree at most with the properties that
[TABLE]
If we write as a bilinear form on monomials , then (6) and (7) are linear constraints on its entries, and (8) states that this matrix is positive semidefinite. The resulting SDP is dual to the SDP for refutations, so each of these SDPs is feasible precisely when the other is not. Thus there is a degree- refutation if and only if no degree- pseudoexpectation exists, and vice versa.
We can think of a pseudoexpectation as a way for an adversary to fool the SOS proof system. The adversary claims there are are many solutions—even if in reality there are none—and offers to compute the expectation of any low-degree polynomial over the set of solutions. As long as (6) and (7) hold, this appears to be a distribution over valid solutions, and as long as (8) holds, the SOS prover cannot catch the adversary in an obvious lie like the claim that some quantity of degree has negative variance.
2.2. Colorings, partitions, and sum-of-squares
For a given graph with adjacency matrix , we can encode the problem of -colorability as the following system of polynomial equations in variables , where indexes vertices and indexes colors:
[TABLE]
Then is -colorable if and only if (9)–(11) has a solution in . We can encode the stochastic block model similarly: fix , and recall that a partition of into groups is good if a fraction of the edges have endpoints in the same group. If has edges, we can replace constraint (11) with
[TABLE]
A degree- sum-of-squares refutation of (9)–(11) is an equation of the form
[TABLE]
where are polynomials over , is a sum of squares of polynomials, is a small positive constant which we will omit when clear, and the degree of each side is at most . Such an equation is a proof that no coloring exists. Replacing with gives a refutation of the system formed by (9), (10), and (12), proving that no good partition exists. We focus on refutations of degree two, which as we will see are related to a classic relaxation of graph coloring.
2.3. The Lovász function
An orthogonal representation of a graph with vertices is an assignment of a unit vector to each vertex such that \big{\langle}u_{i},u_{j}\big{\rangle}=0 for all . The Lovász function, denoted by convention, is the smallest for which there is an orthogonal representation and an additional unit vector such that \big{\langle}u_{i},\mathfrak{z}\big{\rangle}=1/\sqrt{\kappa}: that is, such that all the lie on a cone222To see that this definition of is equivalent to the more common one that \big{\langle}u_{i},\mathfrak{z}\big{\rangle}\leq 1/\sqrt{\kappa} for every , i.e., where the can be in the interior of this cone, simply rotate each in the subspace perpendicular to its neighbors until \big{\langle}u_{i},\mathfrak{z}\big{\rangle} is exactly . of width .
The Gram matrix P_{ij}=\big{\langle}u_{i},u_{j}\big{\rangle} of an orthogonal representation is positive semidefinite with and for . Adding an auxiliary row and column for the inner products with , we can define in terms of a semidefinite program,
[TABLE]
where is the matrix of all s and \big{\langle}A,B\big{\rangle}=\operatorname{tr}(A^{\dagger}B)=\sum_{i,j}A_{ij}B_{ij} denotes the matrix inner product.
If is -colorable then , since we can use the first basis vectors as an orthogonal representation and take . Thus if , the Lovász function gives a polynomial-time refutation of -colorability. As stated above, degree-two sum-of-squares proofs typically correspond to well-known semidefinite relaxations, and the next theorem shows that this is indeed the case here.
Theorem 3**.**
There is a degree-2 SOS refutation of -colorability for a graph if and only if .
We prove this in the Appendix, where we show that any orthogonal representation of that lies on an appropriate cone lets us define a pseudoexpectation for the system (9)–(11). This will also allow us to modify the SDPs for refutations and pseudoexpectations, and work with simplified but equivalent versions.
2.4. Good partitions and a relaxed Lovász function
The reader may have noticed that while the coloring constraint (11) fixes the inner product \sum_{c}x_{i,c}x_{j,c}=\big{\langle}x_{i},x_{j}\big{\rangle} to zero for each edge , the “good partition” constraint (12) only fixes the sum of all these inner products. This suggests a slight relaxation of the Lovász function, where we weaken the SDP (14) by replacing the individual constraints on for all with a constraint on their sum. In other words, we allow a vector coloring where neighboring vectors are orthogonal on average. We denote the resulting function :
[TABLE]
Since is a relaxation of , we always have .
This modified Lovász function is equivalent to degree-two SOS for good partitions in the dissasortative case of the block model, in the following sense.
Theorem 4**.**
If , there exists a degree-two SOS refutation of a partition of where a fraction of the edges are within groups if and only if
[TABLE]
Once again we leave the proof to the Appendix. Note that the SDP (16) for contains no information about or : this relaxed orthogonal representation has the uncanny capacity to fool degree-two SOS about an entire family of related cuts of different sizes and qualities.
2.5. Upper and lower bounds
With these theorems in hand, we can set about producing degree-two sum-of-squares refutations and pseudoexpectations for our problems; throughout this section we will refer to these simply as ‘refutations’ and ‘pseudoexpectations’. In fact, the same construction will give us asymptotically optimal refutations and pseudoexpectations for both the coloring and partition problems.
To warm-up, we have the following simple construction of a refutation, which we will phrase in terms of the Lovász theta function and its relaxed version.
Lemma 1**.**
Let be a -regular graph, and let be the smallest eigenvalue of its adjacency matrix . Then
[TABLE]
Proof.
We construct a feasible solution to the dual SDP (17) by taking
[TABLE]
and use the fact that \big{\langle}A,\mathbb{J}\big{\rangle}=dn. ∎
By invoking Friedman’s theorem [29] that (as ) the smallest eigenvalue of a random -regular graph is with high probability larger than for any , we obtain:
Corollary 1**.**
When , for any , with high probability
[TABLE]
Putting this together with Theorems 3 and 4 gives
Corollary 2**.**
If and , with high probability there exists a refutation of a partition with a fraction of within-group edges when
[TABLE]
Setting , a refutation of -colorability exists with high probability when
[TABLE]
Note that for large , the minimum value of satisfying (21) is a factor of four above the Kesten-Stigum threshold in both the coloring and partition problems.
Our construction for this lower bound on is quite simple, but remarkably we find that for both the coloring and partition problems, it is asymptotically optimal in and . In particular,
Theorem 5**.**
For any -regular graph with girth at least , we have
[TABLE]
where is a sequence of constants which decrease to zero as .
Since for any constant a random regular graph has girth with positive probability [59, Theorem 2.12], we rely on the following result showing that is concentrated in an interval of width one. The proof is essentially the same as that of [7] for the chromatic number, and is given in the Appendix.
Lemma 2**.**
Let . If with positive probability, then with high probability.
Corollary 3**.**
If , with high probability there does not exist a refutation of a partition with a fraction of within-group edges when
[TABLE]
Setting , with high probability no refutation of -colorability exists when
[TABLE]
Thus for both problems, no degree-two sum-of-squares refutation exists until is roughly a factor of above the Kesten-Stigum threshold.
3. Constructing a Pseudoexpectation with Orthogonal Polynomials
We now prove Theorem 5 by constructing a feasible solution to the primal SDP (14): that is, unit vectors such that \big{\langle}u_{i},u_{j}\big{\rangle}=0 for every edge , and a unit vector so that \big{\langle}u_{i},\mathfrak{z}\big{\rangle}=1/\sqrt{\kappa} for all . Recall that such a collection exists if and only if .
It is convenient to instead define a set of unit vectors such that \big{\langle}v_{i},v_{j}\big{\rangle}=-1/(\kappa-1) for every edge . We claim that such a set exists if and only if . In one direction, given and with the above properties, if we define
[TABLE]
then the are unit vectors with \big{\langle}v_{i},v_{j}\big{\rangle}=-1/(\kappa-1) for . For instance, if the are orthogonal basis vectors, then the point to the corners of a -simplex. In the other direction, given we can take to be a unit vector perpendicular to all the , and define
[TABLE]
Then \big{\langle}u_{i},u_{j}\big{\rangle}=0 for , and \big{\langle}u_{i},\mathfrak{z}\big{\rangle}=1/\sqrt{\kappa} for all . This means that we can characterize the Lovász function with a slightly different SDP, which uses the Gram matrix of the :
[TABLE]
We will show that for any -regular graph with girth at least , this SDP has a feasible solution with
[TABLE]
where depends only on and tends to zero as . Therefore, there is a pseudoexpectation that prevents degree-two SOS from refuting -colorability for any . We will construct this pseudoexpectation by taking a linear combination of the “non-backtracking powers” of ’s adjacency matrix .
Denote by the matrix whose entry is the number of non-backtracking walks of length from to ; that is, walks which may freely wander the graph so long as they do not make adjacent pairs of steps for any vertices . There is a simple two-term recursion for these matrices: to count non-backtracking walks of length , we first extend each walk of length by one edge, and then subtract off those that backtracked on the last step. This gives
[TABLE]
Borrowing notation from [11], we can write in closed form as
[TABLE]
where is a polynomial of degree . Specifically,
[TABLE]
and for the satisfy the Chebyshev recurrence
[TABLE]
We can write explicitly as
[TABLE]
and is the th Chebyshev polynomial of the second kind (note that ).
Let denote the Kesten-McKay measure on the interval , which after scaling by describes the typical spectral density of a random regular graph [44]:
[TABLE]
Then the polynomials are orthonormal with respect to this measure. That is, if we define the inner product
[TABLE]
then
[TABLE]
If the girth of the graph is at least , there is no way for a non-backtracking walk of length or less to return to its starting point or to a neighbor of its starting point, so \big{\langle}\mathbbm{1},A^{(t)}\big{\rangle}=\big{\langle}A,A^{(t)}\big{\rangle}=0 for . We can thus satisfy the diagonal and edge constraints of (24) by considering solutions of the form
[TABLE]
since the first two terms ensure that has s on its diagonal and on the edges. If we write
[TABLE]
our job is to optimize the coefficients for so as to minimize , and hence , while ensuring that .
The eigenvalues of the matrix are of the form where ranges over all of ’s eigenvalues. Therefore, if and only if for all eigenvalues of . Friedman’s celebrated theorem [29] shows that, with high probability, the eigenvalues of are contained in the set
[TABLE]
for any . Thus we require that
[TABLE]
We will relax this condition slightly by demanding just that is nonnegative on , although as we will see the resulting optimum is achieved by a function which is nonnegative on all of . First we use orthonormality (29) to write the coefficients as inner products,
[TABLE]
Then we optimize the pseudoexpectation as follows,
[TABLE]
When the degree of is even, we can solve this optimization problem explicitly. Set , and let be the roots of in decreasing order; it follows from standard arguments about orthogonal polynomials that these are all in the support of , i.e., in the interval . Consider the following polynomial of degree ,
[TABLE]
where
[TABLE]
is a normalizing factor to ensure that \big{\langle}q_{0},s\big{\rangle}=1. We claim that is the optimum of (33). To prove this, we begin with a general lemma on orthogonal polynomials and quadrature. The proof is standard (e.g. [58]) but we include it in the Appendix for completeness.
Lemma 3**.**
Let be a sequence of polynomials of degree which are orthogonal with respect to a measure supported on a compact interval . Then the roots of form a quadrature rule which is exact for any polynomial of degree less than , in that
[TABLE]
for some positive weights independent of .
Now let . In view of Lemma 3, for any polynomial of degree at most , the inner product can be expressed using the roots of as a quadrature,
[TABLE]
Note that for every , since is the left-most root. If impose the constraints that for all , then . If we also impose the constraint , then
[TABLE]
with equality if and only if for all . Since obeys this equality condition, we have
[TABLE]
and this is the minimum possible value of subject to the constraints that and for . Moreover, on all of , so in fact obeys the stronger constraint (32).
Referring back to (31) gives
[TABLE]
and so
[TABLE]
Finally, we obtain (22) by defining . Since as tends to infinity333The fact that as can be deduced, for example, by using the definition of in (27) to observe that and have opposite signs, and then applying the intermediate value theorem. , we have as , completing the proof.
We end with a brief note on the above construction. Recall that our project for the last several pages has been to set the coefficients of non-backtracking paths of length in a feasible solution to the SDP (24),
[TABLE]
As discussed in the Appendix, this matrix can be translated into a degree-two pseudoexpectation for the coloring problem: a linear operator that claims to give the joint distribution of colors at at each pair of vertices and . The reader will find there that is related to the ‘pseudocorrelation’ between vertices and , by
[TABLE]
Our expansion of in terms of non-backtracking paths means that, for most pairs , this pseudoexpectation depends only on the shortest path distance . Specifically, whenever and the shortest path is unique, we have , and if then . One might think that in the limit of large , the optimal pseudoexpectation would make the natural choice that : in that case, the pseuodocorrelation would decay just as if these shortest paths were colored uniformly at random, ignoring correlations with the remainder of the graph. However, a quick calculation shows that this choice is in fact not optimal. In fact, the optimal coefficients we derive above cause the pseudocorrelation to decay more quickly with distance than this naïve guess, namely (in the limit of large and large girth) as .
Acknowledgements
We are grateful to Charles Bordenave, Emmanuel Abbe, Amin Coja-Oghlan, Yash Deshpande, Marc Lelarge, and Alex Russell for helpful conversations. Part of this work was done while C.M. was visiting École Normale Supérieure. Part of this work was done while R.K. was a researcher at Microsoft Research New England. C.M. is supported by the John Templeton Foundation and the Army Research Office under grant W911NF-12-R-0012.
Appendix A Proof of Theorems 3 and 4
We prove Theorems 3 and 4 by directly simplifying the SDP that defines feasible degree-two pseudoexpectations. The first step is a broad result on the structure of these objects that applies to any set of constraints which includes the boolean (9) and single-color (10) constraints and is suitably symmetric; we then specialize to the coloring and partition problems.
Recall that a degree-two pseudoexpectation for a system of polynomials is a linear operator which satisfies
- •
- •
for any polynomials and such that
- •
for any polynomial with
We can identify such objects with PSD matrices of the form
[TABLE]
where and . It is useful to think of as a block matrix, with a block corresponding to each pair of vertices . Consistency with the boolean and single-color constraints (9), (10) then controls the diagonal elements and row and colum sums of each of these blocks,
[TABLE]
Moreover, each of our constraints is fixed under permutations of the colors, and inherits this symmetry. That is the matrix carries with it a natural action that simultaneously permutes and . This action preserves the spectrum of as a matrix, as well as every hard constraint. By convexity, we may assume that is stabilized under it, by beginning with an arbitrary pseudoexpectation and averaging over its orbit.
This assumption substantially constrains and simplifies . In particular we are free to (i) assume that and (ii) assume that each block in has only two distinct values: one on the diagonal and the other off the diagonal. In other words, the pseudoexpectation claims that the marginal distribution of each vertex is uniform, and that joint marginal of any two vertices depends only on the probability that they have the same or different colors. As a result, for each we can assume that is a linear combination of the identity matrix and the matrix of all 1s, and that the row and column sums of are all . In that case for each we can write
[TABLE]
for some , or equivalently that
[TABLE]
for some matrix . Note that
[TABLE]
so (37) requires that for all .
Since the pseudoexpectation (36) consists of with an additional row and column, we consider the following lemma. We leave its proof as an exercise for the reader.
Lemma 4**.**
For any matrix , vector and scalar ,
[TABLE]
if and only if .
Since is the -dimensional vector whose entries are all , we have . Thus (40) and Lemma 4 imply that if and only if
[TABLE]
Since is a projection operator, this in turn occurs if and only if
[TABLE]
To summarize, finding a pseudoexpectation is equivalent to finding a PSD matrix with for all , such that remains PSD when we subtract the rank-one matrix . However, we have thus far only reasoned about the boolean and single color constraints, and including either the coloring or cut constraint places an additional restriction on . In the case of coloring, we demanded that
[TABLE]
for every edge . This implies that , and so for each edge. Collecting these observations, a pseudoexpectation for coloring exists exactly when , where
[TABLE]
Finally, note that where . Applying Lemma 4 again then gives exactly the PSD (14) for the Lovasz function, thus completing the proof of Theorem 3.
In the case of good partitions, we required that
[TABLE]
but this means that
[TABLE]
Following the path above, a degree-two pseudoexpectation exists for community detection when , where
[TABLE]
A priori, it seems that we may need to solve a different SDP for each value of , but a bit more work shows that this is not the case. Lemma 4 lets us transform the SDP (16) for to the following problem,
[TABLE]
The following lemma then shows us how to relate optima of (45) to those of (44) for any in the disassortative range , thus completing the proof of Theorem 4.
Lemma 5**.**
For any ,
[TABLE]
Proof.
We show how to translate back and forth between solutions of (44) and (45). Given a matrix , define
[TABLE]
It is easy to check that if and only if , and if and only if . Finally, if we set
[TABLE]
then
[TABLE]
so if and only if . Thus (45) is feasible for if and only if (44) is feasible for . Since and are the smallest and respectively for which this is the case, (47) implies (46). ∎
Appendix B Proof of Lemma 3
It is immediate that there is such a quadrature rule for polynomials of degree strictly less than , since the space of linear functionals on such polynomials has dimension and is thus spanned by the linearly independent functionals which evaluate at the roots . Now let . We can divide by to write where . We have
[TABLE]
since is orthogonal to all polynomials of degree less than and has roots . This verifies exactness of the quadrature rule for polynomials of degree smaller than .
To show that the weights are positive, let and let be the polynomial with double roots at every root of save . Since is everywhere nonnegative and is a polynomial of degree , we have
[TABLE]
but since is nonnegative, must be positive.
Appendix C Proof of Lemma 2
The proof closely follows [7, Theorem 4] which shows that the chromatic number of is concentrated on two adjacent integers, and which is in turn based on the proof in [42] of two-point concentration for with . Recall the configuration model [59], where we make “copies” of each vertex corresponding to its half-edges, and then choose uniformly from all perfect matchings of these copies. If we denote the set of such matchings by and condition the corresponding multigraphs on having no self-loops or multiple edges, the resulting distribution is uniform on the set of -regular graphs, and occupies a constant fraction of the total probability of . Thus any property which holds with high probability for holds with high probability for as well.
If are two perfect matchings in , we write if they differ by a single swap, changing to or . The following martingale inequality [59, Theorem 2.19] shows that a random variable which is Lipschitz with respect to these swaps is concentrated.
Lemma 6**.**
Let be a constant, and let be a random variable defined on such that whenever . Then
[TABLE]
Now fix , and define as the minimum number of edge constraints in the SDP (14) violated by an otherwise feasible solution with . This meets the Lipschitz condition with . By assumption with positive probability. Lemma 6 then implies that (say) , in which case with high probability.
Let denote the set of endpoints of the violated edges. Then there is an orthogonal representation of the subgraph induced by and a unit vector such that \big{\langle}u_{i},\mathfrak{z}\big{\rangle}=1/\sqrt{\theta} and \big{\langle}u_{i},u_{j}\big{\rangle}=0 if and . Our goal is to “fix” on the violated edges, and if necessary on some additional vertices, to give an orthogonal representation for all of .
As in [7, 42], we inductively build a set of vertices as follows. Given , let where , , and and each have at least one neighbor in . We define as the step at which there is no such pair and this process ends. Let denote ’s neighborhood, i.e., the set of vertices outside which have a neighbor in . Then is an independent set, since otherwise the process would have continued. We make the following claim:
Lemma 7**.**
With high probability, the subgraph induced by is 3-colorable.
Proof.
For all we have . Moreover, the subgraph induced by has at least edges and thus average degree at least . On the other hand, a crude union bound shows that for any and any , there is an such that, with high probability, all induced subgraphs of containing or fewer vertices have average degree less than . Since with high probability, this implies that for all , and in particular that .
The same union bound then implies that with high probability the subgraph induced by , and all its subgraphs, have average degree less than . But this means that this subgraph has no 3-core: that is, it has at least one vertex of degree less than 3, and so will the subgraph we get by deleting this vertex, and so on. Working backwards, we can 3-color the entire subgraph by starting with the empty set and adding these vertices back in, since at least one of the three colors will always be available to them. ∎
To define our orthogonal representation, let be a unit vector such that \big{\langle}\mathfrak{z},w\big{\rangle}=\big{\langle}u_{i},w\big{\rangle}=0 for all ; such a vector exists since . Then define
[TABLE]
Then , and \big{\langle}w,\mathfrak{z}^{\prime}\big{\rangle}=\big{\langle}u_{i},\mathfrak{z}^{\prime}\big{\rangle}=1/\sqrt{\theta+1} for all . Moreover, there exist three mutually orthogonal unit vectors such that \big{\langle}y_{j},\mathfrak{z}^{\prime}\big{\rangle}=1/\sqrt{\theta+1} and \big{\langle}y_{j},w\big{\rangle}=0 for all . This follows from the fact that the following matrix is PSD whenever , in which case it can be realized as the Gram matrix of :
[TABLE]
Finally, let be a proper 3-coloring of the subgraph induced by . Then the following is an orthogonal representation of ,
[TABLE]
and \big{\langle}v_{i},\mathfrak{z}^{\prime}\big{\rangle}=1/\sqrt{\theta+1} for all . This gives a feasible solution to the SDP (14) with , implying that .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Abbe, A.S. Bandeira, and G. Hall. Exact recovery in the stochastic block model. IEEE Transactions on Information Theory , 62(1):471–487, 2016.
- 2[2] Emmanuel Abbe. Community detection and stochastic block models: recent developments. J. Machine Learning Research , 2017. to appear.
- 3[3] Emmanuel Abbe and Colin Sandon. Community detection in general stochastic block models: Fundamental limits and efficient algorithms for recovery. In Proc. 56th Annual Symposium on Foundations of Computer Science, FOCS , pages 670–688, 2015.
- 4[4] Emmanuel Abbe and Colin Sandon. Detection in the stochastic block model with multiple clusters: proof of the achievability conjectures, acyclic BP, and the information-computation gap. Ar Xiv preprints , 1512.09080, 2015.
- 5[5] Emmanuel Abbe and Colin Sandon. Achieving the KS threshold in the general stochastic block model with linearized acyclic belief propagation. In Proc. Neural Information Processing Systems (NIPS) , pages 1334–1342, 2016.
- 6[6] Emmanuel Abbe and Colin Sandon. Crossing the KS threshold in the stochastic block model with information theory. In IEEE Intl. Symp. on Information Theory, ISIT , pages 840–844, 2016.
- 7[7] Dimitris Achlioptas and Cristopher Moore. The chromatic number of random regular graphs. In Proc. 8th International Workshop on Randomization and Computation (RANDOM) , pages 219–228, 2004.
- 8[8] Dimitris Achlioptas and Assaf Naor. The two possible values of the chromatic number of a random graph. Ann. Math. , 162:1335–1351, 2005.
