Replica Symmetry in Upper Tails of Mean-Field Hypergraphs
Somabha Mukherjee, Bhaswar B. Bhattacharya

TL;DR
This paper investigates the properties of the mean-field variational problem for the upper tail of edge counts in hypergraphs, showing a universal replica symmetric phase for regular hypergraphs and connecting to graphon limits.
Contribution
It establishes the existence of a universal replica symmetric phase in the variational problem for regular hypergraphs and links the problem to graphon limits in dense graphs.
Findings
Universal replica symmetric phase for regular hypergraphs
Variational problem minimized by constant functions
Connection to graphon limits in dense graphs
Abstract
Given a sequence of -uniform hypergraphs , denote by the number of edges in the random induced hypergraph obtained by including every vertex in independently with probability . Recent advances in the large deviations of low complexity non-linear functions of independent Bernoulli variables can be used to show that tail probabilities of are precisely approximated by the so-called 'mean-field' variational problem, under certain assumptions on the sequence . In this paper, we study properties of this variational problem for the upper tail of , assuming that the mean-field approximation holds. In particular, we show that the variational problem has a universal replica symmetric phase (where it is uniquely minimized by a constant function), for any sequence of regular -uniform hypergraphs, whichβ¦
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Replica Symmetry in Upper Tails of Mean-Field Hypergraphs
Somabha Mukherjee
Department of Statistics, University of Pennsylvania, Philadelphia, USA, [email protected]
Β andΒ
Bhaswar B. Bhattacharya
Department of Statistics, University of Pennsylvania, Philadelphia, USA, [email protected]
Abstract.
Given a sequence of -uniform hypergraphs , denote by the number of edges in the random induced hypergraph obtained by including every vertex in independently with probability . Recent advances in the large deviations of low complexity non-linear functions of independent Bernoulli variables can be used to show that tail probabilities of are precisely approximated by the so-called βmean-fieldβ variational problem, under certain assumptions on the sequence . In this paper, we study properties of this variational problem for the upper tail of , assuming that the mean-field approximation holds. In particular, we show that the variational problem has a universal replica symmetric phase (where it is uniquely minimized by a constant function), for any sequence of regular -uniform hypergraphs, which depends only on . We also analyze the associated variational problem for the related problem of estimating subgraph frequencies in a converging sequence of dense graphs. Here, the variational problems themselves have a limit which can be expressed in terms of the limiting graphon.
1. Introduction
Given a -uniform hypergraph with vertex set and hyperedge set (which is a collection of -element subsets of ) and fixed, construct a random sub-hypergraph of as follows: sample each vertex in with probability and consider the induced sub-hypergraph of on the set of sampled vertices. Denote by the number of edges in this random sub-hypergraph, which can be written, more formally, as
[TABLE]
where are i.i.d. . (Note that .) Numerous celebrated problems in combinatorial probability can be re-formulated in terms of (1.1), for some choice of .
- (1)
Subgraphs in ErdΕs-RΓ©nyi random graphs: The number of copies of a fixed graph in the ErdΕs-RΓ©nyi random graph can be formulated in terms of (1.1) as follows: Consider the hypergraph (to be referred to as the -counting hypergraph) with vertex set of as the edge set of the complete graph , and edge set of as the collection of the edge sets of all copies of in . Then is a -uniform regular hypergraph111A hypergraph is said to be -regular if every vertex is incident on exactly hyperedges in . and is precisely the number of copies of in .
- (2)
Arithmetic progressions in a random set: Given , define the -AP counting hypergraph as the hypergraph with vertex set and edge set the set of all term arithmetic progressions in . Then is the number of -term arithmetic progressions in a random subset of , where every element is included in the subset independently with probability .
- (3)
Estimating motif frequencies in large graphs: Efficiently counting motifs in a large graph such as the number of edges or triangles (more generally, subgraph counts) is an important statistical and computational problem [21]. One natural strategy to reduce storage and computational costs is to randomly sample subsets of vertices, where natural estimates of subgraph counts are often of the form (1.1) above (see Section 1.2 for details).
Concentration inequalities for , for a sequence of hypergraphs , are well-known (see [18, 27] and the references therein). In this paper, we study the precise large deviations upper tail probabilities of , in the fixed regime, which involves determining the exact asymptotics of
[TABLE]
Note that is a random multi-linear polynomial indexed by , and establishing its upper tail asymptotics falls in the framework of non-linear large deviations, introduced in the seminal paper of Chatterjee and Dembo [12]. Here, they studied the large deviations of a general random function , where and are i.i.d. , and came up with a notion of complexity of the gradient of the function (along with additional smoothness properties), under which the tail probabilities of (and the associated Gibbs measure on ) can be well-approximated by the so-called βmean-fieldβ variational problem, which is an entropic variational problem over the set of product measures on the hypercube . Thereafter, Eldan [17] obtained an improved set of conditions, which involved computing the Gaussian width of the gradient of the function, under which the above reduction holds. Similar results for Gibbs measures beyond the hypercube were obtained by [4, 28], and recently by Austin [2] for very general product spaces.
These results can be used to obtain various sufficient conditions on a sequence of hypergraphs for which the probability in (1.2) can be approximated by the corresponding mean-field variational problem. This motivates the following abstract definition:
Definition 1.1**.**
(Mean-field hypergraphs) Given a -uniform hypergraph and , the upper-tail mean field variational problem for is defined as:
[TABLE]
where
- β
,
- β
, is the relative entropy of with respect to ; and
- β
, for .
Moreover, a sequence of -uniform hypergraphs is said to be mean-field (for the upper tail problem) if
[TABLE]
for all .
The mean-field condition has been established for several natural hypergraph sequences. For example, it follows from results in the landmark paper of Chatterjee and Varadhan [14], that for any fixed graph , the -counting hypergraph defined above is mean-field. In this case, the associated variational problems themselves have a limit, which can be expressed as an optimization problem in the space of graphons (the continuum limit of graphs [22]). Lubetzky and Zhao [23] analyzed these variational problems, and identified the precise region of replica symmetry (set of all points where the optimization problem is uniquely minimized by the constant function ) for regular graphs . The validity of (1.4) for the number of arithmetic progressions in a random set, and properties of the associated variational problem was established in [8]. Recently, Dembo and Lubetzky [16] derived the large deviations for subgraph counts in the uniform random graph model (the uniform distribution over graphs with vertices and edges). Here, the variational problem in the rate function coincide with those studied in statistical physics in the context of constrained random graphs, where symmetry breaking configurations are often attained by block (βmulti-podalβ) graphons (see [19, 20, 25] and the references therein).
For a general hypergraph sequence the results in [12, 17] can be applied to obtain different sufficient conditions on for which the approximation 1.4 holds. For instance, BriΓ«t and Gopi [11] computed the Gaussian-width a general multilinear polynomial, which combined with [17, Theorem 5] gives the following sufficient condition for a sequence of -uniform hypergraphs to be mean-field:222The first condition in (1.5) controls the Gaussian width of the gradient of the function [11, Corollary 6.1], while the second condition controls the Lipschitz constant. To verify the mean-field condition using [17, Theorem 5], one also needs to check a few technical conditions related to the continuity of the variational problem, all of which can be easily verified when (1.5) holds.
[TABLE]
where and are the maximum degree and the maximum co-degree of , respectively.333For a -uniform hypergraph , the degree of a vertex (to be denoted by ) is the number of hyperedges in containing , and the maximum degree . Similarly, for , the co-degree of (denoted by ) is the number of hyperedges in containing both and , and the maximum co-degree . These assumptions are satisfied for a variety of hypergraphs, from dense -uniform hypergraphs (where ) to much sparser hypergraphs such as the -AP counting hypergraph (where . The condition in (1.5) can be significantly improved for graphs (2-uniform hypergraphs), where the mean-field condition has been extensively studied and well-understood [4, 17]. In this case, a sufficient condition for sequence of graphs to be mean-field is
[TABLE]
that is, the graph is not βtoo sparseβ (number of edges is much larger than the number of vertices) and not βtoo irregularβ (maximum degree is of the same order as the average degree). In the appendix (Proposition A.1), we show that
[TABLE]
is another simple sufficient condition for a sequence of -uniform hypergraphs to be mean-field for the upper tail problem. The conditions in (1.5) and (1.7) are, in general, incomparable. However, there are cases where (1.7) improves upon (1.5) (see Example 3 in Appendix A). Moreover, (1.7) recovers the condition for graphs (1.6) as a special case.
Remark 1.1**.**
In the approximation (1.4) one can often allow the sparsity parameter , to go to zero with , at appropriate rates. Determining the optimal dependence on the sparsity parameter for a specific sequence of hypergraphs is, in general, a challenging problem. For example, for upper tails of subgraphs in the ErdΕs-RΓ©nyi random graph , Chatterjee and Dembo [12, Theorem 1.2] established the validity of (1.4), using their notion of gradient complexity, for certain regimes of the sparsity parameter . The dependence on was later improved by Eldan [17], and, very recently, Cook and Dembo [15] and Augeri [1], independently and simultaneously, established (1.4) for cycle counts in , under almost optimal sparsity conditions. The associated variational problems in the sparse regime was precisely analyzed in [7, 24].
In this paper, we study asymptotic properties of the variational problem (1.3) for a sequence -uniform hypergraphs in the fixed regime, assuming the mean-field approximation (1.4) holds. The following is the summary of our results:
- (1)
In the fixed regime, it is notoriously difficult to solve the variational problem (1.3) explicitly for a general sequence of hypergraphs. Instead, one searches for the region of replica symmetry, that is, the set of for which the variational problem is uniquely minimized by the constant vector (recall ). We show in Theorem 1.1 that any sequence of regular -uniform hypergraphs has a (universal) region of replica symmetry, which depends only on , but not on the specific choice of the hypergraphs. Moreover, in a sense to be made precise below, the replica symmetry region we identify for regular -uniform hypergraphs is tight.
- (2)
We also analyze the variational problem arising in the motif frequency estimation problem, for a converging sequence of dense graphs (Section 1.2). In this case, the variational problems themselves have a limit which can be expressed, using the limiting graphon, as an optimization problem over the space of functions from (Theorem 1.2), giving the exact Bahadur slope [3] of the corresponding estimate.
1.1. Replica Symmetry for Regular Hypergraphs
We begin with the following theorem, which identifies the precise region of universal replica symmetry for regular -uniform hypergraphs. To this end, recall the definition of the mean-field variational problem from (1.3).
Theorem 1.1**.**
Fix . Then the following hold:
- (a)
(Replica Symmetry) Suppose is a sequence of regular -uniform hypergraphs. If the point lies on the convex minorant of the function , then . Moreover, if is a sequence of mean-field, regular -uniform hypergraphs, then
[TABLE] 2. (b)
(Replica Symmetry Breaking) If the point does not lie on the convex minorant of , then there exists a sequence of mean-field, regular -uniform hypergraphs (depending on and ) such that , and
[TABLE]
The proof of this theorem is given in Section 2. It shows that the variational problem (1.3) has a region of replica symmetry for any regular -uniform hypergraph, which is determined by the convex minorant of the function , and this region is tight over the class of all regular -uniform hypergraphs: For a regular -uniform hypergraph denote by
[TABLE]
the set of all points where replica symmetry is preserved. Then the theorem above can be re-stated as:
[TABLE]
where is the collection of all regular -uniform hypergraphs and is the set of all such that and lies on the convex minorant of the function .444Note that Theorem 1.1 does not construct a regular -uniform hypergraph for which , since the symmetry breaking construction depends on and . We are only able to obtain such a construction when is restricted to be on the concave part of the function (see Example 1 for details). On the other hand, in Example 2, we construct a sequence of irregular graphs which exhibit symmetry breaking everywhere, showing that the regularity assumption on the hypergraphs in Theorem 1.1 is necessary for obtaining a universal region of symmetry.
Remark 1.2**.**
For a specific sequence of hypergraphs the region of replica symmetry might be larger. For instance, if is the sequence of complete -uniform hypergraph on vertices, it is easy to check that replica symmetry is preserved for all . Another example is the replica symmetry region in the upper tails of subgraphs in the ErdΕs-Renyi random graph . To this end, given a fixed connected graph , recall the definition of the -counting hypergraph from above: the vertex set of is the edge set of the complete graph , and the edge set of is the collection of the edge sets of all copies of in . Then , the number of copies of in . Lubetzky and Zhao [23] studied the replica symmetry region in the upper tail variational problem for . It follows from their results that , where is the maximum degree of .555This is the exact replica symmetric region when is -regular, that is, ([23, Theorem 1.1]). On the other hand, Theorem 1.1 above shows that , since is a regular -uniform hypergraph. (Note that , unless , in which case is a star-graph and the two regions are the same.)
1.2. Subgraphs in Dense Graphs
In this section, we explore the application of the general framework introduced above, in counting subgraphs of vertex-percolated graphs. Given and a graph the vertex-percolated graph is the random induced subgraph ,666Given a graph and , denotes the induced subgraph of on the set . where is obtained by including every element of independently with probability . For a fixed graph , denote by the number of copies of the graph in . More formally,
[TABLE]
where:
- β
are i.i.d. and ,
- β
is the adjacency matrix of ,
- β
is the set of all -tuples with distinct indices.777For a set , the set denotes the -fold cartesian product . Thus, the cardinality of is ,
- β
is the automorphism group of , that is, the group of permutations of the vertex set such that if and only if .
Note that
[TABLE]
where is the number of copies of in .
Remark 1.3**.**
As mentioned before, the statistic (1.9) arises in the problem of estimating motif frequencies in large graphs [21]. Here, given a large graph , the goal is to efficiently (without storing or searching over the entire graph) estimate motif frequencies (subgraph counts) of , by making local queries on . Klusowski and Wu [21] proposed the subgraph sampling model, where one has access to the random induced subgraph , where is obtained by sampling each vertex in independently with probability . In this model, by (1.10) above, is an unbiased estimate of the subgraph count .
The large deviation tail probabilities for can be derived using the general theory described above. To see this, note that can be rewritten as (1.1) by defining the -uniform hypergraph which has vertex set and a hyperedge for every copy of in .888Technically this is a βmulti-hypergraphβ, because a subset of vertices might contain several copies of in . However, the general theory can be easily modified to include hypergraphs with multiple edges. In this section, we show that the large deviation variational problem for itself has a limit, when is a converging sequence of dense graphs. We begin with some preliminaries on graph limit theory. The formal statement of the result is given in Section 1.2.1.
1.2.1. Graph Limit Theory Preliminaries
The theory of graph limits developed by LovΓ‘sz and coauthors [22] has received phenomenal attention over the last few years. It connects various topics such as graph homomorphisms, Szemerediβs regularity lemma, and quasirandom graphs, and has found many interesting applications in statistical physics, extremal graph theory, statistics and related areas (see [5, 6, 13, 25, 26, 29, 30] and the references therein). Here we recall the basic definitions about the convergence of graph sequences. If and are two graphs, denote the homomorphism density of into by , where denotes the number of homomorphisms of into .
To define the continuous analogue of graphs, consider to be the space of all measurable functions from into that satisfy , for all . For a simple graph with , let
[TABLE]
Definition 1.2**.**
[9, 10, 22] A sequence of graphs is said to converge to , if for every finite simple graph ,
[TABLE]
The limit objects, that is, the elements of , are called graph limits or graphons. A finite simple graph can also be represented as a graphon in a natural way: Define , that is, partition into squares of side length , and let in the -th square if , and 0 otherwise. Observe that for every simple graph and therefore the constant sequence converges to the graph limit . It turns out that the notion of convergence in terms of subgraph densities outlined above can be suitably metrized using the so-called cut distance (see Section 3 for details).
1.2.2. Large Deviations of Subgraph Counts in Vertex-Percolated Dense Graphs
Consider a sequence of graphs converging to a graphon and a fixed graph . In this case, the limit of the upper tail large deviation probability for can be described as a variational problem in the space of graphons. To this end, given a measurable function , a fixed graph , and graphon , define
[TABLE]
(Note that when is the constant function 1, , as defined in (1.11).)
Theorem 1.2**.**
Suppose is a sequence of graphs converging to a graphon and is a fixed graph satisfying . Then, for ,
[TABLE]
with
[TABLE]
where the infimum is taken over the set of all measurable functions .
The proof of this result and various examples are given in Section 3. Note that the assumption implies that the limiting density of in is non-vanishing, and ensures, among other things, that the mean-field assumptions are satisfied. This also gives the exact Bahadur slope [3] of the unbiased estimate (recall Remark 1.3) for the subgraph count , in the subgraph-sampling model (see Remark 3.1 for details).
2. Proof of Theorem 1.1
Proof of Replica-Symmetry: The proof for the replica symmetry case relies on the following lemma.
Lemma 2.1**.**
For any -uniform -regular hypergraph , and , , where is as in Definition 1.1.
Proof.
Note that it suffices to show , for , since . To this end, define the function,
[TABLE]
This is a smooth function on the compact set , and hence the minimum of is attained at some point . Therefore, to prove the lemma, it suffices to show that .
To show this, let . Then, , for all . This implies,
[TABLE]
Hence, using (2.1),
[TABLE]
where the last step uses , for all . β
To show replica symmetry, suppose lies on the convex minorant the function . Let denote the convex minorant of the function . Note that is increasing on the interval , and on . Hence, for such that ,
[TABLE]
The proof of replica symmetry now follows, since there is equality everywhere in (2.2), if , for all .
Proof of Replica Symmetry Breaking: Here, suppose does not lie on the convex minorant of the function . Then there exist such that the point lies strictly above the line segment joining and , that is, there exists such that
[TABLE]
By continuity, for all in a neighborhood of , which means, there exists such that
[TABLE]
Therefore, choose large enough so that , and
[TABLE]
where (with ). (Note, this choice of depends on and .)
Now, let be the disjoint union of complete -uniform hypergraphs on vertices each. This is a dense hypergraph, that is , and, hence, satisfies assumptions (1.5), which implies that is a sequence of mean-field, regular -uniform hypergraphs. Assuming that the vertices of are labelled , define as
[TABLE]
where . Then using , , and as in Definition 1.1,
[TABLE]
On the other hand, the entropy satisfies,
[TABLE]
which completes the proof of Theorem 1.1.
Note that in the construction above the hypergraphs which break symmetry at , depend on and . Whether it is possible to obtain a sequence of regular, mean-field -uniform hypergraphs not depending on and , which breaks symmetry whenever does not lie on the convex minorant of , remains open. In the example below, we construct a sequence of regular, mean-field -uniform hypergraphs (not depending on and ) which breaks symmetry whenever the point lies on the concave part of .999If , the function is convex if and only if . On the other hand, if , then the function has exactly two inflection points (both to the right of ), with a region of concavity in the middle (see [23, Lemma A.1] for details).
Example 1**.**
Assume that the point lies on the strictly concave part of the curve of . Hence, there exist two points in a neighborhood of such that
[TABLE]
Let be even, and let be the disjoint union of two complete -uniform hypergraphs with vertices each, and vertices labelled labelled and , respectively. Define as: and . It is easy to check that belongs to the constraint space of (1.3), and by (2.3), , which shows replica-symmetry-breaking.
3. Proof of Theorem 1.2
We begin with a few definitions from graph limit theory. The notion of graph convergence in terms of subgraph densities in Definition 1.2 can be metrized using the cut-metric, which we recall below:
Definition 3.1**.**
[22] The cut-distance between 2 graphons , is defined as
[TABLE]
The cut-metric between 2 graphons , is defined as
[TABLE]
with the infimum taken over all measure-preserving bijections , and , for .
Hereafter, we assume that is a sequence of graphs converging to a graphon , which is equivalent to [9, Theorem 3.8]. Now, for a fixed graph , define the multi-hypergraph with vertex set and a hyperedge for every copy of in . The assumption implies that , and
[TABLE]
that is, the sequence satisfies assumption (1.5). Therefore, is mean-field, and by Definition 1.1, for every :
[TABLE]
where
[TABLE]
with
- β
and , and
- β
The argument above shows that to prove Theorem 1.2, it suffices to prove that the limit of (3.2) equals (1.15). To this end, suppose is a sequence of graphs such that . Then it follows from [9, Lemma 5.3] that there exists a permutation such that , where is a graph obtained by relabelling the vertices of by . Moreover, the variational problem (3.2) is invariant under vertex permutations, that is, . This implies, to derive the limit of (3.2), we can, without loss of generality, assume .
We begin with the following lemma, which follows by a telescoping argument identical to the proof of [9, Theorem 3.7]. We omit the details.
Lemma 3.1**.**
Let be a sequence of graphs such that . Then, for any fixed graph ,
[TABLE]
where the supremum runs over all measurable functions .
Given a function and a graphon , recall the definition of from (1.13). Also, denote by the class of all functions , which are constant on the intervals , for every . Then, recalling (3.2),
[TABLE]
where
[TABLE]
and , which is , when is the constant function . The adjustment by is required to move from the sum over all indices in to the sum over distinct indices in (recall (3.2)). (Note that if is a clique.) However, the asymptotic contribution of this adjustment is small:
[TABLE]
Lemma 3.2**.**
If is a sequence of graphs such that , then for every fixed graph and ,
[TABLE]
where .
Proof.
Fix . For each , choose such that
[TABLE]
By Lemma 3.1, and , as . Then there exists such that, by (3.4),
[TABLE]
for all large . Now, define . Then
[TABLE]
by choosing small enough so that . This implies, recalling (3) and (3.6),
[TABLE]
where . Note that , as , and by arguments similar to the proof of [12, Lemma 5.8], it follows that
[TABLE]
where the -term depends on , and , but not on , and goes to zero as . This implies . The other direction can be proved similarly. β
Next, let denote the class of all measurable functions , such that is continuous at every irrational point in . Then define
[TABLE]
Note that , for each , which gives Now, fixing , choose such that
[TABLE]
For , define
[TABLE]
Clearly, , and hence, , for all . Moreover, since ,
[TABLE]
Now, since for every irrational , and is a bounded, continuous function on , by the dominated convergence theorem. Hence, by taking limits in (3.9) and using (3.8), we have . Finally, since is arbitrary, by (3.5), we get
[TABLE]
Therefore, to complete the proof of Theorem 1.2, it suffices to show (recall (1.15)). Clearly, . For the other direction, let and take a measurable function such that
[TABLE]
By standard measure theoretic arguments, there exists a continuous function such that: and
[TABLE]
where the last step uses (3.11). Hence, defining , gives
[TABLE]
where the last step uses (3.11). Finally, since , as , by arguments similar to the proof of [12, Lemma 5.8],
[TABLE]
where the -term goes to zero as . This combined with (3.12) shows that , which completes the proof of Theorem 1.2.
Remark 3.1**.**
By arguments similar to the proof of Theorem 1.2, an analogous variational problem can be established for the two-sided tail probability. More precisely, if is a sequence of graphs converging in cut-metric to a graphon , and is a fixed graph satisfying , then for any fixed , it follows that
[TABLE]
where
[TABLE]
is the exact Bahadur slope [3] of , the estimate of in the subgraph sampling model described in Remark 1.3.
The variational problem in (1.15) reduces to a finite dimensional optimization problem, if the limiting graphon is a block function (constant on finitely many blocks). These functions are dense in the space of graphons and arise naturally as limits of stochastic block models.
Remark 3.2**.**
(Block Graphons) Suppose is a sequence of graphs converging in cut-metric to the following -block graphon:
[TABLE]
where is a non-zero symmetric matrix with entries in , and form a measurable partition of , with , for all (here denotes the Lebesgue measure on ). Then, for any fixed graph , the homomorphism density
[TABLE]
Further, for , define
[TABLE]
Then the RHS of (1.15) becomes,
[TABLE]
which is a finite dimensional optimization problem with variables. To see (3.13), note that for any in the constraint space of (1.15), the vector defined by
[TABLE]
belongs to the constraint space of (3.13). Then by the convexity of ,
[TABLE]
which shows that (3.13) is at most the RHS of (1.15). Conversely, for some in the constraint space of (3.13), the function belongs to the constraint space of (1.15) and . Hence, the RHS of (1.15) is equal to (3.13).
We conclude with an example of a sequence of non-regular graphs which exhibits replica symmetry breaking, for all .
Example 2**.**
(Bipartite Graphs) Consider the sequence of complete bipartite graphs , with . In this case, the limiting graphon is a two-block function with block sizes and , with [math] on the diagonal blocks and 1 on the off-diagonal blocks. Then by Theorem 1.2, for ,
[TABLE]
In this case, is 2-block, hence, by Remark 3.2,
[TABLE]
- β’
: Then any in the constraint space of (3.14) satisfies , and by the convexity of and the fact that it is increasing on the interval , it follows that , showing replica symmetry, for all values of .
- β’
: In this case, the graph sequence is irregular. Note that (3.14) equals the minimum of the function:
[TABLE]
Note that is differentiable on and , showing replica-symmetry-breaking for all values of .
Appendix A A Simple Mean-Field Condition
In this section we derive a simple sufficient condition for a sequence of -uniform hypergraphs to be mean-field for the upper tail problem (recall Definition 1.1), using the framework of non-linear large deviations developed in [17]. To this end, we need a few definitions: The Lipschitz constant of a function , is defined as:
[TABLE]
where
[TABLE]
denotes the discrete partial derivative of with respect to the -th coordinate. The Gaussian-width of a set is defined as:
[TABLE]
where follows the standard dimensional Gaussian distribution . Finally, the gradient-complexity of is defined as:
[TABLE]
where .
For , define the function:
[TABLE]
where is as in Definition 1.1. It follows from [17, Theorem 5] that a sequence of -uniform hypergraph is mean-field for the upper tail problem if
[TABLE]
In [11, Corollary 6.1] it was proved that the above conditions are satisfied whenever (1.5) holds. The first condition in (1.5) ensures , while the second condition implies . In the following proposition, we derive another easy sufficient condition for (A.2) to hold, in terms of the number of hyperdges in . As a special case, this recovers the mean-field condition for graphs (1.6).
Proposition A.1**.**
A sequence of -uniform hypergraphs is mean-field for the upper tail problem if
[TABLE]
where denotes the maximum degree of the hypergraph .
Proof.
Recall the definition of from (A.1). Note that,
[TABLE]
The second condition in (A.3) then implies that .
Therefore, it suffices to show (recall (A.2)). Note that for a standard dimensional Gaussian vector and ,
[TABLE]
where . By the Cauchy-Schwarz inequality,
[TABLE]
where is the column vector of length , having entries indexed by subsets of having size . Note that , where is the matrix with entries , where varies over the set of all subsets of having size , and . Hence,
[TABLE]
by the first condition in (A.3). β
In general, the conditions in Proposition A.1 and those in (1.5) are incomparable. The maximum co-degree condition in (1.5) holds for many sparse hypergraphs, such as the -AP counting hypergraph. On the other hand, Proposition A.1 requires very high edge-density, but allows for larger co-degrees, as illustrated in the example below. To this end, for a finite set and a positive integer , denote by the set of all subsets of with cardinality .
Example 3**.**
Fix , and take . For , suppose
[TABLE]
and let be the hypergraph with vertex set and hyperedge set . Note that
[TABLE]
is a collection of disjoint copies of the complete -uniform hypergraph on vertices. Next, define the hypergraph with vertex set and hyperedge set
[TABLE]
the collection of all -element subsets of containing and . Finally, let be the sequence of hypergraphs obtained by taking the disjoint union of
[TABLE]
Note that . Moreover,
[TABLE]
and . Now, it is easy to check that condition (A.3) is satisfied. On the other hand, using ,
[TABLE]
showing that the first condition in (1.5) is not satisfied.
For the case (which corresponds to graphs), (1.5) always implies (A.3). To see this note that for any sequence of non-empty graphs , . Therefore, the first condition in (1.5) is equivalent to , which implies , the first condition in (A.3). For an example where (A.3) holds, but (1.5) does not, consider a sequence of -regular graphs on vertices, with . In this case, condition (A.3) is trivially satisfied (note that ), but the first condition in (1.5) does not hold, since
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] F. Augeri. Nonlinear large deviation bounds with applications to traces of Wigner matrices and cycles counts in ErdΕs-RΓ©nyi graphs. ar Xiv:1810.01558 , 2018.
- 2[2] T. Austin. The structure of low-complexity Gibbs measures on product spaces. ar Xiv preprint ar Xiv:1810.07278 , 2018.
- 3[3] R. R. Bahadur. Rates of convergence of estimates and test statistics. The Annals of Mathematical Statistics , 38(2):303β324, 1967.
- 4[4] A. Basak and S. Mukherjee. Universality of the mean-field for the Potts model. Probability Theory and Related Fields , 168(3-4):557β600, 2017.
- 5[5] S. Bhamidi, G. Bresler, and A. Sly. Mixing time of exponential random graphs. Ann. Appl. Probab. , 21(6):2146β2170, 2011.
- 6[6] S. Bhamidi, S. Chakraborty, S. Cranmer, and B. Desmarais. Weighted exponential random graph models: Scope and large network limits. Journal of Statistical Physics , 173(3-4):704β735, 2018.
- 7[7] B. B. Bhattacharya, S. Ganguly, E. Lubetzky, and Y. Zhao. Upper tails and independence polynomials in random graphs. Advances in Mathematics , 319(313β347), 2017.
- 8[8] B. B. Bhattacharya, S. Ganguly, X. Shao, and Y. Zhao. Upper tails for arithmetic progressions in a random set. International Mathematics Research Notices , to appear, 2018.
