Evolution of high-order connected components in random hypergraphs
Oliver Cooley, Mihyun Kang, and Christoph Koch

TL;DR
This paper studies the emergence and properties of high-order connected components in random hypergraphs, identifying thresholds and behaviors similar to classical graph connectivity but in a hypergraph setting.
Contribution
It extends classical connectivity results to high-order connectivity in hypergraphs, providing thresholds, component sizes, and hitting time results.
Findings
Determines the asymptotic size of the giant component after its emergence.
Establishes the threshold for $j$-connectivity in random hypergraphs.
Proves the hypergraph becomes $j$-connected when the last isolated $j$-set disappears.
Abstract
We consider high-order connectivity in -uniform hypergraphs defined as follows: Two -sets are -connected if there is a walk of edges between them such that two consecutive edges intersect in at least vertices. We describe the evolution of -connected components in the -uniform binomial random hypergraph . In particular, we determine the asymptotic size of the giant component shortly after its emergence and establish the threshold at which the becomes -connected with high probability. We also obtain a hitting time result for the related random hypergraph process -- the hypergraph becomes -connected exactly at the moment when the last isolated -set disappears. This generalises well-known results for graphs and vertex-connectivity in hypergraphs.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Evolution of high-order connected components in random hypergraphs
Oliver Cooley, Mihyun Kang and Christoph Koch
{cooley,kang,ckoch}@math.tugraz.at
Institute of Optimization and Discrete Mathematics, Graz University of Technology, 8010 Graz, Austria
Abstract.
We consider high-order connectivity in -uniform hypergraphs defined as follows: Two -sets are -connected if there is a walk of edges between them such that two consecutive edges intersect in at least vertices. We describe the evolution of -connected components in the -uniform binomial random hypergraph . In particular, we determine the asymptotic size of the giant component shortly after its emergence and establish the threshold at which the becomes -connected with high probability. We also obtain a hitting time result for the related random hypergraph process – the hypergraph becomes -connected exactly at the moment when the last isolated -set disappears. This generalises well-known results for graphs and vertex-connectivity in hypergraphs.
The authors are supported by Austrian Science Fund (FWF): P26826, W1230.
DOI: http://dx.doi.org/10.1016/j.endm.2015.06.077. Copyright ©2015. This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/
1. Evolution of random graphs
The theory of random graphs was founded in the late 1950s by Erdős and Rényi describing the evolution of the random graph process The vertex set of this process is and initially there are no edges present. In each step of the process, add an edge between a pair of vertices chosen uniformly at random amongst all pairs of vertices that do not already form an edge. In the early stages of this process, all connected components are small and then, within very short time, they merge into a single component of linear size – the giant component. This remarkable phenomenon, first proved in [8], is known as the phase transition of the random graph process .
It is often more convenient to analyse the binomial random graph : The vertex set is and every pair of vertices is connected by an edge with probability independently. Incorporating various strengthenings the phase transition can be summarised as follows. (All asymptotic statements are with respect to and by whp we abbreviate “with probability ”.)
Theorem 1** (Bollobás [2]; Łuczak [11]).**
Let be a real function satisfying and
- (a)
If , then whp all components in have size at most ; 2. (b)
If , then whp the size of the largest component in is , while all other components have size at most .
As we continue to add edges one by one, more and more components are consumed by the giant component and eventually the graph becomes connected. In fact, Bollobás and Thomason [4] showed that this happens precisely at the moment when the last isolated vertex disappears – thereby relating a global graph property to its minimal local obstruction. Denote the hitting time of connectivity by , i.e. is the minimal such that is connected, and the hitting time for the disappearance of the last isolated vertex by .
Theorem 2** (Bollobás and Thomason [4]).**
Whp in we have .
2. Evolution of random hypergraphs – Main results
Given an integer a -uniform hypergraph consists of a set of vertices and a set of edges, where each edge contains precisely vertices. In particular, -uniform hypergraphs are simply graphs. Given an integer we say that two -sets (sets of distinct vertices) and are -connected if there is a sequence of edges such that and for all A -component is a maximal set of pairwise -connected -sets. The hypergraph is -connected if every two -sets are -connected. A -set is called isolated if it is not contained in any edge. Note that connectivity in graphs corresponds to the case and .
We consider the -uniform random hypergraph process : The vertex set is and initially there are no edges present. In each step of the process, we add an edge for a -set chosen uniformly at random from all -sets that do not already form an edge. Instead of analysing this process directly we usually consider the -uniform binomial random hypergraph with vertex set , where every -set is an edge with probability independently. This way there are no dependencies between different edges. It is well-known that both models are very similar and results can be easily transferred from one to the other using standard techniques (e.g. [9]).
For any the case of vertex-connectivity () is well-studied and results analogous to Theorem 1 were obtained in [1, 3, 10, 13]. Theorem 2 has also recently been extended for vertex-connectivity in -uniform hypergraphs [12].
However, for high-order connectivity () not much was known until recently. This is due to the fact that vertex-connectivity can usually be studied with tools which are very similar to those used for graphs. By contrast, analysing high-order connectivity is often significantly more sophisticated and thus these methods are usually not sufficient. Recently Cooley, Kang, and Person [5] showed that the sharp threshold for the emergence of the giant component in is
[TABLE]
We strengthen this result and provide the asymptotic size of the largest -component in the weakly supercritical regime of the phase transition.
Theorem 3**.**
Let and be integers. Let be a real function satisfying , and , for some constant . Then whp the size of the largest -component in satisfies
[TABLE]
while all other -components contain -sets.
The proof of Theorem 3 fundamentally uses a powerful tool, the smooth boundary lemma, which provides insight into the structure of the unique largest -component in the supercritical regime. The precise statement of this lemma requires a large amount of additional notation and thus we omit it and refer to [6] for the details. Instead we explain the notion of ‘smooth sets’ on a more intuitive level in Section 3.1.
An elegant application of the notion of smoothness arises when studying the threshold for -connectivity in The key idea is that the giant component contains a smooth set of large size.
Lemma 4**.**
Let and be integers. Let be a real function satisfying and . Then whp the unique largest -component in contains a subset of at least -sets with the following property for all integers :
Every -set is contained in -sets of .
Based on Lemma 4 we provide an elementary proof showing that the hypergraph process becomes -connected exactly at the moment when the last isolated -set disappears. Let be the hitting time for -connectivity in the random hypergraph process and let denote the time-step of in which the last isolated -set disappears.
Theorem 5**.**
Let and be integers. Then whp in we have .
It follows that, in the binomial random hypergraph , the properties of being -connected and having no isolated -sets share the common sharp threshold
[TABLE]
In fact we obtain a slightly stronger result.
Theorem 6**.**
Let and be integers. Let be a real function satisfying and .
- •
If , then whp contains isolated -sets (and is therefore not -connected);
- •
If , then whp is -connected (and therefore contains no isolated -sets).
3. Proof outlines
3.1. Smooth sets in large components
The intuition behind the smooth boundary lemma is the following: For an arbitrary -set , we explore its component via a breadth-first search process, i.e. generation by generation. If the component of happens to be large, the sizes of the generations have a tendency to grow and thus most generations should have a ‘reasonable’ size already early on in the process. However, once the generations are not too small, random fluctuations should start to even themselves out over time. Thus generations should begin to look ‘smooth’ in the sense that any set of at most vertices is contained in approximately the ‘right’ number of -sets of any smooth generation. Usually we will only run the exploration process for as long as necessary, resulting in a partial component. If so, the last generation that is discovered before the process was stopped is called the boundary and will be of special interest in the proof of Theorem 3, since it contains all the -sets that are still active. This will be discussed further in Section 3.2.
In other scenarios, the boundary does not play such a crucial role or may not be large enough on its own, in which case we may consider the smooth set given by the union of all smooth generations in the partial component. It turns out that this union contains almost all -sets of the partial component. As an immediate consequence we obtain Lemma 4. The details can be found in [6, 7].
3.2. Emergence of the giant component
First we study the number of -sets in ‘large’ components. For a given -set we explore its component via a breadth-first search process and approximate this process by a supercritical branching process. In order to control the second moment we have to study two exploration processes and make sure that one of them being large does not increase the probability of the other becoming large too much. We first run one exploration process, stop it as soon as we know whp that it will grow large, and then consider a subprocess of the second exploration process in which no -sets containing a -set from the first (partial) component is present. The smooth boundary lemma ensures that the resulting process is still close to a (sufficiently) supercritical branching process. Therefore the number of -sets in large components is concentrated around its expectation and Theorem 3 follows by a sprinkling argument. The details can be found in [6].
3.3. Hitting times and connectivity threshold
It is convenient to split the proof of Theorem 5 into two parts: First we analyse the structure of the binomial random hypergraph in the regime where the expected number of isolated -sets begins to vanish. In particular, we show that whp in this regime the random hypergraph consists only of one non-trivial component and isolated -sets. Then we transfer this structure to the corresponding time-range in the random hypergraph process by classical contiguity results.
For integer-valued random variables and , we say converges in distribution to , denoted by , if we have for every integer . We apply the Chen-Stein method for Poisson-approximation to the number of -sets of degree (i.e. -sets that are contained in precisely edges).
Theorem 7**.**
Suppose for some real function . Then we have, for any integer
[TABLE]
Set and and note that by Theorem 7 with we know that whp contains isolated -sets, while whp does not. Next set and note that by Lemma 4 whp the unique largest component in already contains a smooth subset which contains every -set at least times.
For any we may construct in a two-round exposure such that , where , since . In particular this means that we may assume that and the smooth set are contained in . A first moment argument shows that whp does not contain any non-trivial components consisting of at most -sets. However, for much larger components this argument would not be sufficient, since the number of non-isomorphic hypergraphs consisting of a given number of -sets grows too rapidly. This is where the smooth set comes into play. It allows us to give an upper bound for the expected number of large components that are not connected to in by calculating the expected number of connected subsets of size approximately , which are not connected to in . Since is smooth the number of forbidden edges can be bounded uniformly. It follows that and hence, by Markov’s inequality, whp only the component containing is non-trivial. It remains to observe that the error-bounds in all these statements are strong enough to transfer them to and apply a union bound over time-steps.
Theorem 6 is an immediate corollary of Theorem 5 and Theorem 7. The details can be found in [7].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Michael Behrisch, Amin Coja-Oghlan and Mihyun Kang “The order of the giant component of random hypergraphs” In Random Struct. & Alg. 36.2 , 2010, pp. 149–184 DOI: 10.1002/rsa.20282 · doi ↗
- 2[2] Béla Bollobás “The evolution of random graphs” In Trans. Amer. Math. Soc. 286.1 , 1984, pp. 257–274 DOI: 10.2307/1999405 · doi ↗
- 3[3] Béla Bollobás and Oliver Riordan “Asymptotic normality of the size of the giant component in a random hypergraph” In Random Struct. & Alg. 41.4 , 2012, pp. 441–450 DOI: 10.1002/rsa.20456 · doi ↗
- 4[4] Béla Bollobás and Andrew Thomason “Random graphs of small order” In Random graphs ’83 (Poznań, 1983) 118 , North-Holland Math. Stud. North-Holland, Amsterdam, 1985, pp. 47–97
- 5[5] O. Cooley, M. Kang and Y. Person “Largest components in random hypergraphs” In Preprint , 2014 ar Xiv: 1412.6366
- 6[6] Oliver Cooley, Mihyun Kang and Christoph Koch “The size of the giant component in random hypergraphs” In Submitted , 2015, pp. 48 pages ar Xiv: 1501.07835
- 7[7] Oliver Cooley, Mihyun Kang and Christoph Koch “Threshold and hitting time for high-order connectedness in random hypergraphs” In Electron. J. Combin. 23.2 , 2016, pp. P 48
- 8[8] Paul Erdős and Alfréd Rényi “On the evolution of random graphs” In Magyar Tud. Akad. Mat. Kutató Int. Közl. 5 , 1960, pp. 17–61
