Asymptotic normality in random graphs with given vertex degrees
Svante Janson

TL;DR
This paper proves that the number of specific tree components in random graphs with a fixed degree sequence follows a normal distribution asymptotically, using the method of moments and cumulant estimates.
Contribution
It establishes asymptotic normality for component counts in both multigraphs and simple graphs with given degrees, extending previous results and providing new proofs.
Findings
Asymptotic normality of component counts proven for both multigraphs and simple graphs.
New cumulant estimates for weakly dependent variables.
Extended results on the size of the giant component in random graphs.
Abstract
We consider random graphs with a given degree sequence and show, under weak technical conditions, asymptotic normality of the number of components isomorphic to a given tree, first for the random multigraph given by the configuration model and then, by a conditioning argument, for the simple uniform random graph with the given degree sequence. Such conditioning is standard for convergence in probability, but much less straightforward for convergence in distribution as here. The proof uses the method of moments, and is based on a new estimate of mixed cumulants in a case of weakly dependent variables. The result on small components is applied to give a new proof of a recent result by Barbour and R\"ollin on asymptotic normality of the size of the giant component in the random multigraph; moreover, we extend this to the random simple graph.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Asymptotic normality in random graphs with given vertex degrees
Svante Janson
Department of Mathematics, Uppsala University, PO Box 480, SE-751 06 Uppsala, Sweden
[email protected] http://www.math.uu.se/svante-janson
(Date: 19 December, 2018; revised 30 January, 2019)
Abstract.
We consider random graphs with a given degree sequence and show, under weak technical conditions, asymptotic normality of the number of components isomorphic to a given tree, first for the random multigraph given by the configuration model and then, by a conditioning argument, for the simple uniform random graph with the given degree sequence. Such conditioning is standard for convergence in probability, but much less straightforward for convergence in distribution as here. The proof uses the method of moments, and is based on a new estimate of mixed cumulants in a case of weakly dependent variables.
The result on small components is applied to give a new proof of a recent result by Barbour and Röllin on asymptotic normality of the size of the giant component in the random multigraph; moreover, we extend this to the random simple graph.
2010 Mathematics Subject Classification:
05C80, 60C05; 60F05
Partly supported by the Knut and Alice Wallenberg Foundation
1. Introduction
Let be a random (simple) graph with labelled vertices and a given degree sequence \mathbf{d}=\bigl{(}d_{1},\dots,d_{n}\bigr{)}, chosen uniformly at random among all such graphs. (We assume tacitly that is such that some such graph exists.) We will denote the vertices by ; thus has by definition degree .
The standard way to constuct a random graph is by the configuration model, which was introduced by Bollobás [11]. As is well-known, this method constructs first a random multigraph, which we denote by , and then obtains by conditioning on the event that is simple; see Section 4.
We are, as most papers in this field, interested in asymptotic results as , where the degree sequence depends on and satisfies suitable conditions. The standard method is to first prove results for the random multigraph and then obtain corresponding results for by conditioning as above. In the present paper, we make the common assumption that the (asymptotic) degree distribution has a finite second moment; see Section 2 for precise assumptions. Then, it is well-known that \operatorname{\mathbb{P}{}}\bigl{(}G^{*}(n,\mathbf{d}_{n})\text{ is simple}\bigr{)}\geqslant c for some (at least for large ), see Remark 2.3, and as a consequence, any property that holds w.h.p. (i.e., with probability tending to 1) for holds w.h.p. also after conditioning on being simple. Hence, the transfer of such results from the multigraph to the simple graph is trivial. (Cf. Bollobás and Riordan [13], where transfer is made possible by far from trivial arguments also when \operatorname{\mathbb{P}{}}\bigl{(}G^{*}(n,\mathbf{d}_{n})\text{ is simple}\bigr{)}\to 0.)
However, it has repeatedly been remarked that this simple method fails for distributional results, for example that some random variable defined by the graph is asymptotically normal, since probabilities like may be changed by the conditioning. Hence, although in many cases it seems intuitively clear that a few loops or multiple edges should not affect the asymptotic results, and we still expect the same asymptotic distribution for as for , it is typically difficult to prove this rigorously. We know only two papers where this has been done for asymptotic normality of some variables, in both cases by first proving a result for and then showing that the proof can be modified to work for : Janson and Luczak [24] showing asymptotic normality of the size of the -core (using a rather complicated extra argument for ) and Riordan [36] showing asymptotic normality of the size of the giant component in the weakly supercritical case (using a simple extra argument for noting that the proof only uses local explorations involving vertices).
The purpose of present paper is to do this in another case. Barbour and Röllin [8] recently proved, for , a theorem on asymptotic normality of a class of “local” statistics that include, for example, the number of small components of a given type. They further used this to show asymptotic normality of the size of the giant component in the supercritical case. Our main results show that the same results on small components and the size of the giant hold for the random simple graph . (We also weaken somewhat the technical conditions for these results in [8].) Precise statements are given in Section 3 below.
We achieve these results by the time-honoured method of moments. (Barbour and Röllin [8] use Stein’s method.) Using the method of moments, we show joint convergence of, e.g., the number of components of a given type and the numbers of loops and pairs of parallel edges in ; we may then obtain the result for by conditioning on the latter numbers being 0. In order to do this, we thus show convergence of mixed moments. Calculations of means and (co)variances are rather straightforward, and the central part of the proof is to obtain bounds on higher-order cumulants. This is similar to results by Féray [17, 18].
To describe the idea, consider as a simple case the covariance of two indicators, each indicating that a particular set of vertices (and half-edges) form a copy of a given graph . If the two sets of vertices are disjoint, then and are only weakly dependent; we exploit this by constructing a modification of that is independent of , and such that and . This implies \operatorname{Cov}(I_{1},I_{2})=\operatorname{\mathbb{E}{}}(I_{1}I_{2})-\operatorname{\mathbb{E}{}}(I_{1}I_{2}^{\prime})=\operatorname{\mathbb{E}{}}\bigl{(}I_{1}(I_{2}-I_{2}^{\prime})\bigr{)}=O\bigl{(}\operatorname{\mathbb{E}{}}I_{1}\operatorname{\mathbb{E}{}}I_{2}/N\bigr{)}. The general case is an extension of this, although the details are quite technical, see Section 6. (The idea to construct a suitable independent modification is used also in the Stein coupling constructed by [8], although their modification is both constructed and used differently from our modifications.)
One feature of the construction that may be of independent interest is that we in the proof use the bipartite version of the configuration model to construct a random bipartite graph with the vertices of as vertices on one side, and the edges of as vertices on the other side; equivalently, we construct first the random bipartite graph obtained by bisecting each edge in ; see Section 4.
Remark 1.1**.**
As remarked by Barbour and Röllin [8], there are not many papers at all proving asymptotic normality for statistics of the multigraph ; apart from [24], [36] and [8] just mentioned, we know of Angel, van der Hofstad and Holmgren [1] (number of loops and multiple edges when the second moment of the degree distribution is infinite; this is obviously not relevant for simple graphs), KhudaBukhsh, Woroszylo, Rempała and Koeppl [30] (an epidemic on the graph), Ball [4] (a more general epidemic model, and the giant component in site or bond percolation), and Athreya and Yogeshwaran [3] (certain statistics in a subcritical case).
Remark 1.2**.**
The method of moments is a very old method. Applications of it are typically messy and lead to long calculations using combinatorial estimates of multiple sums, while other methods may give shorter and more elegant proofs. Nevertheless, it is a powerful method that often works in combinatorial problems. I have seen several cases where results first have been proved by the method of moments and later reproved using other methods. It seems likely that the results here will be another example of this in the future. For example, perhaps a combination of Stein’s method for normal approximation and Stein–Chen’s method for Poisson approximation might be used instead of the method of moments to show the joint convergence used in our proofs below.
2. Assumptions and notation
2.1. Some notation
. . is the set of all permutations of .
is the descending factorial. Similarly,
[TABLE]
. We interpret and .
Unspecified limits are as ; w.h.p. (with high probability) means with probability tending to 1 as . and denote convergence in distribution and probability, respectively. and denote positive constants that may be different at each occurrence.
If is a (multi)graph, we let denote its vertex set and its edge set; furthermore, and are the numbers of vertices and edges of , and is its number of components. The number of vertices of degree is denoted by . For convenience, we often write for and for .
The degree of a vertex is denoted . (A loop in a multigraph contributes 2 to the degree of its endpoint.) Thus .
The component containing a vertex is denoted . It may be regarded as a rooted multigraph, i.e., a multigraph with a distinguished vertex, viz. . We denote the components of arranged in decreasing order by (with ties resolved by any fixed rule, e.g. lexicographically).
If is a random variable and an event, then \operatorname{\mathbb{E}{}}(X;\,\mathcal{E}):=\operatorname{\mathbb{E}{}}\bigl{(}X\boldsymbol{1}\{\mathcal{E}\}\bigr{)}.
We may ignore obvious roundings and write e.g. or when we mean the nearest larger or smaller integer.
2.2. Basic assumptions
As in the introduction, we assume that for each , we are given a degree sequence . Thus depends on , as do many other quantities introduced below, but often we omit from the notation for convenience. Also as above, is the random simple graph with degree sequence , and is the random multigraph with degree sequence given by the configuration model. (We assume tacitly that is such that a graph with these vertex degrees exists; in particular, is even.) Let
[TABLE]
Thus the random graph has vertices and edges.
Let , the number of vertices of degree in . Further, let denote the degree of a uniformly random vertex, i.e., is a random variable with the distribution
[TABLE]
We will always assume the following.
- (A1)
, the degree of a randomly chosen vertex, converges in distribution to a random variable with a finite and positive mean . In other words, there exists a probability distribution such that
[TABLE]
and . 2. (A2)
. Assuming (A1), this is equivalent to being uniformly integrable.
(See e.g. [19, Theorem 5.5.9] for the equivalence with uniform integrability.) Sometimes we will also use one or several of the following assumptions.
- (A3)
. Assuming (A1), this is equivalent to being uniformly integrable. (This will always be assumed when studying .) 2. (A4)
for every . This implies uniform integrability of every powers , and thus, assuming (A1), for every ; in particular (A2) and (A3). (We use this strong condition only in a few results.) 3. (A5)
. This is the supercritical case, when w.h.p. has a giant component of order , see Molloy and Reed [32, 33] with refinements in, e.g., [25], [13], [28]. 4. (A6)
and . This excludes some less interesting cases; see Remark 3.5.
The degree sequences will be fixed throughout the paper, and unspecified constants may depend on them through (explicit or implicit) constants in these assumptions, for example the distribution .
Note that by (2.2), . Hence, (A2) is equivalent to
[TABLE]
In particular, since by (A1), . Hence, in estimates with unspecified constants, it does not matter whether we use e.g. or .
Remark 2.1**.**
We assume that has vertices. This is customary, but as always just for notational convenience. See [13], where the results are formulated with the given allowed to have arbitrary lengths .
Remark 2.2**.**
One common version of the configuration model uses random vertex degrees that are i.i.d. copies of a random variable (e.g. ignoring one half-edge when the sum of degrees is odd). Then, (A1) holds a.s., and thus (with suitable assumptions on ), the results below apply conditioned on the vertex degrees. However, unconditioned results are somewhat different, see Section 12.
Let . The uniform integrability of in (A3) means that for any (deterministic) sequence , ; this implies (choosing e.g. ) that
[TABLE]
(A3) obviously also implies
[TABLE]
Remark 2.3**.**
The condition (2.7) (together with e.g. (A1)) implies
[TABLE]
see [21; 23]; hence (A1)–(A3) imply (2.8). The lower bound (2.8) is the basis for most applications of the configuration models to the random simple graph , since if often allows simple conditioning as said in the introduction.
When also (2.6) holds, (2.8) can easily be shown using the method of moments; this has been the standard method since the introduction of the configuration model [11], see [21] for a proof with no further assumptions. This method is also the basis of our proofs below.
Remark 2.4**.**
We will assume (A3) for our results for the simple graph . In fact, the main results really require only (2.6) and (2.7); by considering subsequences we may then assume that for some , and the proofs then hold with replaced by in e.g. (7.7). However, we prefer to state the results for the more natural condition (A3). (An example satisfying (2.6) and (2.7) but not (A3) is obtained from any example satisfying (A1)–(A3) by changing the degrees of vertices to .)
As said above, it is shown in [21; 23] that (2.8) holds also without assuming (2.6), but the proofs are then somewhat more complicated, and we do not know whether they can be used to replace the assumption (A3) by (2.7) in the theorems below. In fact, it seems likely that the results extend also to cases without (2.7), but that would require completely different methods.
2.3. The corresponding branching process
The usual exploration process reveals the edges (i.e., pairings of half-edges) in one by one as they are needed, starting with the half-edges at a vertex and then continuing with the unpaired half-edges at the neighbours of , and so on, until the component is fully explored. Let be a uniformly random vertex in . As is well-known, see e.g. [13, Lemma 4] for a formal statement, assuming (A1)–(A2), the exploration process of may be approximated by a branching process (regarded as a rooted tree, finite or infinte), in the sense that for any fixed , the first generations of the two processes may be coupled such that they are isomorphic w.h.p. (In particular, if one of the processes has only non-empty generations, then so has the other w.h.p.) The branching process is a Galton–Watson process where the root has offspring distribution as in (A1), and all other vertices have offspring distribution where has the size-biased distribution
[TABLE]
For a rooted unlabelled tree , we define
[TABLE]
with meaning isomorphism (i.e., equality) as unlabelled rooted graphs. Similarly, if is an unrooted unlabelled tree, we define by (2.10), now interpreting as isomorphism (equality) of unlabelled unrooted graphs.
For random variables that are functionals of , we write
[TABLE]
summing over finite unlabelled (rooted or unrooted) trees . Obviously, it here suffices that is defined for finite trees.
We denote the probability generating function of by
[TABLE]
In the supercritical case (A5), let be the unique root in of
[TABLE]
(If (A5) does not hold, we may take , which always satisfies (2.13).) As is well-known, then \operatorname{\mathbb{P}{}}\bigl{(}|{\mathcal{T}}|<\infty\bigr{)}=f(\zeta). See further Sections 3.3 and 11.
3. Main results
The main results are stated below. Proofs are given in later sections. We begin with some notation for subgraph counts.
Let and be (multi)graphs. (Think of as big and small.) We often regard the small graph as unlabelled; in particular, when we talk about several distinct graphs , we mean non-isomorphic. However, for formal definitions it is convenient to regard both and as labelled, meaning that both vertices and edges, and also half-edges, are labelled. (For simple graphs it suffices to label vertices, since the edges are identified by their endpoints. Similarly, half-edges need to be labelled only for loops.) We count subgraphs of isomorphic to in four different ways:
- •
is the number of labelled copies of in .
- •
is the number of unlabelled copies of in .
- •
is the number of labelled isolated copies of in .
- •
is the number of unlabelled isolated copies of in .
Thus, can be defined as the number of injective maps , mapping vertices to vertices, edges to edges and half-edges to half-edges such that the relations between them are preserved. is the number of such maps such that the vertices in the image of have no other edges than those in , i.e., the degrees are preserved. Furthermore,
[TABLE]
where is the number of automorphisms of . If is connected, then is the number of components of isomorphic to .
The simple relations (3.1) shows that it is equivalent to study labelled or unlabelled copies. Nevertheless we will consider both since it often is natural to count unlabelled copies (for example when counting components), but labelled copies often (but not always) are more convenient in our proofs.
We simplify the notation and write Z_{H}:=Z_{H}\bigl{(}G(n,\mathbf{d}_{n})\bigr{)} and Z^{*}_{H}:=Z_{H}\bigl{(}G^{*}(n,\mathbf{d}_{n})\bigr{)}, and similarly for the other versions of subgraph counts.
Example 3.1**.**
Let be a loop and a double edge. Then a multigraph is simple if and only if . If particular, we obtain from by conditioning on the event .
3.1. Small components
Our first theorem gives the asymptotic distributions for the number of small components of different types in . It is well-known that most small components are trees. More precisely, for a tree , there is typically a linear number of components , and asymptotic normality was recently shown by Barbour and Röllin [8] using Stein’s method. We give a new proof of this (under somewhat weaker conditions), since this is the basis of our work below. We complement this with the easy results that the number of components isomorphic to a given conneted unicyclic graph has an asymptotic Poisson distribution, and that there w.h.p. is no small (i.e. fixed size) component with more than one cycle. The latter results follow by standard moment calculations and are presumably known, although we do not know any specific reference.
Theorem 3.2** (Mainly [8]).**
Consider the random multigraph and assume (A1)–(A2) and (A6).
- (i)
If is a tree, then
[TABLE]
for some given by (3.7) below. Furthermore,
[TABLE]
and, if ,
[TABLE]
Moreover, for every tree ,
[TABLE]
where is given by (2.10) (for unrooted ). 2. (ii)
If is a connected unicyclic multigraph, so , then
[TABLE]
with as in (3.3). 3. (iii)
*If is a connected multigraph with more than one cycle, i.e., , then w.h.p. *
Moreover, the limits in (i)–(iii) hold jointly, for any finite number of distinct connected multigraphs , with a joint limit N\bigl{(}0,\Sigma\bigr{)} for the trees, for a covariance matrix \Sigma=\bigl{(}\sigma_{H_{i},H_{j}}\bigr{)} given by
[TABLE]
and with the Poisson limits in (ii) independent of each other and of the joint normal limit in (i). Furthermore, this holds with convergence of all mixed moments. In particular, if and are trees, then
[TABLE]
Moreover, if each tree has and satisfies the condition in (3.4), then the covariance matrix is non-singular.
In (3.7), is the Kronecker delta, equal to 1 when (as unlabelled graphs); furthermore, recall that and .
Remark 3.3**.**
The case , i.e. , has to be excepted in (3.4), since trivially is deterministic, so .
Remark 3.4**.**
The moment convergence in (3.2) implies that if is a tree such that , then (3.2) can be written
[TABLE]
so really is asymptotically normal. The same applies in other theorems below, e.g. Theorem 3.9.
Remark 3.5**.**
The assumption (A6) excludes the two extremal cases and , which are less interesting in the present context; in these cases, there are only a few () small components except (possibly) isolated vertices and edges. Theorem 3.2 hold also when (A6) fails, except that then for every tree (and (3.4) may fail) and thus e.g. (3.2) says only that the variable converges to 0 in probability; cf. Theorem 3.10 and Remark 7.11, where (A6) is not assumed.
Remark 3.6**.**
Cf. the similar results for Erdős–Rényi graphs and , shown by e.g. [16], [6], [7]; see Example 12.3.
Remark 3.7**.**
In spite of (3.3), we cannot in general replace by its asymptotic value in (3.2); the reason is that this would require \operatorname{\mathbb{E}{}}\overline{Z}^{\mathsf{u}*}_{H}=n\lambda_{H}+o\bigl{(}n^{1/2}\bigr{)}, and this rate of convergence in (3.3) does not hold without further assumptions on the rate of convergence in (2.4). In particular, it does not hold in the case of random vertex degrees mentioned in Remark 2.2, see Section 12.
Remark 3.8**.**
As said above, we reprove the result by Barbour and Röllin [8] on tree components. We do not treat the more general class of local statistics studied in [8], and leave it as an open problem whether our methods apply to them in full generality. On the other hand, we include below results on subgraph counts not covered by [8].
One of our main results is that Theorem 3.2 transfers to the simple random graph , under weak assumptions. Obviously, we have to consider only simple graphs , since if contains a loop or a multiple edge.
Theorem 3.9**.**
Assume (A1)–(A3) and (A6). Then, for simple graphs , the results in Theorem 3.2 hold also for the random simple graph and variables , with the same , and . Furthermore, if is a tree, then
[TABLE]
and thus it does not matter whether we normalize as in (3.2) using or .
3.2. Small subgraphs
With a stronger moment assumption on the degrees, we have similar results for the number of copies (not necessarily isolated) of a given tree. We assume (A4), i.e., that every moment is bounded, and thus in particular that has finite moments of all orders. (This implies (A2)–(A3) as said above.)
Theorem 3.10**.**
- (i)
For every tree there exists such that
[TABLE]
Moreover, (3.11) holds jointly for any number of such trees , with a joint limit for some covariance matrix . Furthermore, the limits hold with convergence of all moments. 2. (ii)
The results in (i) hold also for the random simple graph and variables , with the same and . Furthermore,
[TABLE]
and thus it does not matter whether we normalize as in (3.11) using or .
Remark 3.11**.**
We consider only trees in Theorem 3.10. If is a cycle, then the proofs below show that and are asymptotically Poisson distributed. However, for general unicyclic this is not true; this is similar to the case for , see e.g. [14] and [27, Example 3.21]. It can be shown that for a general connected unicyclic , and converge to a compund Poisson distribution. This is besides the point of the present paper, so we leave the details to the reader.
Remark 3.12**.**
We leave explicit calculations of asymptotics of means and variances in Theorem 3.10 to the reader. Note that (for the simple graph ) there are further trivially deterministic cases: , the number of edges in ; similarly, for any , . We do not know whether there also are further cases where the variance (perhaps of some linear combination) is and thus vanishes in the limit taken in (3.11).
Remark 3.13**.**
The condition (A4) assumes that all moments of the degree distribution are bounded. This is presumably stronger than necessary; it seems likely that it is enough that for some depending on for convergence in distribution (3.11), although (A4) presumably is required for moment convergence. Some condition of this type is necessary, since otherwise may be dominated by a few vertices of high degrees, as in the following example.
Example 3.14**.**
Let have one vertex of degree , vertices of degree , vertices of degree 3 and the rest, about , of degree 1. Then (A1)–(A3) are satisfied, with and . Let be the tree with obtained by taking two disjoint stars and joining their central vertices by an edge.
Let be the number of edges in between and a vertex of degree . Each such edge is the central edge in labelled copies of , while the other edges are central edges in at most copies of each. Hence, Z^{*}_{H}=\bigl{(}2+o(1)\bigr{)}n^{5}+O\bigl{(}n^{3}\bigr{)}. Furthermore, it is easy to see (e.g. by the method of moments) that . Hence,
[TABLE]
Thus, after suitable norming, and are asymptotically Poisson distributed and not normal.
3.3. Giant component and susceptibility
It is well-known that in the supercritical case when (A5) holds, there is w.h.p. a unique component of order in or , known as the giant component; see Molloy and Reed [32, 33] and, e.g., [25], [13], [28]; moreover, , which we recall from Section 2.3 is the survival probability of the branching process . Ball and Neal [5] proved (under some extra technical conditions, in particular the existence of a third moment ) that the variance satisfies , with given by (3.16) below. Moreover, Barbour and Röllin [8] proved (under the same techical conditions) that for the random multigraph , the size is asymptotically normal. We reprove this here with our methods (removing some unnecessary conditions); moreover, we show that the result holds for the simple random graph too. (Similar results for and have been known for a long time, see e.g. [37], [34], [35], [12].)
Remark 3.15**.**
In the weakly supercritical case, where but , Riordan [36] showed asymptotic normality of the size (and nullity) of the giant component, both for and (assuming that the degrees are bounded), using methods different from ours. Note that in this case, the giant is smaller: , but . It does not seem to be possible to prove these results by the method in the present paper.
Theorem 3.16** (Partly [5] and [8]).**
(i) * Assume (A1)–(A2) and (A5)–(A6). Then the size of the giant component in has an asymptotically normal distribution:*
[TABLE]
where is given by, with and as in (2.12)–(2.13),
[TABLE]
Furthermore, .
(ii) * Assume (A1)–(A3) and (A5)–(A6). Then the results of 3.16 hold for too, with the same .*
The formula (3.16) is given (in an equivalent form) by Ball and Neal [5]. They also state that , but the proof seems omitted; we give a proof in Section 10 under our (weaker) assumptions.
The proof of Theorem 3.16 is given in Sections 9–11; it is based on the results above and a truncation argument; as in [5] and [8], we obtain results for the giant by subtracting all small components, and the main ideas in this part of the proof are similar. As part of the proof, we show a more general result in Theorem 3.17; we first introduce more notation.
A graph functional is a real-valued functional defined for unlabelled multigraphs . Let be a graph functional, and define for a multigraph
[TABLE]
summing over all unlabelled connected ; the middle equality holds because each component is counted times in the sum over .
In the supercritical case, the second sum in (3.16) may be dominated by the single term coming from the giant component, and it is sometimes more interesting to exclude it and consider only small components. We define
[TABLE]
Recall also the notation defined in (2.11).
Theorem 3.17**.**
(i) * Assume (A1)–(A2) and (A5). Let be a graph functional such that*
[TABLE]
for some constant and all connected . Define by (3.17). Then
[TABLE]
and
[TABLE]
where, summing over unlabelled, unrooted trees ,
[TABLE]
with all sums and expectations absolutely convergent.
(ii) * Assume (A1)–(A3) and (A5). Then the results in 3.17 hold for too.*
Remark 3.18**.**
If for some , then cannot contain any vertices of degree , so . Hence the terms in the last sum in (3.17) with vanish and are not a problem (recall that we interpret as 0); we could write the sum as a sum over . The same applies to other sums below with in the denominator.
Remark 3.19**.**
We do not assume (A6) in Theorem 3.17, since the results hold also when (A6) fails (so by (A5)), but this case is less interesting since then , because a.s. and thus .
Example 3.20**.**
Let . Then (3.17) yields
[TABLE]
Hence the asymptotic normality of in (3.14) is equivalent to the asymptotic normality of and , which follows from Theorem 3.17, with . We will show in Lemma 11.1 that this yields (3.16), and in Lemma 10.5 that , assuming (A5) and (A6).
Similarly, we may take , so that (3.17) yields, for or ,
[TABLE]
Theorem 3.17 thus shows asymptotic normality of ; more generally, taking a linear combinatíon , we obtain joint asymptotic normality of and . (Cf. [36] for the weakly supercritical case not studied here.) We conjecture that the limit distribution has a non-singular covariance matrix; however, we have not verified this; see Remark 10.6.
Example 3.21**.**
Let ; then (3.17) yields
[TABLE]
which is the modified susceptibility studied in [22]. It was shown there that, assuming (A1)–(A3) and (A5)–(A6), converges in probability to . Theorem 3.17 yields convergence of to the same limit and, moreover, asymptotic normality
[TABLE]
for some given by (3.17). This could be evaluated explicitly similarly to Lemma 11.1, but we have not done so. We show in Example 10.7 that .
Remark 3.22**.**
We assume (A5) (supercriticality) in Theorems 3.16 and 3.17. This is for the truncation argument allowing us to ignore large components. (For example Lemma 9.4.)
In the critical case , there are typically a few large components of order , see [36, Theorem 1.3], and Theorem 3.16 does not hold; thus Theorem 3.17 does not hold.
In the subcritical case, Theorem 3.16 is not interesting, but other functionals are, for example the susceptibility in Example 3.21. We conjecture that Theorem 3.17 holds under suitable conditions, now for rather than (the difference should be negligible), but that stronger conditions than above (on the degree sequences or on the graph functional, or both) are required to keep the contribution from large components small. Perhaps (A4) will do; alternatively, (3.18) may be replaced by a stronger assumption. The paper is long as it is, and we do not consider this case further.
4. The configuration model
The standard way to constuct a random multigraph is by the configuration model, which was introduced by Bollobás [11]. (See [9; 38] for related models and arguments.) As is well-known, we then assign a set of half-edges to each vertex ; this gives a total of half-edges, and we choose a perfect matching of them uniformly at random. This defines by regarding each pair of half-edges in the matching as an edge.
We will use this standard version in Section 8, but in the main part of our arguments (Section 7), it will be convenient to use a variation of this construction that yields the same result in a somewhat more circuitous way. To see it, we may start with the standard construction above, but also assume that we label the edges by putting a cuff on each edge, with the cuffs labelled (uniformly at random). Furthermore, each cuff has two half-edges, labelled and (randomly), joined to one each of the half-edges making the edge. (The half-edges are now really quarter-edges, but we keep the name.)
Our version of the configuration model can now be described as follows. Let be the set of half-edges assigned to vertex in the standard model above. Let be set of all half-edges and label them (arbitrarily) as . Take also a second set of half-edges representing the half-edges at the cuffs, with and at cuff , which we now for emphasis denote . Let be a uniformly random permutation, and join each to . We denote the result by . We interpret as a bipartite graph by merging all half-edges in into the vertex , and the half-edges and into the cuff ; thus the two vertex sets are and .
Finally, to obtain , we merge the two edges at each cuff into one, and forget the cuffs. This yields evidently the same result as the standard configuration model.
The original vertices will sometimes be called real vertices.
Remark 4.1**.**
The reader may recognize that our construction is just the standard configuration model construction of a random bipartite graph with the real vertices on one side and the cuffs (each of degree 2) on the other side, followed by contractions eliminating all cuffs.
5. Cumulants
Our proofs are based on the method of moments, in the form using cumulants, see e.g. [27, Section 6.1]. We denote the -th cumulant of a random variable by , and the mixed cumulant of random variables by . (The variables are assumed to have finite moments.) We recall the following properties of mixed cumulants, see e.g. [27, p. 147] or [31].
- (1)
( times). 2. (2)
is multilinear in . 3. (3)
if can be partitioned into two non-empty sets of random variables that are independent of each other. 4. (4)
, summing over all partitions of into non-empty sets , . 5. (5)
, summing as in (4).
6. The main lemma
In this section we state and prove Lemma 6.1 below, whch is the central part of the proof of the results in the present paper. Although the lemma is motivated by its application to the configuration model and random graphs, we state it in a self-contained way. The lemma could be derived using the general theory by [18], and is very similar to [17, Theorem 1.4], but we give a complete proof, using the following notation.
Suppose that and that for each we are given and two sequences and . (These are fixed throughout this section.)
Let be uniformly random and define the random indicator variables
[TABLE]
Our goal in this section is to estimate the mixed cumulant .
Let and . Let be the graph with vertex set and an edge if or . (In other words, there is an edge when and use a common or a common in the definition (6.1).) The connected components of are called blocks. Let be the number of blocks, and denote the blocks by (in some order, e.g. lexicographic); thus and form a partition of . Furthermore, let
[TABLE]
i.e., the number of distinct pairs .
Lemma 6.1**.**
With notations as above,
[TABLE]
where is a constant that may depend on and but not on .
Remark 6.2**.**
The estimate \bigl{\lvert}\kappa\bigl{(}Y_{1},\dots,Y_{r})\bigr{)}\bigr{\rvert}\leqslant CN^{-e} is straightforward; in particular, the case of (6.3) is easy. If the indicators were independent for disjoint pairs , then the variables belonging to different blocks would be independent, and thus the mixed cumulant would vanish when . Of course, in our setting these indicators are not independent, but they are only weakly dependent and Lemma 6.1 yields a substitute with an estimate that becomes smaller when the number of blocks gets larger. We will see later that this is sufficient for our purposes.
Proof of Lemma 6.1.
In this proof, will denote various constants that may depend on and , but not on or . The same holds for the implied constants in .
For each set , let , , and, generalizing (6.2),
[TABLE]
Thus, . Note that the sets are disjoint.
The idea of the proof is, hardly surprisingly, that the values for different are almost independent, and thus indicators for different are almost independent. However; this is only “almost”, and therefore we approximate using truly independent variables, constructed in a careful way.
We begin by constructing the random permutation in a special way. Define recursively and by
[TABLE]
Assume . (Otherwise (6.3) is trivial if is large enough.)
Let be independent uniformly random permutations in , and let be the random string consisting of the first values of . We write the elements of as , . Using a fixed bijection between and , we also regard as an array , with and .
For and a set of indices , say that an element is bad if it also occurs in for some . (I.e., if it equals for some .) Finally, define for as for the smallest such that this element is not bad. Note that we have defined in (6.5) so large that there is always at least one good element for each . (We will not use for ; we may define these arbitrarily to make a permutation if desired, but we may also just ignore them.)
By construction, for a given and , are distinct for , and these values are distinct from for all , and .
In particular, consider the case for all . Then all , for and , are distinct, and by symmetry, they equal any sequence of distinct values in with the same probability. Hence, they have the same (joint) distribution as the values for . (Recall that the sets are disjoint.) Consequently, we may assume that
[TABLE]
For , let be the unique index such that . Given , define further a modification of in (6.1) by
[TABLE]
[TABLE]
Define also a random variable denoted by
[TABLE]
This can be regarded as a kind of inclusion–exclusion, or Möbius inversion, and (6.9) implies the inverse relation,
[TABLE]
We now conclude from (6.8), (6.10) and the multilinearity of mixed cumulants (2) that
[TABLE]
The number of terms in this sum is , so it suffices to estimate each term individually.
Thus, fix a sequence with . Let be the graph with vertex set and an edge (with ) if .
Note first that a variable by (6.9), (6.7) and the construction of , depends only on and for . Consequently, if is disconnected, so we may divide into two parts and with no edges between them, then the random variables depend only on for , , and thus these two sets of variables are independent. Hence, (3) yields the following.
Claim 1**.**
If is disconnected, then \kappa\bigl{(}Y_{1;\Delta J_{1}},\dots,Y_{r;\Delta J_{r}}\bigr{)}=0.
Consequently, it suffices to consider the case when is connected. For a set , let
[TABLE]
By (4), the mixed cumulant in (6.11) can be written as a linear combination of products
[TABLE]
where is a partition of . The number of terms and their coefficients are , and thus it suffices to estimate each product (6.13).
Define for , and the indicator
[TABLE]
and let be the array of all these indicators. Thus tells us exactly which coincidences there are among the values . (Recall that there are no such coincidences with the same .) This also determines, for each and , the indices of the elements that are bad when constructing , and thus exactly which element that equals, for each . This implies the following.
Claim 2**.**
For any given sequence of sets , determines exactly what coincidences, if any, there are among all for and .
Furthermore, by symmetry, conditioned on , any sequence of non-coinciding values has the same distribution as a sequence drawn without replacement from .
Let , and consider . This is, recalling (6.7), a product of factors of the type . There may be some pairs that are repeated; we let be the product after deleting all factors such that repeats a previous pair. Note that the number of factors remaining in is by (6.4). Suppose that
[TABLE]
for two of the remaining factors and . If further , then , by our construction of , and then since we have eliminated repetitions. On the other hand, if , then and belong to different blocks, so by the definition of blocks and thus . Hence, in any case, (6.15) implies , and thus
[TABLE]
This and Claim 2 show that conditioned on , either vanishes because the product contains conflicting indicators, or else all occurring in the factors remaining in are distinct and obtained by drawing without replacement from . Hence, in any case,
[TABLE]
This is valid for any . Hence, by the definitions (6.12) and (6.9)
[TABLE]
Suppose that , and suppose that and have no common element. Then, for every , an element is bad for if and only if it is bad for , and thus for all ; hence for . Consequently, (6.9) shows that then for every such that . In contrapositive form, this shows the following, recalling (6.14),
Claim 3**.**
If , and , then and have at least one common element, i.e., for some and .
If is a graph with vertex set , say that covers if for every edge with , there exist and such that . For a set , let be the graph with edges
[TABLE]
Then, by (6.12) and Claim 3, if , then covers . This implies that we can improve (6.18) to
[TABLE]
Hence,
[TABLE]
Recall that we have fixed , so is for each a fixed non-random graph on . If is any graph on , let be the number of edges in a spanning forest. (This equals minus the number of components of .) We claim that if is any graph on ,
[TABLE]
To see this, we may replace by a spanning forest, so it suffices to show (6.22) when is a forest. We do this by induction on ; the base case being trivial. If , let be a leaf in , let be the edge incident to , and let . Recall that covers if and have a common element. Thus, conditioning on for all , the probability that covers is at most . Hence,
[TABLE]
and (6.22) follows by induction.
Combining (6.21) and (6.22), we find
[TABLE]
Finally, consider as in (6.13) a partition of . Then (6.2) and (6.4) imply
[TABLE]
Furthermore, by (6.19). If we take a spanning subtree for each , then the union of these subtrees has the same components as . Hence, if is connected, is connected and has thus at least edges, which implies
[TABLE]
Consequently, (6.24) implies by (6.25)–(6.26),
[TABLE]
As said above, the mixed cumulant in (6.11) is by (4) and (6.12) a linear combination of such products, and thus (6.27) yields
[TABLE]
We have here assumed that is connected, but as said in Claim 1, the cumulant vanishes otherwise, so (6.28) holds in general.
Finally, also as said above, the result (6.3) follows from (6.11) and (6.28). ∎
7. Proofs of Theorems 3.2, 3.9 and 3.10
We prove first Theorem 3.2 for , and show then how the proof can be extended to Theorem 3.9 for the simple graph . Theorem 3.10 follows by simple modifications.
As said earlier, we use the method of moments in the form with cumulants; we thus show convergence of all (mixed) cumulants. The proofs will use Lemma 6.1 to estimate higher cumulants, as detailed below. In addition, we estimate means and variances (and higher moments for unicyclic components) by standard methods; for the sake of focussing the presentation, we state these results as the following lemmas but postpone their proofs to Section 8. Recall that Z^{*}_{H}:=Z_{H}\bigl{(}G^{*}(n,\mathbf{d}_{n})\bigr{)},
Lemma 7.1**.**
Assume (A1)–(A2). Then, for the random multigraph :
- (i)
If is a tree, then
[TABLE]
with given by (3.3). 2. (ii)
If are trees, then
[TABLE]
where is given by (3.7). 3. (iii)
If is a connected unicyclic multigraph, then
[TABLE]
with as in (3.3). Moreover, for any distinct such multigraphs , and integers , the mixed factorial moments converge:
[TABLE] 4. (iv)
If is a connected multigraph with more than one cycle, i.e., , then
[TABLE]
Lemma 7.2**.**
Assume (A1)–(A2). For every tree , (3.5) holds, i.e., .
Lemma 7.3**.**
Assume (A1)–(A2) and (A6). If is a tree with , then . More generally, if are distinct trees with and for each , then the matrix \bigl{(}\sigma_{H_{i},H_{j}}\bigr{)} is non-singular.
The next lemma shows that, assuming the second moment condition (A3), Lemma 7.1(iii) extends to include also the numbers of loops and pair of multiple edges , cf. Example 3.1. (We have to restrict to simple graphs ; if is not simple, there is obviously a strong dependence between and or .)
Lemma 7.4**.**
Assume (A1)–(A3). Let be connected unicyclic simple graphs. Then, for any integers ,
[TABLE]
where
[TABLE]
As said above, we postpone the proofs of these lemmas to Section 8.
Fix a (multi)graph . Let , and assume that the vertices of are labelled . Our aim is to estimate the cumulants for (fixed) and corresponding mixed cumulants. In this section, denotes constants that may depend on the order and the fixed (small) graph (and corresponding graphs below), but not on ; the same holds for the implicit constants in .
We use the version of the configuration model described in Section 4. A copy of in then corresponds to a copy of in , where is obtained from by subdividing each edge into two, and regarding the new vertices as cuffs, and where we only count copies of that map real vertices to real vertices and cuffs to cuffs. We consider also the identity of the half-edges used in the construction, and see that an isolated labelled copy of in is described by the following data:
- (1)
For each vertex : a real vertex such that . Furthermore, are distinct. 2. (2)
For each edge in : two half-edges and , and also a cuff and a labelling of the two half-edges at as and . Furthermore, all these half-edges for are distinct.
Each such family of data \phi:=\bigl{(}\nu(i),\alpha_{ij},\alpha_{ij}^{\prime},\chi_{ij},\beta_{ij},\beta_{ij}^{\prime}\bigr{)} defines a possible isolated labelled copy of . Thus, if is the indicator that exists in , and is the set of all such data , then
[TABLE]
Furthermore, exists in if and only if the construction of yields edges and for each , which is equivalent to and . Consequently, each is the product of indicators of the type . Each is thus a random variable of the type in (6.1).
Consider now a sequence of multigraphs. By (7.8) and multilinearity (2), the mixed cumulant \kappa\bigl{(}\overline{Z}^{*}_{H_{1}},\dots,\overline{Z}^{*}_{H_{r}}\bigr{)} can be expanded as
[TABLE]
where the mixed cumulants may be estimated by Lemma 6.1.
It remains to estimate the parameters and in Lemma 6.1 and the corresponding number of terms in (7.9). This is done in the following lemmas, using a standard type of argument that is common in applications of the method of moments in combinatorial problems.
Lemma 7.5**.**
Assume (A1)–(A2). Let be a sequence of connected multigraphs. Then
[TABLE]
where we sum over all unlabelled bipartite multigraphs that can be written as a union where and, as above, is obtained from by subdividing each edge into two.
We consider only unions respecting the bipartition between real vertices and cuffs. Note that the set of in (7.10) is finite and independent of .
Proof.
For notational simplicity we consider the case of a single multigraph , i.e., ; the proof for mixed cumulants is the same.
Consider one term in the sum in (7.9); it is given by indicators corresponding to copies of . Let . We apply Lemma 6.1 with . Note that in (6.2) equals . If the graph in Section 6 has an edge , then and have a common half-edge, and thus a common vertex (real or cuff); hence and are subgraphs of the same component of . This yields a surjective map from the blocks of to the components of , and thus the number of blocks in Lemma 6.1 satisfies . Hence, Lemma 6.1 yields
[TABLE]
Now let us count the number of that yield a union isomorphic to some given bipartite multigraph . Each cuff in and its 2 half-edges may be chosen in ways. Let be a real vertex in , and let be its degree. Also, let . We consider by definition only copies of such that each vertex has degree equal to the corresponding vertex in , and thus . Hence, for each such choice of corresponding to , we have at most choices of the half-edges incident to it; hence the number of choices of and its half-edges is .
Consequently, each vertex (real or cuff) in gives choices of corresponding vertex and half-edges in . Hence, the number of isomorphic to a given is O\bigl{(}n^{v(\mathcal{F})}\bigr{)}. Finally, given , we can choose in ways. Hence, the total number of terms in (7.9) corresponding to a given is O\bigl{(}n^{v(\mathcal{F})}\bigr{)}, and by (7.11), their total contribution is O\bigl{(}n^{v(\mathcal{F})-e(\mathcal{F})-q(\mathcal{F})+1}\bigr{)}. This yields (7.10). ∎
Lemma 7.6**.**
*Let , where is a sequence of connected multigraphs. *
- (i)
Then . 2. (ii)
If furthermore at least one is not a tree, then .
Proof.
Consider first the case when is connected. We may then reorder such that each is connected; furthermore, we may do this starting with any of the multigraphs as , and in (ii) we may thus assume that is not a tree.
Now, choose for each a spanning tree , and define , . Thus each is connected.
Let and let . Since is a subgraph of , which is a tree, we see that is a forest, and since is connected, is not empty. Hence, . Consequently, since ,
[TABLE]
when . Hence, by induction . Furthermore, is a spanning subgraph of , and thus and . Consequently,
[TABLE]
Moreover, since is connected, and if is not a tree, then . Consequently, (7.13) yields (i) and (ii) in the case .
If , i.e., is disconnected, denote the components of by , . Then, by what just has been shown, , and if some is not a tree, then . Hence, the result fullows by summing over all components . ∎
Lemma 7.7**.**
Assume (A1)–(A2). Let be a sequence of connected multigraphs. Then
[TABLE]
Furthermore, if at least one is not a tree, then
[TABLE]
Proof.
An immediate consequence of Lemmas 7.5 and 7.6, applying the latter to . ∎
Proof of Theorem 3.2.
Define
[TABLE]
First, (iii), the case when , is easy. By Lemma 7.1(iv) and Markov’s inequality, . (Recall that is a non-negative integer.) In particular, . Furthermore, (7.15) shows that every cumulant , and thus every moment is bounded by (5). This implies uniform integrability of every power, and thus for every . Convergence (to 0) of joint moments with other follows by the Cauchy–Schwarz inequality when we have shown convergence of moments also in (i) and (ii).
For (i) and (ii), and joint convergence of both, we use the method of moments, in the cumulant version. Let be random variables defined for (unlabelled) trees and unicyclic multigraphs to have the claimed joint limit distribution:
- (i)
For trees , have a joint normal distribution with and \operatorname{Cov}\bigl{(}{\widetilde{X}}_{T_{1}},{\widetilde{X}}_{T_{2}}\bigr{)}=\sigma_{T_{1},T_{2}}. 2. (ii)
For unicyclic , with independent of all other .
We then claim that for any and trees or connected unicyclic multigraphs ,
[TABLE]
Indeed, this implies by (5) convergence of all mixed moments. Furthermore, normal and Poisson distributions have finite moment generating functions, and thus the joint distribution of any finite number of is determined by its mixed moments. Hence, (7.17) implies convergence
[TABLE]
as claimed in the theorem. Furthermore, all moments converge, as just seen.
Note that the right-hand side in (7.17) vanishes except in the two cases and all are the same unicyclic multigraph, or and both and are trees. This follows by the independence assumption in (ii) above together with (3), and the fact that all mixed cumulants of order vanish for joint normal distributions.
In order to show (7.17), we consider three cases.
Case 1: Every is a tree. Consider three subcases. First, if , then
[TABLE]
Secondly, if , then (7.16) and (7.2) yield
[TABLE]
Thirdly, if , then (7.14) in Lemma 7.7 together with (3.1) and (7.16) yields
[TABLE]
This verifies (7.17) in each of the three subcases.
Case 2: Every is unicyclic. This case is really nothing new. By Lemma 7.1(iii), we have convergence of all mixed factorial moments to the corresponding moments of , which by well-known algebraic identities is equivalent to convergence of all mixed moments, and thus to convergence of all mixed cumulants. (See [27, Section 6.1] and (4)–(5).)
Case 3: At least one is a tree and at least one is unicyclic. Suppose that there are trees (not necessarily distinct) among . Then, by (7.15) in Lemma 7.7 together with (3.1) and (7.16),
[TABLE]
Hence, (7.17) holds in this case too.
This completes the verification of (7.17) in all cases; as said above, this proves (7.18), with convergence of all moments.
Finally, Lemma 7.1 shows (3.3), Lemma 7.2 shows (3.5), Lemma 7.3 shows the first equivalence in (3.4), and the second is obvious from (3.3). Lemma 7.3 yields also the final claim on non-singularity of the covariance matrix \Sigma=\bigl{(}\sigma_{H_{i},H_{j}}\bigr{)}. ∎
To show Theorem 3.9 for the simple random graph , we combine Theorem 3.2 with asymptotics of the counts and of loops and double edges. Note that counts all occurences of , which in general differs from , the number of isolated occurences. The latter is already included in Theorem 3.2, but requires an extra argument.
and can be expressed as sums (7.8) for sets of data as above, with the difference that in (1) above, we omit the condition on the degree .
We may just as well consider a somewhat more general situation: we say that a marked multigraph \widetilde{H}=\bigl{(}H,(\iota_{w})_{w\in H}\bigr{)} is a multigraph where each vertex has a mark . We then define as the number of labelled copies of in such that each bound vertex corresponds to a vertex in with the same degree (while there is no restriction for a free vertex). Hence, if all vertices in are free, , and if all vertices are bound, . We write, as in the unmarked case, and .
Assume that for convenience. Then is given by (7.8) for a set of defined by (1)–(2), but with the degree condition in (1) omitted for free vertices. Let cuffs in subdivided multigraphs be bound by default. Furthermore, define , where are marked multigraphs, by taking the union and marking a vertex as bound if it is bound in some , and free otherwise.
We extend Lemma 7.5 as follows.
Lemma 7.8**.**
Assume (A1)–(A2). Let be a sequence of connected marked multigraphs. Then
[TABLE]
where we sum over all unlabelled bipartite multigraphs that can be written as a union where and is obtained from by subdividing each edge into two as above.
Proof.
We follow the proof of Lemma 7.5. The only difference is when counting the number of ways we can choose a copy of a vertex and its half-edges. If is bound, then as in the proof of Lemma 7.5, for some , and then the half-edges can be chosen in at most ways for each , giving as before choices of vertex and half-edges.
On the other hand, if is free, then we bound for each real vertex the number of choices of the half-edges by ; hence, the total number of choices of and its half-edges is at most
[TABLE]
This gives an additional factor for each free , and thus (7.23) instead of (7.10). ∎
Lemma 7.9**.**
Let with , where and is a sequence of connected multigraphs, and , where and each is a cycle , . Say that a vertex is free, and let
[TABLE]
Then .
Proof.
Let ; thus the result is .
We use induction on . If , then each free vertex has degree , and thus . Hence, the result follows from Lemma 7.6(ii) applied to .
Now suppose that the results hold for some , and add another cycle , of length , say. Denote the old by , so . We consider the changes in the quantities , which we denote by , and so on. We treat three cases separately.
(i). If the new cycle is disjoint from , then , , and ; the latter since all new vertices have degree 2 and thus do not contribute to . Hence, .
(ii). Suppose that is edge-disjoint but not vertex-disjoint from . Then and . Each vertex in contributes 1 to and 0 to , while each vertex in contributes 0 to and at most 1 to , since the new cycle increases the degree by 2. (If the vertex is not free in , the contribution to is 0.) Hence, each vertex contributes at most 1 to , and thus . Consequently, .
(iii). Suppose that is has some edge in common with . Then . The edges in form disjoint paths . Suppose that contains edges. Then contributes to , the internal vertices in contribute each to and the two endpoints of contribute at most each to . There are no other contributions, and thus and consequently .
We have shown that in all cases, which completes the induction. ∎
Lemma 7.10**.**
Assume (A1)–(A3). Let be trees or connected unicyclic simple graphs. Then, the joint limits in distribution in Theorem 3.2(i)(ii) hold jointly with
[TABLE]
with and given by (7.7), and with the limits in (7.26) and (7.27) independent of each other and of the limits for in Theorem 3.2(i)(ii).
Proof.
We extend the proof of Theorem 3.2, and show that the convergence (7.17) of joint cumulants holds also if we consider besides the variables also and (possibly repeated several times), and the corresponding and , independent of each other and all . We regard , , as symbols used only to denote these variables; they are not any graphs; for convenience we may say , but it should be interpreted in this formal sense.
We have in the proof of Theorem 3.2 proved (7.17) when there is no among , by considering three different cases separately. We now consider two further cases.
Case 4: Some is a , and some is a tree. Let be the number of that are trees. Regard each as a marked multigraph ; if we let every vertex be free, and otherwise we let every vertex be bound. By (3.1), applied also to , we find similarly to (7.22)
[TABLE]
which we estimate by (7.23). If , then (A3) implies, through (2.6) and (2.7),
[TABLE]
Furthermore, if is as in Lemma 7.8, then a vertex is free if and only if it belongs to only for . In particular, if is free then . Let be as in (7.25) with , the union of for and the union of the other . Then Lemma 7.8, (7.29), (7.25) and Lemma 7.9 yield
[TABLE]
Consequently, (7.28) yields
[TABLE]
Hence, (7.17) holds in this case too.
Case 5: Some is a , but no is a tree. This is similar to Case 7. Lemma 7.4 shows convergence of all mixed factorial moments, which is equivalent to convergence of all mixed moments and of all mixed cumulants.
This shows that (7.17) holds in all cases, which implies joint convergence in distribution (7.18) as above. ∎
Proof of Theorem 3.9.
Theorem 3.2(iii) (multicyclic ) transfers immediately by (2.8). Consider thus only trees and unicyclic .
The joint convergence in distribution in (i) and (ii) for is an immediate consequence of Lemma 7.10 and conditioning on . (We keep the normalization by in (3.2).)
Furthermore, with as in (7.16), we have by Theorem 3.2 convergence of every moment , and thus . Furthermore, (2.8) holds (as said in Remark 2.3, and a consequence of (7.26)–(7.27)); hence for every even integer ,
[TABLE]
In other words, the conditioned random variables \bigl{(}X_{H}\mid G^{*}(n,\mathbf{d}_{n})\text{ is simple}\bigr{)} have bounded moments of arbitrary order; hence the convergence in distribution of these variables just shown implies that moment convergence holds also for the conditioned variables, i.e., for . Convergence of mixed moments follows by the same argument.
In particular, this shows that for a tree ,
[TABLE]
which shows (3.10), and completes the proof. ∎
Proof of Theorem 3.10.
(i): As the proof of Theorem 3.2 (only Case 7 is relevant), but using Lemma 7.8 (with all vertices free) instead of Lemma 7.7; the extra factors in (7.23) are all by the assumption (A4), so this does not affect the rest of the proof.
(ii): As the proof of Theorem 3.9, using again Lemma 7.8 and (A4) instead of Lemma 7.7. ∎
Remark 7.11**.**
Theorem 3.10 extends, with the same proof, to marked trees , where as above each vertex is marked as either bound or free. This includes Theorems 3.2, 3.9 and 3.10, but also mixed cases. More generally, we may consider marked multigraphs where each vertex is marked with a set of allowed degrees; we count only copies such that each corresponds to a vertex with . For example, we may count edges such that one endpoint has prime degree and the other is a leaf. The proofs above hold for this case too; we say that a vertex is bound if is finite, and free otherwise. If some vertex is free we assume (A4) as in Theorem 3.10; if all vertices are bound, the assumptions (A1)–(A2) or (A1)–(A3) in Theorems 3.2 and 3.9 are enough.
8. Means and variances
In this section we prove Lemmas 7.1–7.4 used in the proofs in Section 7. We will here use the standard version of the configuration model, recalled at the beginning of Section 4; we thus use the half-edges (see Section 4) and take a random perfect matching of them.
Proof of Lemma 7.1.
Let be a multigraph and let be the number of vertices of degree in , . We argue as in Section 7; we denote the possible isolated labelled copies of in by , and obtain in analogy with (7.8)
[TABLE]
where now is the indicator that exists as an isolated subgraph in . is specified by:
- (1)
For each vertex : a vertex such that . Furthermore, are distinct. (As (1).) 2. (2)
For each vertex : also a bijection between the half-edges at and the half-edges at . These bijections define how the half-edges in are paired in .
For each , there are vertices such that , and we may choose the corresponding in (1) in ways; then for each of these vertices the bijection of half-edges in (2) may be chosen in ways. Hence, the number of is
[TABLE]
(The product in (8.2) is really finite, since only for finitely many .) Each is a product of indicators of specific pairings of the type . Hence,
[TABLE]
Consequently, (8.1), (8.2) and (8.3) yield the exact formula
[TABLE]
As , we obtain from (8.4) using (A1) and defined in (3.3),
[TABLE]
Using (3.1), this proves (7.1), (7.3) and (7.5).
Now consider two connected multigraphs and , and let . By (8.1),
[TABLE]
Let and condition on . Then is a component of , and the rest of the graph, i.e. , is given by another instance of the configuration model, with replaced by and by . Consequently, if we first only consider and such that and are disjoint, then, using (8.4) for ,
[TABLE]
If and are not disjoint, then both can occur as components only if they coincide as unlabelled graphs. Hence, if , then \operatorname{\mathbb{E}{}}\bigl{(}\overline{Z}^{*}_{H}\overline{Z}^{*}_{H^{\prime}}\bigr{)} is given by (8.7), while if , then there is an additional term . We switch to counting unlabelled copies, using (3.1) as usual, and obtain
[TABLE]
Consider first the case when and are unicyclic. If for every such that , then all fractions in (8.8) tend to 1 as , and thus (8.8) and (7.3) yield
[TABLE]
If for some with , then by (7.3) and the definition (3.3) of (or directly by (8.5)), and (8.8) implies that (8.9) still holds. If , (8.9) yields (7.4) with and ; if , (8.9) yields \operatorname{\mathbb{E}{}}\bigl{(}\overline{Z}^{\mathsf{u}*}_{H}\bigr{)}_{2}\to\lambda_{H}^{2}, another instance of (7.4) ( and ).
This shows (7.4) when the degree , The general case is proved in the same way. The factorial moments in (7.4) means that we only count copies that are distinct, and therefore disjoint, and we may condition on one indicator as in (8.7) and use induction; this is a standard argument and we omit the details.
Finally, consider the case of two trees and . Then (8.8) yields
[TABLE]
If whenever , then
[TABLE]
Hence, (8.10) and (7.3) then yield
[TABLE]
If so for some with , then (8.4) yields , and it is easily seen from (8.10) and an expansion as in (8.11) that \operatorname{Cov}\bigl{(}\overline{Z}^{\mathsf{u}*}_{H},\overline{Z}^{\mathsf{u}*}_{H^{\prime}}\bigr{)}=o(n). Hence, (8.12) holds in this case too.
We have shown (8.12) in general; this is the same as (7.2) with the definition (3.7), which completes the proof of Lemma 7.1. ∎
Remark 8.1**.**
We used labelled copies in the proof for convenience and transferred the results to unlabelled copies by (3.1). Alternatively, and essentially equivalently, we may define if and as unlabelled graphs, and note that then . Hence splits into equivalence classes of elements each. Define as a subset of consisting of one element from each equivalence class. Then, cf. (8.1) and (3.1), and
[TABLE]
which can be used instead of (8.1), yielding the same results.
Proof of Lemma 7.4.
This is similar to the proof of the special case (7.4) in Lemma 7.1.
The special case is shown in the proof of (2.8) in [21, Section 7 including Remark 6]. In general, we take copies of for and , and condition on for all such and . Then
[TABLE]
by the case just discussed applied to , and (7.6) follows by the argument in (8.7). (It is here convenient to count unlabelled copies as in Remark 8.1.) ∎
Proof of Lemma 7.2.
Let be a uniformly random vertex in . Then,
[TABLE]
since each component isomorphic to contains vertices that are possible choices of . Hence, using (7.1),
[TABLE]
Furthermore, by the coupling of the exploration process and the branching process discussed in Section 2.3,
[TABLE]
Proof of Lemma 7.3.
Suppose that this fails, and that are distinct trees with , and for some real numbers . Hence, by (3.8),
[TABLE]
Hence, for any tree , by the Cauchy–Schwarz inequality and (3.8),
[TABLE]
and thus by (3.8) again,
[TABLE]
For two trees and , define
[TABLE]
so that (3.7) can be written
[TABLE]
Thus, (8.20) yields, for every tree ,
[TABLE]
By assumption (A6), there exists an such that . Given any tree with , we may replace a leaf by a vertex of degree , joined to new leaves. Denote the result by ; this is a tree with and e\bigl{(}T^{(1)}\bigr{)}=e(T)+r-1. Hence, for any other tree ,
[TABLE]
Repeat this procedure and obtain a sequence of trees , , with (replacing an arbitrary leaf each time). If is large enough, then for , and thus and (8.23) yields
[TABLE]
If , then also for every , and thus (8.25) yields for large . Furthermore, it follows from (8.24) that this sum is a linear function of , and thus it vanishes for all , i.e.,
[TABLE]
In particular we can take in (8.26), and see that for every tree with and . Hence, the second term in (8.23) vanishes for every tree with , and thus (8.23) and (8.26) imply
[TABLE]
for every such tree . However, taking , this yields , a contradiction. ∎
9. Proof of Theorem 3.17
We say that a graph functional has finite support if for only finitely many unlabelled , or equivalently, if there exists such that
[TABLE]
Lemma 9.1**.**
Assume (A1)–(A2). Let be a graph functional with finite support and define by (3.16). Then
[TABLE]
Proof.
Let be a uniformly random vertex in , and couple the exploration process of with as in Section 2.3. Let be as in (9.1). If the first generations of the two processes are equal, then either (as unlabelled rooted graphs), or both have at least edges, and in both cases . Hence, noting that (9.1) also implies that is bounded,
[TABLE]
Furthermore, conditioning on ,
[TABLE]
Thus, taking the expectation,
[TABLE]
The result (9.2) follows by combining (9.5) with (9.3). ∎
Lemma 9.2**.**
Assume (A1)–(A2). Let be a graph functional with finite support, let be as in (9.1) and let . Then, for so large that and , with ,
[TABLE]
Proof.
Let and be independent uniformly random vertices in . Then
[TABLE]
We reveal the edges in in a special order. (Cf. the related argument in [5, Section 4.2].) First, let and denote independent exploration processes of and , starting at and as above. (These may thus conflict. Think of them as exploring different copies of .) Let and be given by and , respectively, and note that and are independent. Next, start from scratch and reveal first the edges of according to the process , but stop when edges have been found, or when is exhausted (if it has less than edges), where again is as in (9.1); let denote the explored part of . Then reveal edges according to (starting from ) as long this does not involve any half-edge already paired. When the first conflict occurs, abandon and pair the remaining half-edges uniformly at random in any order. This yields a copy of , and we define , , for it.
By the construction, and (9.1), . Furthermore, only if a conflict has occurred when revealing one of the first edges according to . This may happen either because belongs to , or because one of the half-edges already found during the exploration of the first edges of is paired by with one of the half-edges in . Condition on . Since we reveal only at most edges and thus vertices in , the first possibility has probability , and the second possibility has probability for each of at most pairings; hence, we obtain the union bound, using ,
[TABLE]
The construction shows that and have the correct joint distribution, while and have same individual distribution as these, but are independent of each other. Thus, by (9) and (9.4),
[TABLE]
Consequently, recalling ,
[TABLE]
Furthermore, by (9.8), since is determined by ,
[TABLE]
Hence, (9.11) yields
[TABLE]
which yields (9.6). ∎
Lemma 9.3**.**
Assume (A1)–(A2). For every , there exists such that if is any subset with , then . In particular, every subgraph of with has (deterministically).
Proof.
We have , where is a uniformly random index in . The uniform integrability of (see (A2)) means (see e.g. [19, Theorem 5.4.1]) that there exists such that for any event with , we have \operatorname{\mathbb{E}{}}\bigl{(}D_{n};\,\mathcal{E}\bigr{)}<\varepsilon. Let . Then and \sum_{i\in\mathcal{J}}d_{i}=n\operatorname{\mathbb{E}{}}\bigl{(}D_{n};\,\mathcal{E}\bigr{)}. ∎
We consider now only the supercritical case (A5).
Lemma 9.4**.**
Assume (A1)–(A2) and (A5). Then there exist , and such that, for ,
[TABLE]
Proof.
Consider the exploration process, starting at a random vertex and restarting at a new random vertex when a component is completely explored. Let be the number of unpaired half-edges in the explored part when pairings have been made. In particular, .
By (A5), , and thus there exists such that
[TABLE]
Let . Consider until further notice only that are so large that for , and also , see (A1) and (2.5).
Let be such that . During the first steps of the exploration process, there is always at least unused vertices of degree , and thus the probability that the next pairing is with a half-edge at an unused vertex of degree is at least, noting that ,
[TABLE]
Let be independent copies of a random variables with the distribution
[TABLE]
and define .
By (9.17), we can couple the exploration process with the variables such that for every , , and thus, by induction, .
Suppose now that the component has exactly edges. Then , and hence . However, (9.18) implies that
[TABLE]
and thus . Furthermore, is bounded, and thus for every real . Hence, , and thus there exist such that . Consequently, using a Chernoff bound,
[TABLE]
This proves (9.14), with , when is large enough. The result (9.14) extends to all , since it is trivial for small if is large enough.
We turn to (9.15). Let be as in Lemma 9.3. Then, any component with has at most edges, and at least . Consequently, by (9.14), if ,
[TABLE]
Hence (9.15) too holds, if we redefine and . ∎
We have a similar estimate for the supercritical branching process .
Lemma 9.5**.**
Assume (A1)–(A2) and (A5). Then, for the branching process in Section 2.3,
[TABLE]
Proof.
This follows by standard branching process theory.
Alternatively, by the coupling in Section 2.3, or by (9.3) with , we have for every fixed , \operatorname{\mathbb{P}{}}\bigl{(}|\mathcal{C}(V)|=k\bigr{)}\to\operatorname{\mathbb{P}{}}\bigl{(}|{\mathcal{T}}|=k\bigr{)}, and thus (9.22) follows from (9.15). ∎
Proof of Theorem 3.17.
The proofs of the two parts are essentially identical, and we give the details only for 3.17.
3.17: Write .
We truncate. Define, for ,
[TABLE]
and consider the corresponding and .
Let be a fixed small number, chosen so small that Lemma 9.4 holds, as well as the following argument.
If , then either , and thus , or and thus by Lemma 9.3. If is small enough, then both events have probability O\bigl{(}e^{-cn}\bigr{)} by [13, Theorem 2]. Since (3.18) implies \Psi^{\dagger}(G^{*}_{n}),\Psi_{\leqslant\varepsilon n}(G^{*}_{n})=O\bigl{(}n^{m+1}\bigr{)}, we thus have
[TABLE]
Hence, it suffices to prove the results for .
For every , (3.18) and Lemma 9.4 yield
[TABLE]
and thus by (9.5) and Lemma 9.2 (with )
[TABLE]
Let, for ,
[TABLE]
Then, assuming ,
[TABLE]
and thus, using (9.25), (9.28) and Minkowski’s inequality,
[TABLE]
Furthermore, for every fixed , is a functional with finite support, and (3.16) yields a finite linear combination
[TABLE]
Terms where has a cycle have variance , by Theorem 3.2 or directly by Lemma 7.7. Hence, they can be ignored, and Theorem 3.2 (with Remark 3.5) yields, by (i) and joint convergence for different trees,
[TABLE]
where
[TABLE]
summing over pairs of trees of order less than .
We will verify below that the sums in (3.17) converge absolutely. Thus, as , defined in (3.17). The estimate (9), which is uniform in , implies
[TABLE]
This together with the limit (9.34) for each fixed and imply, see e.g. [10, Theorem 4.2] or [29, Theorem 4.28], X_{n}\overset{\mathrm{d}}{\longrightarrow}N\bigl{(}0,\sigma^{2}_{\psi}\bigr{)}, which is (3.21).
Furthermore, Theorem 3.2 shows also that for every fixed . Hence, Minkowski’s inequality and (9) imply, for every fixed ,
[TABLE]
and thus,
[TABLE]
Take in (9.38) and then let . This yields
[TABLE]
and thus , which is equivalent to (3.20).
Similarly, if , then (9.25) and (9.27) yield
[TABLE]
Moreover, by Lemma 9.1, and as by (2.11), noting that by (3.18) and Lemma 9.5. Hence, for any fixed , (9) implies
[TABLE]
Taking and then letting yields (3.19), similarly to (9.39).
Finally, to verify absolute convergence of the sums in (3.17), suppose temporarily that , so that we may interchange order in the summations freely. Then, considering the terms in (3.7) separately and using (3.5) and (2.11),
[TABLE]
The expectations in (9.42) and (9.43) are finite by (3.18) and Lemma 9.5, and so is each expectation in (9.44). If also the sum in (9.44) converges, then (3.7) and (9.42)–(9.44) show the last equality in (3.17). In particular, if (9.44) is finite, then (3.17) yields, since ,
[TABLE]
For any , this applies to the truncation for any , since we have for every when , and thus the sum over in (9.44) converges. Hence (9.45) holds for , and letting shows (by monotone convergence) that (9.45) holds for too. We have already seen that the right-hand side in (9.45) is finite, and thus the sum on the left converges, for any satisfying (3.18).
Consequently, all sums in (9.42)–(9.44) converge when . Hence, applying this to , we see that for every satisfying (3.18), all sums in (9.42)–(9.44) converge absolutely; hence the equalities in (9.42)–(9.44) hold in general, which verifies (3.17), with absolute convergence everywhere.
3.17: The main difference is that we use Theorem 3.9 instead of Theorem 3.2. Since we now assume (A3), (2.8) holds, and thus all estimates of means and variances in the proof of 3.17 hold automatically for too by conditioning. ∎
Proof of Theorem 3.16.
As said in Example 3.20, we apply Theorem 3.17 with , together with Lemmas 10.5 and 11.1 below. ∎
10. Non-zero variance?
The asymptotic variance in Theorem 3.17 necessarily satifies ; however, is possible. We note first some trivial cases.
Example 10.1**.**
- (a)
If (A6) does not hold, so (in the supercritical case) , then for every , see Remark 3.19. 2. (b)
If , then . In fact, then counts isolated vertices, and thus is deterministic. 3. (c)
If for every tree with , then a.s. and thus . In this case, for all but a few components .
Parts (b) and (c) of Example 10.1 show that the values of for , trees with , and non-trees , do not affect .
We conjecture that the trivial cases in Example 10.1 (in combination) is the only way to get (in the supercritical case). Formally:
Conjecture 10.2**.**
If (A1)–(A3) and (A5)–(A6) hold, then
[TABLE]
Remark 10.3**.**
As said in Section 3.3, we consider for simplicity only the supercritical case in Theorem 3.17, although we expect a similar result (for ) also in the subcritical case under suitable conditions. However, note that then there are two further trivial cases with , viz. and as in Example 3.20 (and linear combinations of them), since then and , respectively, which are deterministic for .
Although we have not been able to verify Conjecture 10.2, we can show in many cases.
Lemma 10.4**.**
Assume (A1)–(A2) and (A5)–(A6). Suppose that the graph functional satisfies (3.18) and that . Then has the form, for some real constants and every tree with ,
[TABLE]
where, furthermore, for every ,
[TABLE]
Note that (10.4) is an eigenvalue equation for the vector .
Proof.
Let be the linear space of all graph functionals satisfying (3.18). The right-hand side of (3.17) is a quadratic form in . Denote the corresponding symmetric bilinear form by ; thus and
[TABLE]
Since (3.20) implies , the bilinear form (10) is positive semidefinite, and thus the Cauchy–Schwarz inequality holds for it. In particular, if , then for every graph functional . Hence, taking for a tree ,
[TABLE]
Thus, for every tree such that , using ,
[TABLE]
which yields (10.2)–(10.3); then (10.4) follows by substituting (10.2) in (10.3). ∎
Lemma 10.5**.**
Assume (A1)–(A2) and (A5)–(A6). Let . Then .
Proof.
Suppose that ; thus (10.2)–(10.4) hold by Lemma 10.4. Furthermore, (10.2) trivially holds with , and since the coefficients in (10.2) are uniquely determined for every with , (10.3) yields
[TABLE]
Summing over yields
[TABLE]
This is a contradiction. ∎
Remark 10.6**.**
As said in Example 3.20 (and more generally in Conjecture 10.2), we conjecture that also for , and for (non-zero) linear combinations of these two graph functionals, but we have failed to show this in general, and leave this as an open problem.
Frank Ball (personal communication) has noted that in the case of bounded maximum degree treated in his paper [4] (on epidemics, but with the giant component as a special case), his formulas [4, (5.25)–(5.27)] express the asymptotic variance as a sum of integrals of squares, making it easy to show that the asymptotic variance is strictly positive in these cases. It seems likely that this formula for the asymptotic variance holds more generally (perhaps by purely algebraic manipulations), which might lead to a general proof, but we have not checked the details.
Example 10.7**.**
Let . (A6) shows both and the existence of with . There exist arbitrarily large trees with all degrees in , and they have . However, (10.2) cannot hold for all such trees, since the right-hand side is bounded for them by , while is unbounded. Hence Lemma 10.4 shows that , as claimed in Example 3.21.
11. The variance of the giant
Lemma 11.1**.**
If , then the variance in (3.17) is given by (3.16).
Proof.
Let be the Galton–Watson tree with offspring distribution , with given by (2.9). Recall that has a root with a random number of copies of attached to it. The probability generating function of is given by, using (2.9) and (2.12),
[TABLE]
The probability that the supercritical Galton–Watson process is finite is the unique root of , which by (11.1) is equivalent to (2.13).
Let be conditioned on being finite. Then, see e.g. [2, Theorem 1.12.3], is another Galton–Watson process, which is subcritical and has an offspring distribution with probability generating function, using (11.1),
[TABLE]
In particular,
[TABLE]
Since is subcritical, thus .
Let be the number of vertices in with outdegree . Then, by a standard calculation for subcritical Galton–Watson trees, summing the expected number of such vertices in generation , and using (11.2)–(11.3) and (2.9),
[TABLE]
If the root of has degree , then the tree is finite with probability , and conditioned on this event, it has branches that are copies of . Hence, using (11) and (2.13),
[TABLE]
Summing over we find
[TABLE]
Hence, (3.17) with yields
[TABLE]
which yields (3.16) by expanding the square and summing, using (2.12). ∎
12. Random degrees
One reason for the importance of the model is that for several models of random graphs, if we condition on the degree sequence, then we obtain a graph of the type , for the observed degree sequence . This includes the Erdős–Rényi graphs and , and several others, see e.g. [15]. Such random graphs can thus be regarded as based on a random degree sequence . We obtain easily results for such graphs too, by conditioning on . Note that, as pointed out e.g. by Ball and Neal [5], the randomness in the degree sequence will in general affect the asymptotic variance. We illustrate this by considering in some detail the counts of small isolated trees in the basic theorem Theorem 3.9. Similar versions of e.g. Theorems 3.16 and 3.17 follow similarly; see [5] for the variance of the size of the giant component in Theorem 3.16.
Theorem 12.1**.**
Let be a probability distribution satisfying (A6). Suppose that is random, and such that, with ,
[TABLE]
jointly, where are jointly normal with and some covariances with , and furthermore that
[TABLE]
Then, for any tree ,
[TABLE]
where is as in (3.3); furthermore, joint convergence holds for several trees , with limit , where the covariance matrix is given by
[TABLE]
Proof.
For convenience, we use the Skorohod coupling theorem [29, Theorem 4.30], and may thus assume that the limits in (12.1)–(12.3) hold a.s. Then (A1)–(A3) hold a.s., and thus Theorem 3.9 applies, conditioned on the degree sequence. Hence, conditionally,
[TABLE]
Consider a tree and let . If , then, a.s., using (8.4) and remembering that and now are random, together with the a.s. versions of (12.1) and (12.2),
[TABLE]
and thus
[TABLE]
On the other hand, if , then for some with . Since , (12.1) implies a.s., and thus a.s., so a.s. Hence, (8.4) implies \operatorname{\mathbb{E}{}}\bigl{(}\overline{Z}^{\mathsf{u}*}_{H}\mid\mathbf{d}_{n}\bigr{)}=o\bigl{(}n^{1/2}\bigr{)} a.s., and thus (12.8) holds in this case too, with .
Since (12.6) holds, with the same limit, conditioned on , the limits (12.6) and (12.8) hold jointly, with independent limits. Hence, we can take their sum and obtain (12.4) with \widehat{\sigma}^{2}_{H_{1},H_{2}}:=\sigma^{2}_{H_{1},H_{2}}+\operatorname{Cov}\bigl{(}\Xi_{H_{1}},\Xi_{H_{2}}\bigr{)}, i.e., (12.5). ∎
Example 12.2**.**
One case studied also by Ball and Neal [5] is when the degrees are i.i.d. random variables that are copies of a given , which we assume satisfies . (We ignore one half-edge if the sum otherwise becomes odd.) Then, by the central limit theorem and the law of large numbers, (12.1)–(12.3) hold with
[TABLE]
Hence, Theorem 12.1 shows that (12.4) holds. Furthermore, the double sum in (12.5) is by (12.9)
[TABLE]
By simple algebra, using and , the first sum in (12.10) equals
[TABLE]
and the second and third are just
[TABLE]
Hence, (12.5) and (3.7) yield, after some interesting cancellations,
[TABLE]
Example 12.3**.**
Consider the Erdős–Rényi graphs and , where we keep the average degree constant by choosing and for some fixed . The number of isolated trees of a given size in was studied already by Erdős and Rényi [16], but in the range of considered here, their result contains an error as was pointed out by Barbour [6], who proved asymptotic normality for using Stein’s method. The result was extended by Barbour, Karoński and Ruciński [7] to counts of isolated copied of individual trees, our ; the method yields also joint convergence. (We are not aware of any similar result on asymptotic normality proved for , but such results might be in the literature.) Although this result thus can be proved directly in a rather simple way, at least for , we find it instructive to see how it follows, for both models, from the general results in the present paper.
Asymptotic normality of was shown for by [7], see also [27, Example 6.35]. This was extended to in [20, Theorem 4.1], which implies (as a consequence of [20, (4.2)]) that (12.1)–(12.3) hold for both and (with and as above), with , the Poisson distribution, and asymptotic covariances
[TABLE]
Comparing with Example 12.2, we see that has an additional term, with different signs in the two cases. To calculate the contribution from that term to (12.5), we calculate, recalling so ,
[TABLE]
Define for and for . Then, using (12.14) in (12.5) yields, using (12.13) and (12.15) in the calculations,
[TABLE]
Thus, Theorem 12.1 yields asymptotic normality of the counts of isolated trees, in both and , with asymptotic covariances (12.16).
Remark 12.4**.**
The model in Example 12.2 with thus gives a result with a covariance matrix (12.13) that is half-way between the results for and . As has been remarked before, the same is seen in the much simpler (but related) case of the number of edges; elementary calculations yield for the model in Example 12.2, for , and of course for .
Acknowledgement
I thank Frank Ball for helpful correspondence.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Angel, van der Hofstad and Holmgren [2018+] Omer Angel, Remco van der Hofstad & Cecilia Holmgren: Limit laws for self-loops and multiple edges in the configuration model. Preprint, 2016. ar Xiv:1603.07172
- 2Athreya and Ney [1972] Krishna B. Athreya & Peter E. Ney: Branching Processes . Springer-Verlag, Berlin, 1972. MR 0373040
- 3Athreya and Yogeshwaran [2018+] Siva Athreya & D. Yogeshwaran: Central limit theorem for statistics of subcritical configuration models. Preprint, 2018. ar Xiv:1808.06778
- 4Ball [2018] Frank Ball: Central limit theorems for SIR epidemics and percolation on configuration model random graphs. Preprint, 2018. ar Xiv:1812.03105
- 5Ball and Neal [2017] Frank Ball & Peter Neal: The asymptotic variance of the giant component of configuration model random graphs. Ann. Appl. Probab. 27 (2017), no. 2, 1057–1092. MR 3655861
- 6Barbour [1982] A. D. Barbour: Poisson convergence and random graphs. Math. Proc. Cambridge Philos. Soc. 92 (1982), no. 2, 349–359. MR 0671189
- 7Barbour, Karoński and Ruciński [1989] A. D. Barbour, Michał Karoński & Andrzej Ruciński: A central limit theorem for decomposable random variables with applications to random graphs. J. Combin. Theory Ser. B 47 (1989), no. 2, 125–145. MR 1047781
- 8Barbour and Röllin [2017+] A. D. Barbour & Adrian Röllin: Central limit theorems in the configuration model. Preprint, 2017. ar Xiv:1710.02644 v 1
