A large-deviations principle for all the cluster sizes of a sparse Erd\H{o}s-R\'enyi graph
Luisa Andreis, Wolfgang K\"onig, Robert I. A. Patterson

TL;DR
This paper establishes a large-deviations principle for the distribution of all component sizes in a sparse Erdős-Rényi graph, capturing phase transition phenomena and linking to coagulation models.
Contribution
It provides an explicit rate function describing microscopic, mesoscopic, and macroscopic component sizes, including the phase transition at t=1.
Findings
Explicit large-deviations rate function for component sizes.
Captures phase transition at t=1.
Links to coagulation models and gelation phenomena.
Abstract
Let be the Erd\H{o}s-R\'enyi graph with connection probability as for a fixed . We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of , registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order ), and mesoscopic ones (everything in between). The rate function explicitly describes the microscopic and macroscopic components and the fraction of vertices in components of mesoscopic sizes. Moreover, it clearly captures the well known phase transition at as part of a comprehensive picture. The proofs rely on elementary combinatorics and on known estimates and asymptotics for the probability that subgraphs are connected. We also draw conclusions for the strongly related model of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A large-deviations principle
for all the cluster sizes
of a sparse Erdős-Rényi graph
Luisa Andreis
WIAS, Mohrenstraße 39, 10117 Berlin
,
Wolfgang König
TU Berlin and WIAS, Mohrenstraße 39, 10117 Berlin
and
Robert I. A. Patterson
WIAS, Mohrenstraße 39, 10117 Berlin
Abstract.
Let be the Erdős-Rényi graph with connection probability as for a fixed . We derive a large-deviations principle for the empirical measure of the sizes of all the connected components of , registered according to microscopic sizes (i.e., of finite order), macroscopic ones (i.e., of order ), and mesoscopic ones (everything in between). The rate function explicitly describes the microscopic and macroscopic components and the fraction of vertices in components of mesoscopic sizes. Moreover, it clearly captures the well known phase transition at as part of a comprehensive picture. The proofs rely on elementary combinatorics and on known estimates and asymptotics for the probability that subgraphs are connected. We also draw conclusions for the strongly related model of the multiplicative coalescent, the Marcus–Lushnikov coagulation model with monodisperse initial condition, and its gelation phase transition.
(6 January, 2021)
MSC 2020: 05C80, 60F10, 60K35, 82B26.
Keywords and phrases. Erdős-Rényi random graph, component sizes, large deviations, empirical measure, phase transition, sizes, multiplicative coalescent, gelation.
1. Introduction
In this paper, we study the Erdős-Rényi random graph , that is, the random graph on the vertex set , where each two distinct vertices are independently connected with probability . We will be working in the sparse regime, i.e., we assume that for some fixed . This is the regime in which the famous phase transition of the emergence of a giant cluster at occurs, which was detected and characterised for the first time in the seminal paper [ER60]. For an extensive overview on the model see the classical reference [Bol01].
Our new contribution in this paper is a comprehensive study of the family of the sizes of all the connected components of , registered according to the asymptotic order of the size in the limit as . We distinguish here microscopic components (i.e., with size of order one), macroscopic components (i.e., size of order , usually referred to as giant components) and mesoscopic ones (everything in between). We summarize all this information in terms of two empirical measures and derive a large-deviation principle (LDP) for them. Our rate function is rather explicit.
Such a principle gives information about the exponential decay rate of all sorts of events, e.g, the emergence of more than one giant cluster or the presence of a non-trivial proportion of vertices in mesoscopic components. Moreover, the minimizers of the rate function represent the most likely configurations of the graph, which is expressed in terms of a law of large numbers for the objects that satisfy the LDP. In this way, we recover the mentioned phase transition and collect detailed information about the statistics of the sizes of all the components, both in the subcritical regime (where no giant component occurs) and the supercritical one.
Many investigations of the Erdős-Rényi graph and other random graphs in the sparse regime rely on approximations of subgraphs with certain Galton–Watson trees and other branching processes. We would like to stress that our approach does not use such arguments and is therefore an alternate ansatz.
In Section 1.1, we introduce our approach, in Section 1.2, we formulate our main results about large deviations and in Section 1.3 their consequences for the phase transition, and in Section 1.4 we give a literature survey.
Our original interest in this study was triggered by a desire to understand random particle processes with coagulation, in particular its simplest variant, the Marcus–Lushnikov model with multiplicative coagulation kernel. We introduce this process and its connections with our work on the LDP for the Erdős-Rényi graph in Section 1.5.
Another highly interesting connection appears with a LDP-proof for the well-known Bose–Einstein condensation phase transition that appears in the free (i.e., non-interacting) Bose gas; we will explain the similarities and the differences in Section 1.6.
1.1. Micro- and macroscopic empirical measures
Let us introduce the main objects that we study in this paper. For the remainder of the paper, we fix and will be working with the graph , where as . By
[TABLE]
we denote the sizes of all the connected components of , ordered in a decreasing way ( is the number of components). We want to describe the entire family in the limit . This is a comprehensive object, which contains several scales. In oder to adequately describe the most important two scales, it will be convenient to work with two empirical measures of the component sizes in the microscopic and macroscopic size ranges:
[TABLE]
Intuitively, while registers the proportion of components of ‘microscopic’ sizes on the scale , registers the components of ‘macroscopic’ sizes, i.e. order . Note that each of the two measures admits a one-to-one map onto the vector for fixed and therefore contains all the information contained in the vector. However, in the limit , they will be able to describe only the statistics of the microscopic, respectively macroscopic, part of the particle configuration. We would like to stress here that this issue lies at the heart of the phase transition of the emergence of a giant component, i.e., a macroscopic size.
Here is a non-technical, intuitive explanation: in the limit as , all sizes that somehow diverge, will vanish from the support of ‘at infinity’, and all sizes that are will vanish from the support of ‘at zero’. Hence, may leak out mass at infinity, and at zero. It is by no means automatic that all the mass that leaks out from the microscopic part at infinity enters the macroscopic part at zero. In order to control that, we also need to take care of the mesoscopic mass, coming from particle masses . Since here a lot of scales are contained (indeed, a continuum of scales), we will not be able to say anything about these sizes, but only about the total proportion of vertices belonging to such components.
The famous phase transition (proved first in [ER60]) says that, for , there is no loss of mass from (i.e., the first moment stays equal to one in the limit), and convergesd to the zero measure (i.e., loses all its mass), while for , the total mass of loses a positive amount equal to that retained by in the limit, and this results in a single Dirac measure. In both cases, the mesoscopic part vanishes, even though mesoscopic components are present in the graph with high probability, but their proportion is negligible.
These are assertions of the type of laws of large numbers. However, in the setting of a large-deviation principle as we are working here, we will obtain significant results also about the probabilities of several very unlikely events, like the emergence of non-trivial mesoscopic total mass, of more than one giant cluster and of different statistics of microscopic component sizes. Note that we decided to work on probabilities that are on an exponential scale and for connection probabilities of the form for general . This excludes for example all (highly interesting) phenomena that occur with respect to cluster sizes of order when considering more specified connection probabilities of the size ; see [Ald97].
Now let us give a more technical explanation of the issue about possible losses of masses, which will also set the frame for the mathematical treatment. We will conceive the discrete measure as a random element of the sequence set , where
[TABLE]
We equip with the topology of coordinate-wise convergence, which makes it compact by the Bolzano–Weierstrass theorem combined with Fatou’s lemma.
The point measure is a random element of the set , where
[TABLE]
and is the set of all measures on with values in . We equip with the topology that is induced by functionals of the form where is continuous and compactly supported. We sometimes write the elements of as with and , where extends over a finite subset of or over . Then convergence is equivalent with the pointwise convergence of each of the atoms. By similar arguments as for , also is compact. We equip the product of and with the product topology, so that it is also compact.
Important quantities are the expectations of the sub-probability distributions respectively , i.e., the maps
[TABLE]
Note that they are not continuous in the respective topologies, but only lower semicontinuous, according to Fatou’s lemma. Indeed, even though the microscopic and macroscopic expectations and are each equal to one for any , they may (and will) lose mass in the limit . We sometimes call and the total masses of the microscopic, respectively macroscopic, configuration and , since they stand for the total number of particles, after scaling.
The mathematical treatment of the mesoscopic part of the component sizes is more technical, as it requires the introduction of two cutting parameters and . Indeed, a size is called -mesoscopic if , and the definition of mesoscopic sizes requires making the limit , followed by and . There are several scales (indeed, a continuum of scales) contained in this part and in this regime it does not seem reasonable to consider an empirical measure for this part; therefore we will consider only the total proportion of mesoscopic vertices.
Let us remark that our choice of considering exclusively the size of each component, disregarding its bond structure, comes from the interest in coagulation processes, where only the sizes matter; see Section 1.5. An extension of our work to empirical measures of the components seems to require only moderate additional work, at least as it concerns the microscopic part. See Section 1.4 for earlier LDP-investigation of the components as subgraphs.
1.2. Our results: large-deviations principles
In this section, we present all our results on the LDP satisfied by the empirical measure of statistics of component sizes of the Erdős–Rényi graph , the random graph on with connection probability , and we assume that with fixed . In Section 1.3 we will draw conclusions about the phase transition from that. Our main result is the following description of the two empirical measures and in terms of a joint large-deviations principle (LDP).
Theorem 1.1** (LDP for the empirical measures).**
As , the pair satisfies a large-deviations principle with speed and rate function
[TABLE]
where we write and
[TABLE]
The proof of this theorem is in Section 3; it is based on an explicit combinatorial formula for the joint distribution of all the component sizes, followed by analysis of the arising exponential rates. We organised the three terms of in the way in which they were derived from the influences of the three parts (micro, macro and meso) in the course of the proof, even though this leads to a cancellation of terms involving and . This implies also that separate conclusions about the microscopic and the macroscopic parts can conveniently be made (see Corollaries 1.2 and 1.3). Informally, in (1.5) the terms involving , and in the logarithm derive from the combinatorial number of possibilities to decompose into the requested configuration of subsets, the term and the in the logarithm stem from the probability that these subsets are connected, and all the other terms from the probability that any of these subsets is not connected with the remainder. This interpretation is not immediate, since a number of asymptotic manipulations have been made during the proof. Similar remarks apply to (1.6). Interestingly, on the right-hand side of (1.5) we see, up to normalization, a relative entropy of with respect to the Borel distribution for a particular choice of ; a fact that will be crucial in the analysis of minimizers of the rate function, see the proofs of Corollaries 1.2 and 1.3 and of Theorem 1.5.
Let us recall the notion of an LDP: Theorem 1.1 says that, for any open set respectively closed set ,
[TABLE]
where we wrote for the probability measure for . For a comprehensive presentation of the theory of large-deviations, see e.g. [DZ10]. It is not difficult to see that since the rate function is lower semicontinuous and is compact, it is even a good rate function, i.e., its level sets are compact for any .
It is well-known in the theory of large deviations (and easy to deduce from the LDP) that for many interesting sets one also has that , for example for sets that are equal to the closure of their open kernel. There are choices of such sets that give the precise exponential rates of interesting events, for instance the event that there are a given number of components larger than , for some , or that a given component size appears with a certain least density, or that a given positive percentage of vertices are contained in components of a given range of sizes (e.g., in or in or in ), and certainly all kinds of combinations of such events.
From our main result, the LDP in Theorem 1.1, a number of other LDPs follow via the contraction principle, according which if a random variable satisfies an LDP, so does its image under a continuous transformation; see [DZ10]. Let us begin with the component size distribution of the microscopic part.
Corollary 1.2** (LDP for microscopic component size statistics).**
As , satisfies an LDP with speed and rate function , given by
[TABLE]
The first equality comes from the application of the contraction principle; while the second equality is purely analytical and it is checked in Lemma 4.1. There it is seen that, given any , it is always optimal to have all the remaining mass in one single macroscopic component.
In the same way one can investigate the macroscopic part of the system.
Corollary 1.3** (LDP for macroscopic particles).**
As , satisfies an LDP with speed and rate function , given by
[TABLE]
where (recall ).
Again, only the second equality has to be checked; this is done in Lemma 4.2. In contrast with the result above, here the optimal configuration depends on , most heavily it depends on whether or not. Indeed, if , then (and no mesoscopic part arises). However, if , then , and a non-trivial mesoscopic mass arises in the minimization; see Theorem 1.5. This peculiarity shows already a key difference between the cases and . Indeed, if one cannot have any macroscopic mass distribution such that and no difference in the minimizing strategy of the system can be seen. This is a first way to see the phase transition at from analytic properties of the rate function.
Now we come to the mesoscopic part of the particle configuration. This part comprises particle sizes on all the scales between finite and and it seems unreasonable to consider an empirical measure for it. Instead, we consider only the total mass of this mesoscopic part. Let and be two auxiliary parameters, then we define the -mesoscopic total mass as
[TABLE]
This is the number of vertices that are contained in components with a size between and . The mesoscopic total mass in a strict sense arises after taking the limits , followed by and , but this does not define a proper random variable. However, it is possible to formulate an LDP in the limit and then to study the rate function, , as and . Additionally, the proof of Theorem 1.1 shows that it is possible to define a coupled mesoscopic total mass , for any diverging sequence and vanishing sequence . This is a well-defined random variable, it satisfies an LDP with speed and the rate function is the limit of when and .
Corollary 1.4** (LDP for mesoscopic mass).**
- (1)
For any and , as , satisfies an LDP with speed and rate function , where
[TABLE] 2. (2)
For any and such that , and , the coupled mesoscopic total mass satisfies an LDP with speed and rate function
[TABLE]
The function is strictly increasing in , its minimum over is .
Corollary 1.4 part (1) is a simple consequence of the contraction principle, as the maps and are continuous. Assertion (2) follows as a byproduct of our proof of Theorem 1.1 in Section 3.
Hence, can rightfully be called the rate function for the mesoscopic total mass. Since it is positive everywhere outside [math], we have the immediate consequence that the probability that any positive percentage of the vertices lies in mesoscopic components decays exponentially towards zero. This implies the convergence in probability of towards zero with exponential decay of the probability of a decay by any positive amount. Interestingly, taking , we see that already just one mesoscopic size alone satisfies the same LDP as the entire -mesoscopic total mass in the limit , . The condition is not only a technical one, but implies that , taking care of the well-known fact that there are many clusters of size in the sparse Erdős-Rényi random graph that stem from an extreme-value statistics effect of the microscopic clusters.
1.3. Our results: the phase transition in the light of the LDP
We now proceed with the study of the main phenomenon in the sparse Erdős-Rényi random graph: the phase transition of the emergence of a giant component. We will deduce it from our large-deviations rate functions from Section 1.2. The LDPs and the identification of their strict minimiser(s) lead to laws of large numbers for a number of random quantities. Indeed, it is a standard and simple fact from large-deviations theory that a random variable that satisfies an LDP with a rate function that contains precisely one minimizer converges in probability to that minimizer. We will exploit this fact to deduce laws of large numbers. As before, the parameter will play the decisive role; recall that the connection probability of the graph was picked as as .
Consider the following functions of the total masses of the microscopic and macroscopic particles respectively:
[TABLE]
where . These two functions are not entirely analogous to as rate functions for the total masses of the micro and the macro part, because the total masses both of and are equal to one. This is consistent with the fact that the contraction principle cannot be applied to total masses, as they are not continuous functions of the measures. However, they contain rather interesting information about the phase transition.
Theorem 1.5** (Microscopic total mass phase transition).**
- (1)
For any ,
[TABLE]
Moreover, . 2. (2)
For , the minimum of is attained precisely at given by
[TABLE]
and the minimum of the function is attained precisely at with value . Therefore the infimum
[TABLE]
is attained at , where . 3. (3)
For , the minimum of the function is attained at where is the smallest positive solution to
[TABLE]
The infimum in (1.14) is attained precisely at .
The proof is found in Section 4.2.
The two different cases in (1.12) refer to the cases that the first minimum in (1.11) is attained or not. Indeed, for , the function is minimized in an optimal with . However, for this is not possible, but only minimizing sequences can be found that achieve a total mass of in the microscopic measure and displace the remaining mass to the mesoscopic part. This shows that the phase transition originates from the impossibility of picking an optimal microscopic configuration if its total mass is required to be too large; the threshold being . If this is exceeded, then a minimization can be done only with the help of some non-trivial mesoscopic part. As we mentioned above, the macroscopic configuration is always minimized in one single giant component.
The same effect is seen in , where first an optimization over with is performed, and such a balance between microscopic and mesoscopic mass can pop out if is large enough. Subsequently, optimizing over is straightforward. The equality follows from this.
In Theorem 1.5(2) and (3) we see the different behaviour for subcritical, respectively supercritical in terms of the microscopic configuration. Note that this configuration is actually given by
[TABLE]
where is the Borel distribution with parameter . We see that such an optimal cannot be found if , and this is an admissible total mass only when , marking the threshold between subcritical and supercritical regime (otherwise, there is no relevant case distinction as to the value of ). In this way, the Borel distribution appears as the natural minimizer of the microscopic part of the rate function.
In earlier work (see [Pit90]), the appearance of the Borel distribution in this context came from the observation that is the distribution of the total progeny of a Galton–Watson tree with offspring that is Poisson-distributed with parameter . The characterisation of the emerging cluster-size distribution was based on an approximation of the connected subgraphs by such trees and counting the total number of trees of a given size in the graph. This approximation argument was extended to a large-deviation setting in [BC15], see Section 1.4.
Theorem 1.5 characterises the well-known phase transition of the ermergence of a giant component at in terms of a natural notion that is familiar to statistical physics: as a non-analyticity of the limiting free energy for the total mass of the microscopic configuration, which is equal to the infimum of . Indeed, this function is zero in , but positive in .
Another characterisation of this phase transition is in terms of a law of large numbers. Indeed, combining Theorem 1.5 with the LDP in Theorem 1.1 one has
[TABLE]
In words, this means that, for any , times the number of components of size converges to in the sub-critical regime and to in the supercritical regime, while there is no macroscopic component in the first regime and there is precisely one macroscopic cluster of cardinality in the second. All these statements are in the sense of convergence in probability, and the probability of a deviation by any positive amount decays even exponentially in .
One also sees that the cut-off versions of the total masses, and , converge towards the respective cut-off versions of the limits, and their limits as and are for and for .
1.4. Related works on LDPs for Erdős-Rényi graphs
Despite the extensive literature on the Erdős-Rényi graph, there are not many results about large deviations in the sparse regime. Here we summarize, to the best of our knowledge, the existing results and how they relate to our work.
Two LDPs for the size of the largest component and for the number of isolated vertices have been derived in [O’C98]. These are two quantities that are obviously functionals of our measures , respectively of . Indeed, the largest component is the mass of the largest atom of , and the number of isolated vertices is equal to times . Both these two functionals are continuous, such that the contraction principle applies. The approach of [O’C98] is a simplified version of our comprehensive approach for the joint distribution of all the component sizes, and consequently it leads to formulas for the rate functions that are contractions of our rate function, which is straightforward to see. This explains also the remark made about the lack of convexity of the rate function in [O’C98]. Indeed, contraction often ruins convexity and contracted rate functions are rarely convex. Hence, our work includes the results of [O’C98].
A route that is inspired by statistical physics is taken in [EMH04], where the distribution of the random graph is tilted with a parameter raised to the power of the number of components, properly normalized. The analysis of the free energy of the corresponding partition sum is carried out there. Via the well-known Laplace dualism, the results is essentially equivalent to an LDP for the number of components. This functional is equal to the continuous functional in our setting. The pecularity of [EMH04] is that this model is put into relation with the -state Potts model in the limit via diagrammatic expansion techniques. In particular, they derive limiting formulas for the size of the giant component, the degree distributions inside and outside the giant component, and the distribution of small component sizes.
We already mentioned that registering each component as a subgraph (rather than only as its size) would give a priori a much more detailed description, at least for any fixed . However, in the limit as , in the LDP regime that we consider, only few subgraph configurations survive: the microscopic components survive only as spanning trees, and only those macroscopic components survive that have an excess of edges of order . The first has been carried out in [BC15], the second in [Puh05].
Indeed, the macroscopic part of our LDP is covered in [Puh05]. The author gives an LDP for the joint distribution of the total number of components, the sequence of the sizes of macroscopic ones, and the sequence of corresponding numbers of the excess edges appended with zeros. Therefore the contraction of this LDP to the total number of components and the macroscopic sizes (see [Puh05, Corollary 2.1]) is equal to the contraction of our LDP from Theorem 1.1 to the LDP for . The same is of course true, when considering exclusively macroscopic sizes (compare [Puh05, Corollary 2.2] with Corollary 1.3). The approach in [Puh05] goes along a very different route, involving recursive formulas for the graphs if increases, and consequently the form of the rate function derived there is pretty different from ours; it involves an additional minimization procedure. It would require some work to analytically check that it is identical to ours.
In [BC15], an LDP for the empirical measure of all the components rooted at the vertices, is derived with an explicit rate function. The topology used there comes from a distance that looks only at intersections of graphs with bounded sets, so it can detect only microscopic components. In this way it is contained in our results. However, [BC15] considers the components as graphs, not only as sizes, and gets therefore a much more detailed picture. Moreover, the object described in [BC15] is a size-biased version of our microscopic measure, since we are counting components of a certain size, while they consider the component containing each vertex and therefore counting a certain component proportionally to the number of vertices it contains. Hence, the LDP of [BC15] contains the microscopic part of our LDP (Corollary 1.2) via the contraction principle, but a certain normalization has to be performed to actually compare the two objects. However, let us notice that [BC15, Theorem 1.8] shows that the rate function takes the form of a sort of relative entropy with respect to a Galton Watson tree with Poisson offspring distribution (plus additional constants). This form is shown also in our contracted rate function from Corollary 1.2, which in (4.2) we rewrite in terms of a relative entropy with respect to a distribution related to the Borel distribution (which is the distribution of the total progeny of precisely a Galton Watson tree with Poisson offspring). Also in this case, one sees that the Borel distribution appears as a size-biased version of our reference distribution.
Under assumptions that imply that the connection probability of the Erdős-Rényi graph satisfies , recent progress has been made on the upper tails of sub-graph counts [CD16, Aug18, CD18].
In the case of dense graphs, that is, for with fixed , there is a complete treatment thanks to Chatterjee and Varadhan [CV11], see [Cha16] for an overview. This regime is rather different from the sparse regime, since a proper formulation of the relevant limiting objects requires a abstract setting evolving around the notion of a graphon.
1.5. Application to coagulation models
Our interest in this research came from the desire to understand dynamical particle systems with coagulation in the large-system limit. It turned out that one of the most prominent (and most simple) models, the Marcus–Lushnikov model of coagulation, see [Mar68, Gil72, Lus78], admits a one-to-one correspondence to the component sizes of the Erdős-Rényi random graph that we study in this paper. This coagulation process is a continuous-time Markov process of vectors of particle masses at time , arranged in descending order, precisely as in (1.1), where is the number of particles at time . This process is specified by the initial configuration, which we take in the monodisperse case, i.e., for all , and by the transition mechanism, which is given in terms of a symmetric, non-negative coagulation kernel . That is, we start with particles of unit mass at time 0, and in the course of the process, each (unordered) pair of particles with respective masses coagulate to a particle of mass with rate , independently of all the other pairs of particles.
The important special case of the multiplicative kernel, has the two interesting features: (1) it can be mapped onto the Erdős-Rényi random graph that we study in this paper, and (2) it exhibits an interesting gelation phase transition in the limit at time , because a gel, i.e., a particle of macroscopic size, appears. Indeed, for this process, it turned out in the review [Ald99] that, for fixed , the distribution of the family of the component sizes of defined in (1.1) with is identical to the family of particle masses in the multiplicative coalescent at time . This correspondence was not mentioned in [BP90], but was discussed one year later in [BP91], which highlights the connection between gelation in the coagulation process and the phase transition given by the formation of a giant connected component in the Erdős-Rényi random graph [ER60].
Hence, the results of this paper also recover the gelation phenomenon in rather explicit terms through a LDP in terms of the microscopic, mesoscopic and macroscopic parts, in the same way as we explained in the above sections for the Erdős-Rényi random graph.
Smoluchowski introduced a (deterministic) ODE model for the concentrations of coagulating particles in the course of his work on Brownian motion [vS16]. Indeed, it is reasonable to assume that exists under suitable conditions, see [Nor99, LM04, MN14]. These limits satisfy
[TABLE]
where is the limiting coagulation kernel (in our case, ). This is the famous Smoluchowski equation. Intuitively, the positive terms on the right-hand side of (1.16) take into account that the fraction of particles of mass increases if a particle of mass and one of mass (with ) merge and this happens with rate . On the other hand, the negative term describes that a particle of mass can coagulate with particles of any size with rate and this is why it involves an infinite sum (over all ). One can check for that appearing in Theorem 1.5 is the exact solution of (1.16), the Smoluchowski equation, also given in [Ald99, Table 2]. As a consequence, the above mentioned gelation phase transition as well as the solution of the Smoluchowski ODE are also clear from our results in Sections 1.2 and 1.3 and receive therefore a new interpretation in terms of combinatorial structures.
In the light of the process character of the Marcus–Lushnikov coagulation model, it will be desirable to derive a pathwise version of the LDP of Theorem 1.1. This will require a version of that theorem which starts from an arbitrary configuration rather than from . This may also be interesting for the time-dependent version of the Erdős-Rényi graph , but not as natural as for the Marcus–Lushnikov model. Another aspect that makes it particularly interesting for coagulation models is the availability of alternative methods in the spirit of Wentzell–Freidlin theory to derive pathwise LDPs for coagulation models, see [MPPR17]. Let us also mention that, in the renowned paper [Ald97], time is expanded around the critical value , and the mesoscopic components of the graph are compared to a stochastic process known as the multiplicative coalescent. Although we allow for fluctuations around (our LDP holds for any sequence ), we cannot capture this regime around , since we expect a LDP for mesoscopic particles to hold on a different scale. Another natural direction of our future research is an extension to the case of an inhomogeneous Erdős-Rényi graph as introduced in [BJR07]. We will defer future work to these questions.
1.6. Comparison to Bose-Einstein condensation without interaction
Our large-deviations approach to the Marcus–Lushnikov models shows remarkable similarities to another well-known phase transition in a non-spatial model, the non-interacting Bose gas. Here the situation is similar in that the gas can be conceived as a joint distribution of particles that are randomly grouped into smaller units, called cycles, which can become arbitrarily large. The natural question is then, under what circumstances do macroscopic cycles arise. An explicit answer in terms of a large-deviations analysis has been given in [Ada08], where the transition, the famous Bose–Einstein Condensation (BEC) in dimensions , is derived from the minimization of the rate function, in a way analogous to that in our Theorem 1.5. The two phase transitions differ in that the BEC transition is of saturation type, while the gelation transition is not.
For the non-interacting Bose gas in the thermodynamic limit at temperature with particle density the partition function is given by
[TABLE]
where is the centred box in with volume . The free energy per particle is then
[TABLE]
For the Erdős-Rény graph , the equivalent quantity is the rate function from (1.7). The key difference between the rate functions is that only contains terms in the total mass of microscopic components, . This reflects the fact that the giant component makes a significant contribution to the rate function in the graph model, but the condensate in the non-interacting Bose gas does not.
The respective minimisers of and are
[TABLE]
where and control the values of .
The crucial parameters are for the graph model and the inverse temperature for the Bose gas. Both models have a trivial upper bound for the total microscopic mass, , namely one. One additional upper bound arises in each model from the optimisation of the rate function with respect to the , but these are not relevant, until respectively rises to its critical value. For the graph model this bound is , because for all , and the summands take their maxima at , when they correspond to the Borel probability distribution with parameter 1. For this bound is . At this point we see a difference between the two models, because the total microscopic mass in the Bose gas remains on this bound as rises further, while for the graph model it immediately drops strictly below the bound. This explains why BEC is known as a saturation phase transition, but this description cannot be applied to gelation.
2. Preparations for the proof of the LDP
We consider the Erdős-Rényi graph under the corresponding probability measure . In Section 2.1, we derive an explicit formula for the distribution of the empirical measure of the component sizes in terms of connectivity probabilities for (smaller) Erdős-Rényi random graphs. Furthermore, we prepare in Section 2.2 for the asymptotic analysis f or with by recalling from [Ste70] some estimates and asymptotics for this connectivity probability.
2.1. The joint distribution of the component sizes
An important quantity is
[TABLE]
We will be concerned with this quantity for fixed , but with connection probability , in the limit .
We define the state space of the collection of component sizes as
[TABLE]
To each element of the state space , we associate a unique element of the space
[TABLE]
where for each , is the number of indices such that . The map is a bijection and in the following we refer to configurations equally in terms of or .
We denote by the set of all partitions of . We write for the number of sets in with cardinality . Then we can describe the joint distribution of all the component sizes of as follows.
Lemma 2.1**.**
For any , and every ,
[TABLE]
Proof.
A set of vertices is a connected component in the graph if and only if (1) no bond between any vertex in and any vertex outside has been put, and (2) the subgraph formed out of the vertices in and all the bonds between any two vertices in is connected. This has probability . Applying this reasoning to and describing the next component, and iterating this argument, shows that the product of the two products on the right-hand side of (2.4) is equal to the probability, for a given partition with sets of size for any , that the components of are precisely the sets of . Since this probability depends only on the cardinalities, the counting term completes the formula. ∎
Now we rewrite the right-hand side of (2.4) in terms of the empirical measure of , i.e., of the numbers of indices such that . Introduce the event
[TABLE]
Corollary 2.2**.**
For any , and any satisfying ,
[TABLE]
Proof.
Note that the last product on the right-hand side of (2.4) can also be written as . Hence, if is equal to the number of such that for any , then the product of the last two products can be written as
[TABLE]
The counting term is easily identified as
[TABLE]
Substituting ends the proof. ∎
(To avoid confusion, we note that there is a typographical error in Section 4.5 of [Ald99], where the factor of is missing in the exponent of (2.6).)
2.2. The probability of being connected
Our analysis of (2.6) will depend crucially on an analysis of . The next two lemmas collect results from [Ste70, Lemma 1&2, Theorem 1].
Lemma 2.3** (Bounds and asymptotics for , [Ste70]).**
For any , and any ,
[TABLE]
In particular, if with and ,
[TABLE]
The expression for the upper bound in (2.7) appears to be present (using somewhat applied chemical language) in [Flo41, equation (5)].
The following is an alternative upper bound for , which will be required for macroscopic components, together with an asymptotic result for the connection probability in the so-called sparse case, where the bond probability is proportional to the inverse of the size of the graph.
Lemma 2.4** ([Ste70]).**
For all and
[TABLE]
with . Moreover, for and and a sequence , as ,
[TABLE]
3. Proof of the LDP
In this section we prove the main result of this paper, the large-deviations principle in Theorem 1.1. Again, we fix the parameter and a sequence and consider the Erdős-Rényi graph with probability measure .
Recall the topological remarks on the two state spaces and from Section 1.1. The metrics on and on , defined by
[TABLE]
induce the respective topologies of pointwise and vague convergence. We write , respectively , for the closed -ball around , respectively for the closed -ball around . Since the rate function is lower semicontinuous in and the space is compact, we know that it is a good rate function (i.e., its level sets are not only closed but also compact). Therefore, a weak LDP implies our main result, the LDP in Theorem 1.1, and it will be sufficient to prove the following.
Proposition 3.1**.**
For any ,
[TABLE]
We split the proof of Proposition 3.1 in several lemmas and finish it at the end of Section 3. We start by bounding the cardinality of .
Lemma 3.2**.**
Let be as defined in (2.3), then
[TABLE]
Proof.
The following is an argument in [Ada08]. For any , the set has no more than elements, since
[TABLE]
Hence,
[TABLE]
∎
Thanks to Lemma 3.2, it will be sufficient to get estimates on , for any close enough to a fixed . The strategy is to divide the terms in the product representation from Corollary 2.2 into three groups, which we refer to as micro-, meso- and macroscopic, because they take into account the contribution of, respectively, micro-, meso- and macro- components. We fix two increasing sequences and in such that , and . We write
[TABLE]
where
[TABLE]
and
[TABLE]
Let us set
[TABLE]
Note that the sum of these three terms is equal to one. For the factor , we use Stirling’s formula so that uniformly in
[TABLE]
We will consider a cut-off version of the distances and introduced in (3.1), as follows:
[TABLE]
such that for a given we can measure simultaneously the distance of its microscopic part from and its macroscopic part from . We also introduce some new notation, for any we will use the notation to denote the sequence , which clearly denotes an element of . On the other hand, with we denote the point measure on with weight at the point for any and zero everywhere else. This integer valued measure clearly belongs to .
We start by looking at the term , i.e., the microscopic term and we combine it with the first term in (3.5).
Lemma 3.3**.**
Fix . Fix and pick sequences and such that for all . Then, for any , as ,
[TABLE]
where and .
Proof.
For any fixed , we use the upper bound in (2.7), the fact that and Stirling’s lower bound for (notice that for small we expect to be large, ) to obtain
[TABLE]
We obtain, uniformly for , using that ,
[TABLE]
where
[TABLE]
is the cut-off version of the rate function defined in (1.5). Recall that and that and observe that .
To prove (3.6), we notice that is continuous, it is clear that vanishes as and can therefore be estimated against such a . Moreover, we estimate (substituting by ), for any such that , with the help of the Stirling bound and Jensen’s inequality for , as follows:
[TABLE]
for some , where we used that the remainder sum is of order as and that since and that the map is decreasing in , introducing some that vanishes as . This proves the claim (3.6). ∎
Notice that the last term on the right-hand side of (3.6) cannot be further estimated with the help of continuity (since is not continuous), but will be jointly handled together with the correspondent macroscopic and mesoscopic terms. Next we focus on the term , the macroscopic term and we proceed analogously.
Lemma 3.4**.**
Fix . Fix and pick sequences and such that for all . Further assume that . Then, for any , as ,
[TABLE]
for some and that satisfy and .
Proof.
We use the upper bound in Lemma 2.4 and Stirling’s lower bound for (in this case we know that is large and we expect small). We obtain, for ,
[TABLE]
where . We pair with the second term in (3.5), and we obtain, uniformly for ,
[TABLE]
where
[TABLE]
with
[TABLE]
denotes the cut-off version of the rate function defined in (1.5). Indeed, for proving the last line of (3.10) we do the following. In the product, we add the factor and its reciprocal, substitute and turn the sum on into an integral over . Then most of the terms are easily asymptotically identified with the corresponding terms in (3.11), with possible exception of the term
[TABLE]
of which we now show that it is not larger than . Now write in terms of defined in (2.2), such that , and we pick minimal such that . Then, also using the inequalities and , we see that
[TABLE]
Recall the definition of to see that . Use Jensen’s inequality and to see that the entire last term is not larger than . To handle the last missing term in (3.12), notice (because of ) that and and therefore
[TABLE]
where we recall that we assumed that . Hence, the term in (3.12) is not larger than .
To prove (3.9), we first observe that is continuous and hence can be estimated against such a , uniformly in and such that . Furthermore, for any and any such that ,
[TABLE]
since for all . Hence, we arrived at the bound in (3.9). ∎
Notice that again we refrain from estimating the term , which needs to be coupled with the microscopic and the mesoscopic part. Then we are left to handle the middle term in (3.3).
Lemma 3.5**.**
Fix such that . Fix and pick sequences and and such that and for all . Further assume that . Then, as ,
[TABLE]
Proof.
We use again the upper bound in (2.7) and Stirling’s formula, to see that
[TABLE]
We claim that the right-hand side is equal to for some that vanishes, uniformly in , as . First note that the next-to-last term is such a term, since . Furthermore, , which shows that the terms containing and in the first product are also so small. With the same approach as in (3.8), we see the lower bound
[TABLE]
Therefore, uniformly in such that , we have arrived at the estimate (3.13). ∎
Now we collect the upper bounds above and substitute them in (3.3), to obtain the following lemma.
Lemma 3.6**.**
Fix such that . Fix and pick sequences and and such that and for all . Further assume that . Then, for any and ,
[TABLE]
where vanishes as and , followed by and .
Proof.
We collect the upper bound (3.6) from Lemma 3.3, (3.9) from Lemma 3.4 and (3.13) from Lemma 3.5. We substitute them in (3.3), also using (3.5), then we obtain, uniformly in such that and , for any and any , as ,
[TABLE]
where vanishes as and , followed by and , and we recall that . This implies the upper bound in (3.2) in the case where .
∎
In the following lemma, we implicitly use the lower semicontinuity of the maps and to show that when , then the event is empty for any such that and , if and are small enough. This will give the right super-exponential upper bound for , since .
Lemma 3.7**.**
Let such that , then there exists , and large enough such that for all :
[TABLE]
Proof.
We pick so large and so small that are larger than one, say equal to for some . Then choose and in so small that, for any such that and , we have and . Therefore we see that
[TABLE]
which yields a contradiction. ∎
The remaining of this section deals with the construction of an optimal sequence , which will give a lower bound on the probability that matches the upper bound from Lemma 3.6. For large enough define by
[TABLE]
where is an arbitrary diverging sequence in such that .
We notice that our sequence is such that the so-called mesoscopic mass is simply concentrated in components all of the same size (namely ) and surprisingly no specific requirement is imposed on , except that it diverges. We will underline this in the steps of our proof. It is clear by construction that the following hold
[TABLE]
We now give lower bounds to , starting from the formulation in (3.3) and (3.5). By abuse of notation, we will drop the index from .
Lemma 3.8**.**
Fix such that and let be defined by (3.14). Then, as ,
[TABLE]
where is defined in (3.7).
Proof.
We use the lower bound in (2.7) from Lemma 2.3 to perform a calculation similar to that for the upper bound (using now Stirling’s upper bound on ):
[TABLE]
which proves the claim (3.18). ∎
Notice that no restriction is required on in order for the above lower bound to coincide with the upper bound in the proof of Lemma 3.6 (except that diverges). Indeed, although the lower bound in (2.7) differs from the upper bound for a factor the probability is paired with (the probability of a component to be separated from any other), which is an exact term and it balances the error coming from (2.7). In a similar way, one checks the following lower bound for the term involving , for .
Lemma 3.9**.**
Fix such that and let be defined by (3.14). Then, as ,
[TABLE]
Proof.
[TABLE]
The above lower bound relies on the lower bound (2.7) in Lemma 2.3, on Stirling’s upper bound for and (which are both large) and on how we defined in (3.14). ∎
We notice that the term coming from Lemma 3.9, in order to give the desired lower bound matching the upper bound from Lemma 3.6, does not add any condition on the sequence , which is just supposed to diverge.
Lemma 3.10**.**
Fix such that and let be defined by (3.14). Then, for all , as ,
[TABLE]
where is defined in (3.11), and depends on and it goes to zero when .
Proof.
Now, fix . We use the lower bound (2.7) from Lemma 2.3 and Stirling’s upper bound on and we write:
[TABLE]
where the remainder is defined as
[TABLE]
By defining \ell_{k}:=\alpha\big{(}\frac{k-1}{N},\frac{k}{N}\big{]} and using that , we see that
[TABLE]
[TABLE]
Finally, we use that and Jensen’s inequality (since is concave) to give the bound
[TABLE]
Exploiting the bounds above, we can say that
[TABLE]
where depends on and it goes to zero when . Notice that we could not handle directly the terms from to in one go, so we decided to fix a fictitious threshold , where can be chosen arbitrarily close to zero. This however does not affect the choice of and there is no need to add some condition on to handle such small, but macroscopic, components.
The remaining terms (the purely macroscopic ones) in (3.3) are treated as follows. We use Stirling’s upper bound for and we see that
[TABLE]
Let us focus on the integral \int_{(\delta,1]}x\Big{[}\log\Big{(}\frac{\mu_{\lfloor Nx\rfloor}(\frac{1}{N}t_{N})^{\frac{1}{\lfloor Nx\rfloor}}}{x}\Big{)}-\frac{t}{2}(1-x)\Big{]}\alpha({\rm d}x). Now by (2.8) in Lemma 2.4, we know that the integrand converges pointwise to
[TABLE]
Since, for large enough,
[TABLE]
which is clearly integrable over with respect to , we can apply the dominated convergence theorem and we get
[TABLE]
where is defined in (3.11). Now, (3.21) together with (3.22) imply (3.20). ∎
Finally, we combine below the lower bounds in Lemma 3.8, 3.9 and 3.10.
Lemma 3.11**.**
For all such that , there exists a sequence such that , (3.15) and (3.16) hold, and
[TABLE]
Proof.
From the lower bounds in (3.18), (3.19) and (3.20), we get that there exists with and
[TABLE]
so (3.23) follows on taking the limit .
∎
Now we are ready to prove Proposition 3.1 by combining the above lemmas.
Proof of Proposition 3.1.
Fix and and recall the definition of in (2.5), then we see that
[TABLE]
Because of Lemma 3.2, we only have to give asymptotic estimates on the single summands on the right-hand side of (3.25). Let such that . We fix any diverging sequence and vanishing sequence with and , the we see that
[TABLE]
as a consequence of Lemma 3.6. Taking first and to zero and then and , we get the desired upper bound.
Given any fixed and , we can construct the sequence from Lemma 3.11 and see that, for such a sequence
[TABLE]
for an arbitrarily small.
Finally, if are such that , Lemma 3.6 gives us that
[TABLE]
∎
4. Corollaries and study of the rate functions
In this section we analyse, for fixed , the minima of the rate function, , over the configurations respectively , and afterwards the minimima of the rate functions for the total masses, , and . In particular, we will see in Lemma 4.2 that there is a drastic difference when minimizing over if or not. Clearly, no such difference can be spotted when , since we do not allow for . However, when we see that is perfectly admissible and we interpret this as an analytic sign of the phase transition in .
4.1. Rate functions for the microscopic part
We start by minimizing , for a fixed over all compatible . We will obtain the rate function for the microscopic part, and we will see that this minimum is attained for of the form . Informally speaking, the following in particular implies that, with probability tending to one, there is at most one macroscopic particle.
Lemma 4.1** (Analysis of the microscopic rate function).**
Fix and recall that , then
[TABLE]
where , and are defined in the statement of Theorem 1.1.
Proof.
Clearly
[TABLE]
Fix and . Note that since is a point measure with . We have, denoting ,
[TABLE]
since is strictly decreasing in . Indeed,
[TABLE]
We want to prove that for . For , this is obvious from above, and for , this is easily seen as follows.
[TABLE]
since for all . Hence, we see that for , and (4.1) follows.
Furthermore, when we study , we see that its derivative is
[TABLE]
which is strictly negative if . Since if , we see that is strictly decreasing in , and hence the optimal value of is . ∎
Now the proof of Corollary 1.2 directly follows from Theorem 1.1, Lemma 4.1 and the contraction principle since the projection is continuous in the product topology. Let us mention that, since , we can rewrite
[TABLE]
with
[TABLE]
with for all . Hence, the term is the relative entropy of two non-normalized measures and . Notice that the reference measure is such that
[TABLE]
where
[TABLE]
are the probabilities of the Borel distribution with parameter . The total mass of is therefore given by
[TABLE]
where is Borel distributed with parameter . Now this expectation [AP98, §4.5] is precisely , which explains why we added and subtracted the term to in order to obtain the formulation (4.2). As already mentioned in Section 1.4, the above entropy form for the rate function (4.2) strictly relates to the rate function obtained in [BC15, Theorem 1.8] which also takes the form of an entropy with respect to a standard Galton Watson tree, whose total progeny is precisely Borel distributed.
Let us analyse the minimising statistics of the macroscopic part.
Lemma 4.2** (Analysis of the macroscopic rate function).**
Fix and recall that , then
[TABLE]
where .
*Furthermore, the unique minimizer is equal to , defined in (1.13). *
Proof.
As in the proof of Lemma 4.1, we see that
[TABLE]
with defined in (4.3). Fix . Since is strictly convex on the convex set , we see by evaluating the variational equations that the only candidate for a minimiser in the interior is
[TABLE]
with such that . Interestingly, we can identify , where is determined by and is defined in (4.4). Note that is not summable for . Hence, must be picked such that . The largest value that can be realised in this way is by picking . Hence, the preceding is possible at most for . By continuity and strict monotonicity of in , indeed, any can be uniquely realized, by picking such that . In this case, it is clear that the minimizer of in the interior of is equal to
[TABLE]
as claimed in (1.13), with value
[TABLE]
where we used that if , then , see [AP98, §4.5]. Now we give an argument why realises the minimum of over . We show that any such minimiser must be positive in every component. Indeed, if for some , then we consider , defined by
[TABLE]
with such that and such that for any sufficiently small . Now a simple insertion shows that , if is small enough, since the slope of at zero is . Hence, cannot be a minimizer. On the other hand, has the property that all directional derivatives of in all admissible directions with compact support are zero; hence it is the minimizer of over for .
When , it is possible to pick a sequence of such that (pick as for some suitable ). Furthermore, since is a relative entropy, we know that Hence, the infimum of over for is equal to [math]. This shows that the infimum over in the last line of (4.6) is equal to , and (4.5) follows. ∎
Then, the proof of Corollary 1.3 directly follows from Theorem 1.1, Lemma 4.2 and the contraction principle, since the projection is continuous.
Finally, let us draw some conclusions regarding the mesoscopic mass. As stated after Corollary 1.4, it is not possible to apply the contraction principle, if we want to derive an LDP for the sequence of random variables , however we can still identify the rate function by minimizing over all pairs such that . Even if the contraction principle cannot be applied directly, the following lemma proves that the rate function has exactly the expected form, given by (4.9).
Lemma 4.3**.**
Fix . Then, for any and any and such that ,
[TABLE]
Proof.
We first verify that, for a fixed ,
[TABLE]
Fix , then for a fixed
[TABLE]
since the infimum is attained in , as proved in Lemma 4.1. Then, with the same procedure of Lemma 4.2, we see that the infimum over is attained in
[TABLE]
giving
[TABLE]
Minimizing then for , we see that the infimum is attained in , the smallest solution to
[TABLE]
which is , for all and otherwise. By substituting the optimal in (4.10), we see that (4.9) holds.
Now, notice that the procedure to get the upper bound in the proof of Proposition 3.1 implies in a straightforward way that
[TABLE]
In the same way, from the proof of Proposition 3.1, we borrow the strategy of constructing a “recovery sequence”, this time using and to construct as in (3.14). This gives
[TABLE]
∎
The proof of the second point in Corollary 1.4 follows as a direct consequence of Lemma 4.3.
4.2. Proof of Theorem 1.5
Item (1) follows by Lemma 4.1 and 4.2. Following the approach of those proofs, one can easily see that the order of minimization is not important, in particular:
[TABLE]
The minimizer, given a certain microscopic mass , is seen to take the form
[TABLE]
where is defined in (1.13) and is a single macroscopic component. Imposing a certain microscopic (respectively macroscopic) mass influences the optimal configuration. Indeed, although it is optimal for the system to avoid mesoscopic mass (as seen in Corollary 1.4(2)), the impossibility of minimizing the microscopic configuration under a certain constraint on the mass, namely , forces the system to actually have a mesoscopic mass of size . The same happens when we impose a macroscopic mass which is too small, namely . The form of the function in (1.12) comes directly from such minimization procedures.
Let us now prove assertion (2). The form of the minimizing follows from Lemma 4.2. Fix . Then is strictly decreasing in . Indeed
[TABLE]
where we introduced the function , which is decreasing in . Hence, monotonicity of in follows from
[TABLE]
The first inequality follows by observing that the function is nonnegative for all , since , , and is strictly decreasing in , since . The second inequality follows from the fact that for all (substitute ), since , and is strictly increasing in . Therefore, is minimized in , which implies the conclusion.
Now we turn to assertion (3). For , the derivative of writes as follows
[TABLE]
It is clear that is strictly increasing in , while for , we need to go back to (4.11). The right inequality there is still true for any . Since the quotient in (4.11) is strictly increasing in and since is strictly convex in , the unique zero of is given by the unique solution of
[TABLE]
which is precisely the solution of (1.15). The remaining assertions follow.
Acknowledgements
The authors acknowledge three anonymous referees for their careful reviews and many suggestions for improving the exposition. This research has been funded by the Deutsche Forschungsgemeinschaft (DFG) through grant CRC 1114 “Scaling Cascades in Complex Systems”, Project C08.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Ada 08] S. Adams. Large deviations for empirical cycle counts of integer partitions and their relation to systems of Bosons. In Analysis and stochastics of growth processes and interface models , pages 148–172. Oxford Univ. Press, Oxford, 2008.
- 2[Ald 97] D. J. Aldous. Brownian excursions, critical random graphs and the multiplicative coalescent. Ann. Probab. , 25(2):812–854, 1997.
- 3[Ald 99] D. J. Aldous. Deterministic and stochastic models for coalescence (aggregation and coagulation): a review of the mean-field theory for probabilists. Bernoulli , 5(1):3–48, 02 1999.
- 4[AP 98] D. J. Aldous and J. Pitman. Tree-valued Markov chains derived from Galton-Watson processes. Annales de l’Institut Henri Poincare (B) Probability and Statistics , 34(5):637 – 686, 1998.
- 5[Aug 18] F. Augeri. Nonlinear large deviation bounds with applications to traces of Wigner matrices and cycles counts in Erdős-Rényi graphs. arxiv preprint 1810.01558 , 2018.
- 6[BC 15] C. Bordenave and P. Caputo. Large deviations of empirical neighborhood distribution in sparse random graphs. Probability Theory and Related Fields , 163(1-2):149–222, 2015.
- 7[BJR 07] B. Bollobás, S. Janson, and O. Riordan. The phase transition in inhomogeneous random graphs. Random Structures & Algorithms , 31(1):3–122, 2007.
- 8[Bol 01] B. Bollobás. Random Graphs . Cambridge Studies in Advanced Mathematics. Cambridge University Press, 2 edition, 2001.
