The configuration model for Barabasi-Albert networks
M.L. Bertotti, G. Modanese

TL;DR
This paper introduces a rewiring method to generate random networks with specific degree distributions and correlations, enabling comparison with real Barabasi-Albert networks and exploring network properties like disassortativity.
Contribution
The authors develop a novel rewiring technique for creating correlated networks with prescribed degree distributions, specifically applied to Barabasi-Albert networks, and analyze their structural properties.
Findings
Networks with $\beta \geq 2$ are fully connected.
Small degree nodes exhibit disassortativity.
The method allows tuning of the assortativity coefficient.
Table 1: Diffusion times
| network | | | |
|---|---|---|---|
| BA1 | 4.84 | 4.22 | 3.64 |
| BA2 | 5.20 | 4.54 | 3.94 |
| BA3 | 5.40 | 4.74 | 4.10 |
| UNC | 5.50 | 4.84 | 4.20 |
| BA4 | 5.54 | 4.86 | 4.22 |
| BA5 | 5.66 | 4.96 | 4.32 |
| DIS | 5.76 | 5.06 | 4.40 |
| ASS | 6.00 | 5.50 | 4.98 |
The configuration model for Barabasi-Albert networks
M.L. Bertotti (Email address: [email protected])
G. Modanese (Email address: [email protected])
Free University of Bozen-Bolzano
Faculty of Science and Technology
I-39100 Bolzano, Italy
Abstract
We develop and test a rewiring method (originally proposed by Newman) which allows one to build random networks having a pre-assigned degree distribution and pre-assigned two-point correlations. For the case of scale-free degree distributions, we discretize the tail of the distribution according to the general prescription by Dorogovtsev and Mendes. The application of this method to Barabasi-Albert (BA) networks is possible thanks to recent analytical results on their correlations, and allows one to compare the ensemble of random networks generated in the configuration model with that of “real” networks obtained from preferential attachment. For $\beta \geq 2$ ($\beta$ is the number of parent nodes in the preferential attachment scheme) the networks obtained with the configuration model are completely connected (giant component equal to 100%). In both generation schemes a clear disassortativity of the small degree nodes is demonstrated from the computation of the function $k_{nn}(k)$. We also develop an efficient rewiring method which produces tunable variations of the assortativity coefficient $r$, and we use it to obtain maximally disassortative networks having the same degree distribution as BA networks with given $\beta$. Possible applications of this method concern assortative social networks.
I Introduction
In spite of the large number of existing studies on Barabasi-Albert (BA) networks, their two-point correlation functions have been completely analysed only recently by Fotouhi and Rabbat fotouhi2013degree , who have given the full expressions of the conditional probabilities $P(h|k)$ in the large network limit for any value of the parameter $\beta$ (the number of parent nodes to which each new node attaches in the preferential attachment process).
Concerning the assortativity properties of BA networks, in previous work some estimates of the Newman coefficient were found newman2002assortative . According to these estimates, for large (number of nodes), vanishes as . It was therefore generally believed that BA networks are almost uncorrelated, and numerical simulations appeared to confirm this. However, more recent asymptotic estimates fotouhi2018temporal ; bertotti2018bass yield a different result: vanishes only as for large . It should be recalled that for real networks with the same scale-free exponent (), the coefficient is always small in absolute value, so even this small total disassortativity is significant.
By computing the function $k_{nn}(k)$ of BA networks (average nearest neighbor degree of a node of degree $k$) we have shown in bertotti2018bass that it is strongly decreasing for small $k$ and slowly increasing for large $k$. This means that the slight overall disassortativity measured by the coefficient $r$ is in fact the result of an unexpected mixed assortative/disassortative behavior of these networks.
This peculiar structural property may have an influence on the dynamics on BA networks. In particular, we have shown in bertotti2018bass for the Bass innovation diffusion model on a network, that finite BA networks exhibit the fastest diffusion among scale-free networks with exponent . This result was obtained by solving numerically the coupled nonlinear differential mean-field equations of the model bertotti2016bass ; vespignani2012modelling (a system of equations for a network with maximum degree ) and finding the time of the diffusion peak in dependence on the network structure. Comparisons were made between BA networks, uncorrelated networks, disassortative networks built according to a method by Newman newman2003mixing and assortative networks built with a recipe we have recently developed bertotti2016bass . Results are displayed in Tab. 1. The networks employed had typically maximum degree , which corresponds to nodes. It turns out that the fastest diffusion process occurs on BA networks with . Note that the peak diffusion time in the Bass model is, unlike in the SI model, independent from the initial conditions, and gives therefore useful information on the diffusion features of the network.
In this work we use for the first time the correlation functions found in fotouhi2013degree in order to build in the configuration model networks which display these peculiar correlations, and investigate their properties.
The configuration model newman2010networks is a method for the generation of random networks having an assigned degree distribution. It is therefore a powerful extension of the original concept of random network introduced by Erdős, and has been extensively studied with analytical and numerical methods, especially for the case of scale-free networks. Classical results concern the conditions for the formation of a giant component molloy1995critical and its clustering features.
Some authors have also raised the question of whether it is possible to generate networks with pre-assigned correlations. For this purpose Newman proposed in newman2003mixing a method based on a degree-preserving rewiring procedure; more recently, the issue has also been discussed by Bassler et al. bassler2015exact and by Boguna et al. boguna2003class . The practical applications of these ideas have been, until now, rather limited. Yet, from a practical point of view, the efficient generation of networks with given correlations is quite attractive.
For example, social networks are known to be generally assortative, and in order to study diffusion processes on these networks in the mean-field approximation it is very useful to construct mathematically certain families of assortative correlation matrices bertotti2016bass . If explicit realizations of networks with such correlations can be produced, they can be used to characterize the diffusion process further, possibly also with agent-based methods. In fact, an assortative rewiring has already been proposed in newman2003mixing and in xulvi2004reshuffling , but with some limitations: in the first case the assortative matrices employed do not generally satisfy a positivity criterion; in the second case no correlation matrices are employed, and the rewiring criterion works on a heuristic basis.
With the above applications in mind, our aim in this work is to use the correlation matrices of BA networks and the rewiring procedure by Newman to test the configuration model with pre-assigned correlations under controlled conditions. In fact, one of the features of BA networks which makes them so popular and widely used for the simulation of real networks is that they can be readily generated via a preferential attachment procedure. Since their correlation matrices are now available, by reconstructing them in the configuration model we can compare the features of the resulting ensemble of networks with those of the networks produced by preferential attachment. As we shall see, this gives useful insights into the method in general.
The rest of the paper is organized as follows. In Section II we discuss the rewiring procedure which, starting from the configuration model, generates an ensemble of networks having the correlations of BA networks as prescribed correlations. Some features of the networks of the ensemble obtained in this way are then discussed, including the behavior of their average nearest neighbours degree function $k_{nn}(k)$. In Section III the rewiring procedure is adapted to generate maximally disassortative scale-free networks having the same exponent as the BA ones. Section IV concludes by discussing the results and some potential follow-ups. Finally, the Appendix recalls some definitions and the expressions, for the case of BA networks, of some quantities used throughout the paper.
II The configuration model with Newman rewiring
II.1 Discretized degree distribution
Given the total number $N$ of nodes in the network, suppose that we want to assign to each node a degree according to a degree distribution $P(k)$ with the form of a power law, $P(k) = c/k^{\gamma}$. As discussed in boguna2004cut , the maximum node degree $n$ present in the network can be obtained from $N$ through the relation

$$\int_n^{\infty} P(k)\, dk = \frac{1}{N}. \tag{1}$$
This means that is the degree above which one expects to find at most one node. For the case of , we obtain .
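As a worked example of the cut-off criterion, assume that relation (1) is the integral condition $N \int_n^{\infty} P(k)\,dk = 1$ applied to a continuum power law $P(k) = c\,k^{-\gamma}$. The explicit form and the constant $c$ are assumptions made here for illustration, not quantities taken from the text:

```latex
\begin{align*}
N \int_n^{\infty} c\,k^{-\gamma}\,dk
  &= \frac{N c\, n^{1-\gamma}}{\gamma - 1} = 1
  \quad\Longrightarrow\quad
  n = \left( \frac{c N}{\gamma - 1} \right)^{1/(\gamma - 1)}, \\
\gamma = 3:\qquad n &= \sqrt{\frac{c N}{2}} .
\end{align*}
```

Under these assumptions the maximum degree grows like $\sqrt{N}$ for exponent 3. With the continuum normalization on $[1,\infty)$ one has $c = \gamma - 1 = 2$, which would give $n = \sqrt{N}$; a discrete normalization changes only the prefactor.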
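In practice one can set, for finite networks, $P(k) = 0$ for $k > n$, and normalize accordingly, by defining

$$P_n(k) = \frac{P(k)}{\sum_{h \leq n} P(h)}, \qquad k \leq n. \tag{2}$$

Then one can define

$$N_k = \mathrm{Round}\left[ N P_n(k) \right] \tag{3}$$

as the average number of nodes with degree $k$ present in the network, where “Round” denotes rounding to the nearest integer.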
In this way, however, we find that becomes zero when , which is considerably smaller than the value given by the integral criterium (1). The reason is that we are essentially discarding the fractional expectation values found from (3), instead of cumulating them as in (1).
This procedure has been employed in the influential paper aiello2000random in order to generate scale-free networks with the configuration model. This work, however, pre-dates Ref. boguna2004cut and the widespread use of preferential attachment for the generation of scale-free networks, especially of the BA type. Actually it is immediate to realize, by plotting the degree distribution of finite BA networks generated via preferential attachment, that a random succession of hubs in the degree interval is always present. These hubs play an important role in several dynamical processes on the network.
Therefore we shall use in the following, to obtain the discretized degree distribution , not the simple recipe (3) but one of three different improvements of it, which give practically equivalent results for the networks considered in this work:
(1) “Cumulation” method. In this method, for the values of are cumulated, as increases, until their sum exceeds 1; at this point, one hub is created, the cumulation procedure starts again, and so on.
(2) “Random hubs” method. The idea is the following: if the average number of nodes with degree $k$ is smaller than 1, say $x$, then a node with this degree will be created in each realization with probability $x$. Extending the procedure to all degrees, a random variable $\xi \in (0,1)$ is generated for each value of $k$; denoting by Int($x$) the integer part of $x = N P(k)$ and by Dec($x$) its decimal part, one sets $N_k = \mathrm{Int}(x) + 1$ if $\xi < \mathrm{Dec}(x)$ and $N_k = \mathrm{Int}(x)$ if $\xi \geq \mathrm{Dec}(x)$. The total number of nodes is therefore not fixed: the number of nodes of each degree fluctuates by one unit between realizations (in particular, it takes the values 0 or 1 in the tail of the distribution), in such a way that the degree distribution is respected in an ensemble average.
(3) The most general way of generating the degrees of the nodes is to use a probability transformation method. For this one first needs to define a vector $v_j$, $j = 0, 1, \ldots, n$, where $v_0 = 0$, $v_j = \sum_{k \leq j} P(k)$ and $P(k)$ denotes the normalized degree distribution. The values of $v_j$ define breakpoints of the unit interval (0,1). After generating a random number $\xi$ in this interval, a new node is introduced with degree $k$ if $v_{k-1} < \xi \leq v_k$, and the procedure is repeated $N$ times. This method has the advantage of allowing the generation of exactly $N$ nodes.
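The three recipes above can be sketched in Python as follows. This is a minimal illustration and not the authors' code: all function names are hypothetical, the pure power law is only an example input, and starting the cumulation where $N P(k)$ drops below 1 is an assumption made here.

```python
import bisect
import random

def power_law_pk(gamma, n_max, k_min=1):
    """Normalized P(k) proportional to k**-gamma on k_min..n_max (example input)."""
    ks = range(k_min, n_max + 1)
    weights = [k ** -gamma for k in ks]
    total = sum(weights)
    return {k: w / total for k, w in zip(ks, weights)}

def counts_cumulation(pk, N):
    """Method (1): round N*P(k) in the bulk; in the tail, cumulate the
    fractional expectations and create one hub each time the sum reaches 1.
    ASSUMPTION: the tail starts where N*P(k) < 1."""
    counts, acc = {}, 0.0
    for k in sorted(pk):
        x = N * pk[k]
        if x >= 1.0:
            counts[k] = round(x)
        else:
            acc += x
            if acc >= 1.0:
                counts[k] = 1   # one hub of degree k
                acc = 0.0       # restart the cumulation
    return counts

def counts_random_hubs(pk, N, rng):
    """Method (2): N_k = Int(x) + 1 with probability Dec(x), else Int(x)."""
    counts = {}
    for k in sorted(pk):
        x = N * pk[k]
        c = int(x) + (1 if rng.random() < x - int(x) else 0)
        if c:
            counts[k] = c
    return counts

def degrees_inverse_transform(pk, N, rng):
    """Method (3): draw exactly N degrees using the cumulative breakpoints."""
    ks = sorted(pk)
    cum, s = [], 0.0
    for k in ks:
        s += pk[k]
        cum.append(s)
    cum[-1] = 1.0  # guard against floating-point round-off
    return [ks[bisect.bisect_right(cum, rng.random())] for _ in range(N)]
```

For instance, `degrees_inverse_transform(power_law_pk(3.0, 50), 2500, random.Random(0))` returns a list of exactly 2500 degrees, whereas the total given by `counts_random_hubs` fluctuates around $N$ from one realization to the next.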
II.2 Description of the wiring and re-wiring algorithm
After a degree has been assigned to each candidate node (or “stub”) , in the classical configuration model a certain number of links is randomly attached to the stubs, until each stub reaches its planned degree. In our algorithm this wiring procedure is not random, but partially deterministic. This is more efficient and makes sense because the wiring is followed by a massive random re-wiring phase (see below) which preserves the degrees of the nodes but makes the correlations close, in an ensemble average, to the “target” correlations .
The wiring procedure starts from Node 1, whose degree is the largest in the network. Among the remaining ones, distinct nodes are chosen randomly and connected to it. For each of the nodes chosen, the number of available stubs is decreased by one. Then the same steps are repeated for Node 2 and so on, with exclusion of nodes whose stubs are all already connected. The final product will be a list of links of the form , where and denote two nodes (). Provided is even, we have
[TABLE]
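A minimal Python sketch of the partially deterministic wiring described above. The function name `wire`, the representation of nodes as list indices, and the handling of stubs that cannot be matched (they are simply reported back) are assumptions made for illustration.

```python
import random

def wire(degrees, rng):
    """Connect nodes in order of decreasing planned degree.

    degrees[v] is the planned degree of node v. Returns the list of links
    and a dict with the number of stubs left unmatched at each node.
    """
    order = sorted(range(len(degrees)), key=lambda v: -degrees[v])
    free = {v: degrees[v] for v in order}   # available stubs per node
    links = []
    for pos, v in enumerate(order):
        # Candidates: nodes later in the order that still have free stubs.
        candidates = [w for w in order[pos + 1:] if free[w] > 0]
        chosen = rng.sample(candidates, min(free[v], len(candidates)))
        for w in chosen:
            links.append((v, w))
            free[w] -= 1
        free[v] -= len(chosen)
    return links, free
```

Because every pair of nodes is considered at most once, the result has no self-loops and no multiple links. Any stubs that remain unmatched show up as positive entries in `free`. The quadratic cost of rebuilding `candidates` is acceptable for networks of a few thousand nodes, which is the size discussed in this paper.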
For the rewiring according to the Newman procedure, we choose at random two links $(a,b)$ and $(c,d)$ from the list of links. Denote by $A, B, C, D$ the excess degrees of these nodes and define the quantities

$$E_1 = e_{AB}\, e_{CD}, \qquad E_2 = e_{AC}\, e_{BD}, \tag{5}$$

where $e_{jk}$ is the “target” correlation matrix that we want to approach in the rewiring. Then
- If $E_1 = 0$, the rewiring is performed, i.e., the links $(a,b)$, $(c,d)$ are replaced by $(a,c)$, $(b,d)$.
- If $E_1 > 0$, we define $P = E_2/E_1$ and then generate a random number $x \in (0,1)$.
  - If $P \geq 1$, the rewiring is performed.
  - If $P < 1$ and $x < P$, the rewiring is performed.
Then another couple of links is chosen and the same steps are repeated.
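The acceptance rule can be sketched as a Metropolis-type step. The sketch below assumes the standard form of Newman's rule: the swap is accepted with probability $\min(1, E_2/E_1)$, where $E_1$ and $E_2$ are the products of target correlations for the current and the proposed pair of links. The function name, the callable `e_target`, the random orientation of the second link, and the rejection of swaps that would create self-loops or multiple links are all assumptions of this illustration.

```python
import random

def newman_rewire(links, degree, e_target, attempts, rng):
    """Degree-preserving rewiring towards a target matrix e_target(j, k).

    links: list of (a, b) pairs; degree: dict node -> degree;
    e_target: symmetric function of two excess degrees.
    Returns the rewired list of links and the number of accepted swaps.
    """
    links = list(links)
    present = {frozenset(l) for l in links}
    accepted = 0
    for _ in range(attempts):
        i, j = rng.sample(range(len(links)), 2)
        (a, b), (c, d) = links[i], links[j]
        if rng.random() < 0.5:          # treat both orientations equally
            c, d = d, c
        if len({a, b, c, d}) < 4:       # would create a self-loop
            continue
        if frozenset((a, c)) in present or frozenset((b, d)) in present:
            continue                    # would create a multiple link
        A, B, C, D = (degree[v] - 1 for v in (a, b, c, d))
        e1 = e_target(A, B) * e_target(C, D)   # current pair of links
        e2 = e_target(A, C) * e_target(B, D)   # proposed pair of links
        # Accept always if e1 == 0 or e2 >= e1, else with probability e2/e1.
        if e1 > 0 and rng.random() >= e2 / e1:
            continue
        present -= {frozenset((a, b)), frozenset((c, d))}
        present |= {frozenset((a, c)), frozenset((b, d))}
        links[i], links[j] = (a, c), (b, d)
        accepted += 1
    return links, accepted
```

Passing `e_target = lambda j, k: 1.0` accepts every admissible swap and therefore randomizes the network while preserving all degrees; a BA target would be supplied as a lookup into a precomputed matrix.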
The ergodicity property of this rewiring procedure has been discussed in newman2003mixing . As empirical criterium for the attainment of equilibrium we set an average of rewirings per node. The fraction of successful rewirings for the present case of BA networks turns out to be larger than 0.5. Therefore, can be taken with a safe margin as a total number of attempts necessary for our trial networks with . This can be accomplished in less than 1 second on a normal machine. The time scales linearly with the size of the network. We chose to report here on the size also for practical reasons of visualization of the network and of its function (see below, Sect. II.4).
The parameter and the number of rewirings given above are only one of many possible safe choices and do not substantially affect the properties observed in the networks. Concerning the choice of the rewiring algorithm itself, we are not aware of any alternative to the Newman algorithm, if the purpose of the rewiring is to obtain networks having (in a statistical sense) pre-defined two-point “target” correlations.
II.3 Properties of BA networks obtained in the configuration model
For a BA network with $\beta = 1$ (in the following also denoted as BA1), the correlation $P(1|1)$ is zero, according to the general formulas of Fotouhi and Rabbat. This particular case is also obvious if one considers the growth process of the network in the preferential attachment scheme, which keeps the network connected at every step: no node of degree 1 can be connected to a node of the same degree, otherwise an isolated pair would be formed.
In the configuration model applied to scale-free networks with random rewiring, isolated pairs do usually form, and in large numbers, with the effect of a considerable reduction of the giant component. However, when we apply the configuration model to the degree distribution of a BA1 network, followed by a Newman rewiring with target correlations of the BA1 type (obtained from the as in eq. (10) in the Appendix, the resulting number of isolated pairs is always zero, because the condition is enforced in an effective way. The size of the giant (connected) component is about . Most of the disconnected small components are triples (Fig. 1), whose origin is quite interesting. The correlation is non zero for BA1 networks. In fact, in the growth process with preferential attachment, connected tails of variable length can arise, in which the last node contributes to the correlation and the intermediate nodes contribute to the correlation . When the network is re-constructed in the configuration model, isolated triples arise, because the non-vanishing conditional probability allows to attach two nodes of degree 1 to a central node of degree 2 “without knowing” that on the other side of this central node there is no connection to the giant component. This is a simple demonstration of the general fact that the knowledge of the degree distribution and two-point correlations is insufficient to completely characterize a network.
For BA degree distributions with $\beta \geq 2$ (Fig. 2) the Newman rewiring always generates a fully connected network (giant component equal to 100%). This has little to do with the correlations, but is due instead to the large average connectivity of these degree distributions, namely $\langle k \rangle \simeq 2\beta$. For uncorrelated networks, it is known that the size of the giant component in the configuration model grows quickly as $\langle k \rangle$ grows newman2010networks . We can readily check this with our algorithm, for example, through an initial wiring with the BA2 degree distribution, followed by a Newman rewiring with uncorrelated target correlations $e_{jk} = q_j q_k$.
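The size of the giant component of a network given as a list of links can be measured with a union-find structure. The following is a small sketch for illustration; the function name and the convention that nodes are labelled `0..num_nodes-1` are assumptions.

```python
def giant_component_fraction(num_nodes, links):
    """Fraction of nodes belonging to the largest connected component."""
    parent = list(range(num_nodes))

    def find(v):
        # Follow parent pointers to the root, halving the path on the way.
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    for a, b in links:
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[ra] = rb          # merge the two components

    sizes = {}
    for v in range(num_nodes):
        root = find(v)
        sizes[root] = sizes.get(root, 0) + 1
    return max(sizes.values()) / num_nodes
```

For example, a network of five nodes made of one isolated triple and one isolated pair, `giant_component_fraction(5, [(0, 1), (1, 2), (3, 4)])`, gives 0.6.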
II.4 Function of BA networks obtained with the configuration model
A possible way to check if the two-point correlation functions of the BA networks have been correctly reproduced in the configuration model is to plot the function $k_{nn}(k)$, also known as the average nearest neighbours degree distribution. Due to the partial summation in its definition, this function depends only on one argument and is therefore easier to analyse than the full correlation matrix; moreover, it has a direct qualitative interpretation in terms of assortativity and disassortativity of the network. For an uncorrelated network it is constant and equal to $\langle k^2 \rangle / \langle k \rangle$. By computing the function $k_{nn}(k)$ of the BA correlations given by Fotouhi and Rabbat, we have shown that it is decreasing at small $k$, reaching a minimum for a $k$ almost proportional to $\beta$ ( in the range ), and then it is slightly increasing for large $k$.
This behavior is reproduced in the configuration model but, as expected, with large fluctuations, if one plots for different BA networks belonging to the statistical ensemble obtained with the Newman rewiring. Graphically one can visualize such an ensemble with a “cloud” plot (see Fig. 3). This plot is completely analogous to a cloud plot of the functions for BA networks obtained at random with the preferential attachment algorithm.
Note that the degrees of the largest hubs in Fig. 3 are randomly generated according to the “random hubs” method described in Sect. II.1. This is very similar to what one obtains using a standard randomized preferential attachment algorithm: if one analyzes a relatively small number of realizations, one will find that the largest hubs present have variable degrees, and most of the degrees in the tail of the degree distribution are actually missing.
Alternatively to the cloud diagram, one can compute the average of the function over a rewiring ensemble, and compare it to the obtained with the correlations of Fotouhi and Rabbat. In this case, the degrees of the largest hubs need to be fixed, otherwise the average for large is meaningless due to the missing hubs in each realization. Therefore one must employ for this comparison the “cumulated probability” method of Sect. II.1. A typical result is shown in Fig. 4. Even with a small ensemble (e.g. 40 networks, in the figure) compares well with .
The assortativity coefficient of the configuration model ensemble, which condensates the information on the correlations into a single number, is for the examples given (BA2, ), , with standard deviation . The value of computed from the FR correlations is larger in absolute value: . The difference can probably be explained as due to the fluctuations.
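For a single realization, both the average nearest neighbour degree function and the assortativity coefficient can be measured directly from the list of links. The sketch below is an illustration with hypothetical names; it computes the coefficient as the Pearson correlation of the excess degrees found at the two ends of a link, counting each link in both directions.

```python
from collections import defaultdict

def knn_and_r(links):
    """Return ({k: average neighbour degree of nodes of degree k}, r)."""
    deg = defaultdict(int)
    for a, b in links:
        deg[a] += 1
        deg[b] += 1

    # Sum of the neighbours' degrees for every node.
    nbr_sum = defaultdict(float)
    for a, b in links:
        nbr_sum[a] += deg[b]
        nbr_sum[b] += deg[a]

    per_degree = defaultdict(list)
    for v, k in deg.items():
        per_degree[k].append(nbr_sum[v] / k)
    knn = {k: sum(vals) / len(vals) for k, vals in sorted(per_degree.items())}

    # Excess degrees at the two ends of each link, in both directions.
    xs, ys = [], []
    for a, b in links:
        xs += [deg[a] - 1, deg[b] - 1]
        ys += [deg[b] - 1, deg[a] - 1]
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    cov = sum((x - mean) * (y - mean) for x, y in zip(xs, ys)) / len(xs)
    r = cov / var if var > 0 else float("nan")
    return knn, r
```

A star with four leaves, `knn_and_r([(0, 1), (0, 2), (0, 3), (0, 4)])`, is the extreme disassortative case: the leaves see only the hub and the hub sees only leaves, and the coefficient equals $-1$.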
III Maximally disassortative networks with scale-free exponent 3
We have seen that finite BA networks are moderately disassortative, and not uncorrelated as frequently stated in the literature. A natural question arises: how relevant is this disassortativity? For scale-free networks with the same exponent () what is the lowest possible value of attainable? And what is the aspect of networks with such minimum , compared to BA networks? We recall that the coefficient of real-world scale-free disassortative networks is rarely more negative than , see bertotti2018bass , even if in general the coefficient of biological and technological networks can be smaller, especially for small size networks. For example, the protein-protein interaction network of H. pylori () has and the protein-protein interaction network of C. elegans () has . This fact has important consequences for the spectra of the networks jalan2015assortative .
The configuration model offers a powerful tool for the exploration of such issues. In this section we shall show that it is possible to do an efficient rewiring of a network with the BA degree distribution which decreases the value of , and consistently yields a minimum value which is approximately three times the value of the coefficient of the corresponding BA network. For instance, for a maximum degree and we have . This makes sense for BA networks with , because for the giant component obtained from the configuration model is not complete.
With the pseudo-random wiring process described in Sect. II.2 we obtain a network in the form of a list of links. We shall now perform on this network a rewiring process with the aim of increasing or decreasing its assortativity coefficient until respectively a maximum or minimum are reached. Each elementary rewiring step works as follows.
(1) Two links are chosen at random in the list. Suppose the first link is between nodes and and the second between nodes and (). Let the excess degrees of these nodes be respectively .
(2) The links and are replaced by new links and , provided and (in order to avoid the formation of loops). This rewiring step causes a change in the elements of the matrix for which or coincide with or . We recall that after the wiring process the matrix is computed by counting in the node list, for each fixed couple of values , how many links are between a node with degree and one with degree , and dividing by ; nodes with are counted twice. The variations after one rewiring step are
[TABLE]
The variation of the Newman assortativity coefficient after one rewiring step is due to the presence in the definition of (eq. (12) in the Appendix) of the sum , which changes as
[TABLE]
On the other hand, the sum does not change in the rewiring, because the distribution of the excess degrees does not change (it is a degree-preserving rewiring). For the same reason, the (positive) denominator in the definition of is also unchanged. The variation of is therefore
[TABLE]
The variation is accepted when , if we are looking for the maximum assortativity, or viceversa. The algorithm performs a large number of rewirings, for instance rewirings for a network with nodes; then is visualized and another rewirings are performed, and so on, until the value of stabilizes (this can be checked visually or through some automated criterium; the convergence is usually quick, and we shall discuss in further work the issue of possible local maxima and minima and how to exclude them).
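Since the denominator of the coefficient and the term built from the excess degree distribution are unchanged by a degree-preserving swap, the sign of the variation is determined by the change in the sum, over all links, of the product of the excess degrees at the two ends. A minimal greedy sketch based on this observation follows; the function names, the random orientation of the second link, and the exclusion of self-loops and multiple links are assumptions of this illustration, and no attempt is made to escape local extrema.

```python
import random

def excess_product_sum(links, degree):
    """Sum over links of the product of the excess degrees of the two ends."""
    return sum((degree[a] - 1) * (degree[b] - 1) for a, b in links)

def extremal_rewire(links, degree, attempts, rng, maximize=False):
    """Accept a swap only if it moves the assortativity in the wanted direction."""
    links = list(links)
    present = {frozenset(l) for l in links}
    for _ in range(attempts):
        i, j = rng.sample(range(len(links)), 2)
        (a, b), (c, d) = links[i], links[j]
        if rng.random() < 0.5:
            c, d = d, c
        if len({a, b, c, d}) < 4:
            continue
        if frozenset((a, c)) in present or frozenset((b, d)) in present:
            continue
        A, B, C, D = (degree[v] - 1 for v in (a, b, c, d))
        # Change of the link sum when (a,b),(c,d) become (a,c),(b,d).
        delta = (A * C + B * D) - (A * B + C * D)
        if delta == 0 or (delta > 0) != maximize:
            continue
        present -= {frozenset((a, b)), frozenset((c, d))}
        present |= {frozenset((a, c)), frozenset((b, d))}
        links[i], links[j] = (a, c), (b, d)
    return links
```

With `maximize=False` the link sum returned by `excess_product_sum` can only decrease, which drives the network towards a maximally disassortative configuration with the same degree sequence; with `maximize=True` it can only increase.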
In Fig. 5 the function which arises from one realization of a maximally disassortative network obtained with this method is compared with the corresponding of a BA2 network.
The values of obtained will depend (for fixed ) on the degree distribution, therefore on the scale-free exponent in the case of a pure power law, or on for a “BA-like” degree distribution . We use this degree distribution as a variation of the pure power law , in order to investigate the role of the details of the degree distribution at small ; we recall that these details influence the average degree and may have a strong impact, for instance, on the giant component of random networks in the configuration model newman2010networks .
IV Conclusion
The degree distribution, correlation functions and assortativity character are distinctive features of any network, and affect in an essential way the dynamical processes which take place on it. It is therefore desirable to develop methods and algorithms which generate networks where such characters are pre-assigned; this allows one to study the resulting networks in detail and to simulate dynamical processes on them.
The configuration model newman2010networks ; barabasi2016network in its traditional form allows one to generate uncorrelated networks with an assigned degree distribution and has been widely investigated. For the scale-free case, however, defining the discretized degree distribution of the high-degree “stubs” in accordance with the integral criterion of Dorogovtsev-Mendes is not trivial (a point that we also address in this paper, before turning to the correlations).
An improvement of the configuration model through a rewiring algorithm that generates an ensemble of networks with pre-assigned correlations has been proposed by Newman in his seminal paper on assortative mixing newman2003mixing . In that paper Newman applied his rewiring algorithm to scale-free networks of the disassortative kind (and also of the assortative kind, in a small range of the coefficient). No further applications of this method have been published, to our knowledge; degree-preserving rewirings have been often used xulvi2004reshuffling ; van2010influence ; d2012robustness , but not in connection with the correlation functions. Therefore the recent full computation of the correlations for BA networks fotouhi2013degree offers the possibility of a new test of the Newman rewiring by comparison with the BA networks generated directly via the preferential attachment scheme.
In particular, we have tested numerically (and we plan to extend this work to other classes of correlation functions, besides those of BA networks): (a) the giant connected component of the networks obtained; (b) the average of the function $k_{nn}(k)$ in their ensemble; (c) the fluctuations of $k_{nn}(k)$ in the ensemble.
Furthermore, we have developed a new rewiring criterion which allows one to obtain in an almost deterministic way (i.e., with very small fluctuations in the resulting ensemble) networks having maximum or minimum values of the assortativity coefficient.
Finally, with this method it is also possible to focus on features related to specific components of the correlations. For instance, we observe that in the configuration model of BA1 networks the isolated couples are completely absent, thanks to the vanishing of the correlation; in a maximally disassortative network with BA2 degree distribution all the largest hubs are connected exclusively with nodes of degree 2, etc. These features have been obtained in trial networks with 2500 nodes, and therefore with a statistical significance of the order of 1 part in . In future computations we plan to increase the accuracy and especially to address the case of assortative networks.
V Abbreviations used
BA: Barabasi-Albert
BA1, BA2, …: Barabasi-Albert networks with $\beta = 1, 2, \ldots$
UNC: Uncorrelated
ASS: Assortative
DIS: Disassortative
VI Declarations
Availability of data and materials: - not applicable (no data and materials have been used or generated in this work).
Competing interests - None of the authors have any competing interests in the manuscript.
Funding - This work was supported by the Open Access Publishing Fund of the Free University of Bozen-Bolzano.
Authors’ contributions - All authors contributed equally to this work.
Acknowledgments - Not applicable.
VII Appendix
We recall here some formulae which have been used throughout the paper and in the simulations.
Whereas $P(k)$ expresses the probability that a randomly chosen node of a network has degree $k$, the degree correlation $P(h|k)$ expresses the conditional probability that a node with degree $k$ is connected to one with degree $h$.
In particular, for a Barabasi-Albert network with parameter $\beta$, corresponding to the number of parent nodes in the preferential attachment scheme, the degree distribution is given barabasi2016network by

$$P(k) = \frac{2\beta(\beta+1)}{k(k+1)(k+2)}, \qquad k \geq \beta, \tag{9}$$
and the degree correlations are given by
[TABLE]
with denoting the binomial coefficient
[TABLE]
The expressions in (10) have been established by Fotouhi and Rabbat in fotouhi2013degree and have been employed by us, here and in bertotti2018bass , after a suitable normalization, due to the fact that we deal with networks with a finite maximal degree.
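The effect of a finite maximal degree on the normalization can be illustrated for the degree distribution. The sketch below assumes the standard BA form $P(k) = 2\beta(\beta+1)/[k(k+1)(k+2)]$ for $k \geq \beta$; the function name is hypothetical, and the same truncate-and-renormalize idea is only presumed to carry over to the correlations.

```python
def ba_degree_distribution(beta, n):
    """Standard BA P(k), truncated at maximal degree n and renormalized.

    Returns the normalized distribution and the probability mass that the
    untruncated formula assigns to degrees beta..n.
    """
    raw = {k: 2.0 * beta * (beta + 1) / (k * (k + 1) * (k + 2))
           for k in range(beta, n + 1)}
    mass = sum(raw.values())
    return {k: p / mass for k, p in raw.items()}, mass

pk, mass = ba_degree_distribution(beta=2, n=2000)
mean_k = sum(k * p for k, p in pk.items())
```

The mass lost by the truncation is of order $1/n^2$, so the renormalization is a small correction, while the average degree approaches $2\beta$ from below with a deficit of order $1/n$.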
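As for the assortativity coefficient $r$, its expression is given (see e.g. newman2002assortative or bertotti2018bass ) by

$$r = \frac{1}{\sigma_q^2} \sum_{k,h=0}^{n-1} k h \left( e_{kh} - q_k q_h \right), \tag{12}$$

where $e_{kh}$ denotes the probability that a randomly chosen link connects nodes with excess degree $k$ and $h$ (the excess degree of a node being its total degree minus one), and $q_k$ and $\sigma_q^2$ are given by

$$q_k = \sum_{h=0}^{n-1} e_{kh} \tag{13}$$

and

$$\sigma_q^2 = \sum_{k=0}^{n-1} k^2 q_k - \left( \sum_{k=0}^{n-1} k\, q_k \right)^2. \tag{14}$$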
References
- [1] B. Fotouhi and M.G. Rabbat. Degree correlation in scale-free graphs. Eur. Phys. J. B, 86(12):510, 2013.
- [2] M.E.J. Newman. Assortative mixing in networks. Phys. Rev. Lett., 89(20):208701, 2002.
- [3] B. Fotouhi and M. Rabbat. Temporal evolution of the degree distribution of alters in growing networks. Network Science, 6(1):97–155, 2018.
- [4] M.L. Bertotti and G. Modanese. The Bass diffusion model on finite Barabasi-Albert networks. Complexity, 2019:6352657, 2019.
- [5] M.L. Bertotti, J. Brunner, and G. Modanese. The Bass diffusion model on networks with correlations and inhomogeneous advertising. Chaos Solitons Fract., 90:55–63, 2016.
- [6] A. Vespignani. Modelling dynamical processes in complex socio-technical systems. Nat. Phys., 8(1):32–39, 2012.
- [7] M.E.J. Newman. Mixing patterns in networks. Phys. Rev. E, 67(2):026126, 2003.
- [8] Z. Jiang, F.M. Bass, and P.I. Bass. Virtual Bass model and the left-hand data-truncation bias in diffusion of innovation studies. Int. J. Res. Mark., 23(1):93–106, 2006.
