Clustering Spectrum of scale-free networks

Clara Stegehuis; Remco van der Hofstad; Johan S.H. van Leeuwaarden,; A.J.E.M Janssen

arXiv:1706.01727·cs.SI·November 1, 2017

Clustering Spectrum of scale-free networks

Clara Stegehuis, Remco van der Hofstad, Johan S.H. van Leeuwaarden,, A.J.E.M Janssen

PDF

TL;DR

This paper investigates the clustering spectrum in scale-free networks, revealing a universal pattern of three regimes in the correlation function and explaining its emergence in large networks.

Contribution

It introduces a universal curve for the clustering spectrum in scale-free networks and analytically explains its properties and dependence on network size.

Findings

01

The clustering spectrum follows a universal three-regime curve.

02

The power-law decay of clustering depends on degree distribution.

03

Large networks exhibit the predicted universal curve properties.

Abstract

Real-world networks often have power-law degrees and scale-free properties such as ultra-small distances and ultra-fast information spreading. In this paper, we study a third universal property: three-point correlations that suppress the creation of triangles and signal the presence of hierarchy. We quantify this property in terms of $\overset{c}{ˉ} (k)$ , the probability that two neighbors of a degree- $k$ node are neighbors themselves. We investigate how the clustering spectrum $k \mapsto \overset{c}{ˉ} (k)$ scales with $k$ in the hidden variable model and show that $c (k)$ follows a {\it universal curve} that consists of three $k$ -ranges where $\overset{c}{ˉ} (k)$ remains flat, starts declining, and eventually settles on a power law $\overset{c}{ˉ} (k) \sim k^{- α}$ with $α$ depending on the power law of the degree distribution. We test these results against ten contemporary real-world networks and explain…

Tables1

Table 1. Table 1: Data sets. N 𝑁 N denotes the number of vertices, τ 𝜏 \tau the exponent of the tail of the degree distribution estimated by the method proposed in Clauset et al. ( 2009 ) together with the goodness of fit criterion proposed in Clauset et al. ( 2009 ) (when the goodness of fit is at least 0.10, a power-law tail cannot be rejected), and α 𝛼 \alpha denotes the exponent of c ( k ) 𝑐 𝑘 c(k) .

	$N$	$τ$	g.o.f.	$α$
Hudong	1.984.484	2,30	0.00	0,85
Baidu	2.141.300	2,29	0.00	0,80
Wordnet	146.005	2,47	0.00	1,01
Google web	875.713	2,73	0.00	1,03
AS-Skitter	1.696.415	2,35	0.06	1,12
TREC-WT10g	1.601.787	2,23	0.00	0,99
Wiki-talk	2.394.385	2,46	0.00	1,54
Catster/Dogster	623.766	2,13	0.00	1,20
Gowalla	196.591	2,65	0.80	1,24
Youtube	1.134.890	2,22	0.00	1,05

Equations268

ρ (h) = C h^{- τ}

ρ (h) = C h^{- τ}

p (h, h^{'}) \sim \frac{h h ^{'}}{N ⟨ h ⟩},

p (h, h^{'}) \sim \frac{h h ^{'}}{N ⟨ h ⟩},

p(h,h^{\prime})=\min\Big{(}1,\frac{hh^{\prime}}{N\langle h\rangle}\Big{)},

p(h,h^{\prime})=\min\Big{(}1,\frac{hh^{\prime}}{N\langle h\rangle}\Big{)},

c (h) = \int_{h^{'}} \int_{h^{''}} p (h^{'} ∣ h) p (h^{'}, h^{''}) p (h^{''} ∣ h) d h^{''} d h^{'},

c (h) = \int_{h^{'}} \int_{h^{''}} p (h^{'} ∣ h) p (h^{'}, h^{''}) p (h^{''} ∣ h) d h^{''} d h^{'},

c (h) \propto N^{2 - τ} ln N, h \leq N^{β (τ)} .

c (h) \propto N^{2 - τ} ln N, h \leq N^{β (τ)} .

c(h)\propto N^{2-\tau}\Big{(}1+\ln\Big{(}\frac{\sqrt{N}}{h}\Big{)}\Big{)},\quad N^{\beta(\tau)}\leq h\leq\sqrt{N}.

c(h)\propto N^{2-\tau}\Big{(}1+\ln\Big{(}\frac{\sqrt{N}}{h}\Big{)}\Big{)},\quad N^{\beta(\tau)}\leq h\leq\sqrt{N}.

c(h)\propto\frac{1}{N}\Big{(}\frac{h}{N}\Big{)}^{-2(3-\tau)},\quad h\geq\sqrt{N},

c(h)\propto\frac{1}{N}\Big{(}\frac{h}{N}\Big{)}^{-2(3-\tau)},\quad h\geq\sqrt{N},

min (1, \frac{h h ^{'}}{N ⟨ h ⟩}) min (1, \frac{h h ^{''}}{N ⟨ h ⟩}) min (1, \frac{h ^{'} h ^{''}}{N ⟨ h ⟩}) .

min (1, \frac{h h ^{'}}{N ⟨ h ⟩}) min (1, \frac{h h ^{''}}{N ⟨ h ⟩}) min (1, \frac{h ^{'} h ^{''}}{N ⟨ h ⟩}) .

h^{'}, h^{''} \leq \frac{N ⟨ h ⟩}{h} .

h^{'}, h^{''} \leq \frac{N ⟨ h ⟩}{h} .

h^{'} h^{''} \leq N ⟨ h ⟩ .

h^{'} h^{''} \leq N ⟨ h ⟩ .

σ_{N} (t) = \frac{ln ( c ( h ) / c ( h _{c} ) )}{ln ( N ⟨ h ⟩)}, h = (N ⟨ h ⟩)^{t},

σ_{N} (t) = \frac{ln ( c ( h ) / c ( h _{c} ) )}{ln ( N ⟨ h ⟩)}, h = (N ⟨ h ⟩)^{t},

p (h, h^{'}) = 1 - e^{- h h^{'} / N ⟨ h ⟩} \approx \frac{h h ^{'}}{N ⟨ h ⟩} .

p (h, h^{'}) = 1 - e^{- h h^{'} / N ⟨ h ⟩} \approx \frac{h h ^{'}}{N ⟨ h ⟩} .

\displaystyle c(h)=\frac{\int_{1}^{h_{c}}\int_{1}^{h_{c}}\rho(h^{\prime})p(h,h^{\prime})\rho(h^{\prime\prime})p(h,h^{\prime\prime})p(h^{\prime},h^{\prime\prime}){\rm d}h^{\prime\prime}{\rm d}h^{\prime}}{\big{[}\int_{1}^{h_{c}}\rho(h^{\prime})p(h,h^{\prime}){\rm d}h^{\prime}\big{]}^{2}}

\displaystyle c(h)=\frac{\int_{1}^{h_{c}}\int_{1}^{h_{c}}\rho(h^{\prime})p(h,h^{\prime})\rho(h^{\prime\prime})p(h,h^{\prime\prime})p(h^{\prime},h^{\prime\prime}){\rm d}h^{\prime\prime}{\rm d}h^{\prime}}{\big{[}\int_{1}^{h_{c}}\rho(h^{\prime})p(h,h^{\prime}){\rm d}h^{\prime}\big{]}^{2}}

\displaystyle=\frac{\int_{1}^{h_{c}}\int_{1}^{h_{c}}(h^{\prime}h^{\prime\prime})^{-\tau}\min(\frac{hh^{\prime}}{h_{s}^{2}},1)\min(\frac{hh^{\prime\prime}}{h_{s}^{2}},1)\min(\frac{h^{\prime}h^{\prime\prime}}{h_{s}^{2}},1){\rm d}h^{\prime\prime}{\rm d}h^{\prime}}{\big{[}\int_{1}^{h_{c}}(h^{\prime})^{-\tau}\min(\frac{hh^{\prime}}{h_{s}^{2}},1){\rm d}h^{\prime}\big{]}^{2}}.

σ_{N} (t) = \frac{ln ( c ( h ) / c ( h _{ref} ))}{ln ( N ⟨ h ⟩)}, h = (N ⟨ h ⟩)^{t},

σ_{N} (t) = \frac{ln ( c ( h ) / c ( h _{ref} ))}{ln ( N ⟨ h ⟩)}, h = (N ⟨ h ⟩)^{t},

h_{s} = N ⟨ h ⟩, h_{c} = (N ⟨ h ⟩)^{1/ (τ - 1)},

h_{s} = N ⟨ h ⟩, h_{c} = (N ⟨ h ⟩)^{1/ (τ - 1)},

⟨ h ⟩ = \frac{τ - 1}{τ - 2} \frac{1 - N ^{2 - τ}}{1 - N ^{1 - τ}} .

⟨ h ⟩ = \frac{τ - 1}{τ - 2} \frac{1 - N ^{2 - τ}}{1 - N ^{1 - τ}} .

a = h_{s}^{- 1} = (N ⟨ h ⟩)^{- 1/2}, b = \frac{h _{c}}{h _{s}} = (N ⟨ h ⟩)^{\frac{3 - τ}{2 ( τ - 1 )}},

a = h_{s}^{- 1} = (N ⟨ h ⟩)^{- 1/2}, b = \frac{h _{c}}{h _{s}} = (N ⟨ h ⟩)^{\frac{3 - τ}{2 ( τ - 1 )}},

r (u) = min (u, 1) .

r (u) = min (u, 1) .

c(h)=\frac{\int_{a}^{b}\int_{a}^{b}(xy)^{-\tau}r(ahx)r(ahy)r(xy){\rm d}x{\rm d}y}{\big{[}\int_{a}^{b}x^{-\tau}r(ahx){\rm d}x\big{]}^{2}}.

c(h)=\frac{\int_{a}^{b}\int_{a}^{b}(xy)^{-\tau}r(ahx)r(ahy)r(xy){\rm d}x{\rm d}y}{\big{[}\int_{a}^{b}x^{-\tau}r(ahx){\rm d}x\big{]}^{2}}.

c(h)\approx\frac{\tau-2}{3-\tau}h_{s}^{4-2\tau}\ln\Big{(}\frac{h_{c}^{2}}{h_{s}^{2}}\Big{)}\propto N^{2-\tau}\ln N,

c(h)\approx\frac{\tau-2}{3-\tau}h_{s}^{4-2\tau}\ln\Big{(}\frac{h_{c}^{2}}{h_{s}^{2}}\Big{)}\propto N^{2-\tau}\ln N,

c (h)

c (h)

\int_{a}^{b} x^{1 - τ} d x = \frac{a ^{2 - τ} - b ^{2 - τ}}{τ - 2} .

\int_{a}^{b} x^{1 - τ} d x = \frac{a ^{2 - τ} - b ^{2 - τ}}{τ - 2} .

\frac{a ^{2 - τ} - b ^{2 - τ}}{τ - 2} \approx \frac{a ^{2 - τ}}{τ - 2} .

\frac{a ^{2 - τ} - b ^{2 - τ}}{τ - 2} \approx \frac{a ^{2 - τ}}{τ - 2} .

\int_{a}^{b} \int_{a}^{b} (x y)^{1 - τ} r (x y) d x d y

\int_{a}^{b} \int_{a}^{b} (x y)^{1 - τ} r (x y) d x d y

= \int_{a}^{1/ b} \int_{a}^{b} (x y)^{2 - τ} d x d y + \int_{1/ b}^{b} \int_{a}^{1/ x} (x y)^{2 - τ} d x d y

+ \int_{1/ b}^{b} \int_{1/ x}^{b} (x y)^{1 - τ} d x d y

= \frac{( b ^{τ - 3} - a ^{3 - τ} ) ( b ^{3 - τ} - a ^{3 - τ} )}{( 3 - τ ) ^{2}}

+ \frac{1}{3 - τ} (ln (b^{2}) - \frac{a ^{3 - τ} ( b ^{3 - τ} - b ^{τ - 3} )}{3 - τ})

+ \frac{1}{2 - τ} (\frac{b ^{2 - τ} ( b ^{2 - τ} - b ^{τ - 2} )}{2 - τ} - ln (b^{2}))

= \frac{ln ( b ^{2} )}{( 3 - τ ) ( τ - 2 )} - \frac{1 - b ^{4 - 2 τ}}{( τ - 2 ) ^{2}} + \frac{1 - 2 ( ab ) ^{3 - τ} + a ^{6 - 2 τ}}{( 3 - τ ) ^{2}} .

\frac{3 - τ}{τ - 1} \frac{ln ( N ⟨ h ⟩)}{( 3 - τ ) ( τ - 2 )} ≫ \frac{1}{( τ - 2 ) ^{2}}

\frac{3 - τ}{τ - 1} \frac{ln ( N ⟨ h ⟩)}{( 3 - τ ) ( τ - 2 )} ≫ \frac{1}{( τ - 2 ) ^{2}}

\frac{3 - τ}{τ - 1} \frac{ln ( N ⟨ h ⟩)}{( 3 - τ ) ( τ - 2 )} ≫ \frac{1}{( 3 - τ ) ^{2}},

\frac{3 - τ}{τ - 1} \frac{ln ( N ⟨ h ⟩)}{( 3 - τ ) ( τ - 2 )} ≫ \frac{1}{( 3 - τ ) ^{2}},

c (h) \approx \frac{τ - 2}{3 - τ} a^{2 τ - 4} ln (b^{2}) \propto N^{2 - τ} ln (N),

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Clustering spectrum of scale-free networks

Clara Stegehuis

Remco van der Hofstad

A.J.E.M. Janssen

Johan S.H. van Leeuwaarden

Eindhoven University of Technology, Department of Mathematics and Computer Science, P.O. Box 513, 5600 MB Eindhoven, The Netherlands

(March 8, 2024)

Abstract

Real-world networks often have power-law degrees and scale-free properties such as ultra-small distances and ultra-fast information spreading. In this paper, we study a third universal property: three-point correlations that suppress the creation of triangles and signal the presence of hierarchy. We quantify this property in terms of $\bar{c}(k)$ , the probability that two neighbors of a degree- $k$ node are neighbors themselves. We investigate how the clustering spectrum $k\mapsto\bar{c}(k)$ scales with $k$ in the hidden variable model and show that $c(k)$ follows a universal curve that consists of three $k$ -ranges where $\bar{c}(k)$ remains flat, starts declining, and eventually settles on a power law $\bar{c}(k)\sim k^{-\alpha}$ with $\alpha$ depending on the power law of the degree distribution. We test these results against ten contemporary real-world networks and explain analytically why the universal curve properties only reveal themselves in large networks.

pacs:

89.75.-k Complex systems, 64.60.aq Networks

I Introduction

Most real-world networks have power-law degrees, so that the proportion of nodes having $k$ neighbors scales as $k^{-\tau}$ with exponent $\tau$ between 2 and 3 Albert et al. (1999); Faloutsos et al. (1999); Jeong et al. (2000); Vázquez et al. (2002). Power-law degrees imply various intriguing scale-free network properties, such as ultra-small distances van der Hofstad et al. (2007); Newman et al. (2001) and the absence of percolation thresholds when $\tau<3$ Janson (2009); Pastor-Satorras and Vespignani (2001). Empirical evidence has been matched by random graph null models that are able to explain mathematically why and how these properties arise. This paper deals with another fundamental property observed in many scale-free networks related to three-point correlations that suppress the creation of triangles and signal the presence of hierarchy. We quantify this property in terms of the clustering spectrum, the function $k\mapsto\bar{c}(k)$ with $\bar{c}(k)$ the probability that two neighbors of a degree- $k$ node are neighbors themselves.

In uncorrelated networks the clustering spectrum $\bar{c}(k)$ remains constant and independent of $k$ . However, the majority of real-world networks have spectra that decay in $k$ , as first observed in technological networks including the Internet Pastor-Satorras et al. (2001); Ravasz and Barabási (2003). Figure 1 shows the same phenomenon for a social network: YouTube users as vertices, and edges indicating friendships between them Leskovec and Krevl (2014).

Close inspection suggests the following properties, not only in Fig. 1, but also in the nine further networks in Fig. 2. The right end of the spectrum appears to be of the power-law form $k^{-\alpha}$ ; approximate values of $\alpha$ give rise to the dashed lines; (ii) The power law is only approximate and kicks in for rather large values of $k$ . In fact, the slope of $\bar{c}(k)$ decreases with $k$ ; (iii) There exists a transition point: the minimal degree as of which the slope starts to decline faster and settles on its limiting (large $k$ ) value.

For scale-free networks a decaying $\bar{c}(k)$ is taken as an indicator for the presence of modularity and hierarchy Ravasz and Barabási (2003), architectures that can be viewed as collections of subgraphs with dense connections within themselves and sparser ones between them. The existence of clusters of dense interaction signals hierarchical or nearly decomposable structures. When the function $\bar{c}(k)$ falls off with $k$ , low-degree vertices have relatively high clustering coefficients, hence creating small modules that are connected through triangles. In contrast, high-degree vertices have very low clustering coefficients, and therefore act as bridges between the different local modules. This also explains why $\bar{c}(k)$ is not just a local property, and when viewed as a function of $k$ , measures crucial mesoscopic network properties such as modularity, clusters and communities. The behavior of $\bar{c}(k)$ also turns out to be a good predictor for the macroscopic behavior of the network. Randomizing real-world networks while preserving the shape of the $\bar{c}(k)$ curve produces networks with very similar component sizes as well as similar hierarchical structures as the original network Colomer-de Simón et al. (2013). Furthermore, the shape of $\bar{c}(k)$ strongly influences the behavior of networks under percolation Serrano and Boguñá (2006a). This places the $\bar{c}(k)$ -curve among the most relevant indicators for structural correlations in network infrastructures.

In this paper, we obtain a precise characterization of clustering in the hidden variable model, a tractable random graph null model. We start from an explicit form of the $\bar{c}(k)$ curve for the hidden variable model Boguñá and Pastor-Satorras (2003); Serrano et al. (2007); Dorogovtsev (2004). We obtain a detailed description of the $\bar{c}(k)$ -curve in the large-network limit that provides rigorous underpinning of the empirical observations (i)-(iii). We find that the decay rate in the hidden variable model is significantly different from the exponent $\bar{c}(k)\sim k^{-1}$ that has been found in a hierarchical graph model Ravasz and Barabási (2003) as well as in the preferential attachment model Krot and Ostroumova Prokhorenkova (2015) and a preferential attachment model with enhanced clustering Szabó et al. (2003). Furthermore, we show that before the power-law decay of $\bar{c}(k)$ kicks in, $\bar{c}(k)$ first has a constant regime for small $k$ , and a logarithmic decay phase. This characterizes the entire clustering spectrum of the hidden variable model.

This paper is structured as follows. Section II introduces the random graph model and its local clustering coefficient. Section III presents the main results for the clustering spectrum. Section IV explains the shape of the clustering spectrum in terms of an energy minimization argument, and Section V quantifies how fast the limiting clustering spectrum arises as function of the network size. We conclude with a discussion in Section VI and present all mathematical derivations of the main results in the appendix.

II Hidden variables

As null model we employ the hidden variable model Boguñá and Pastor-Satorras (2003); Park and Newman (2004); Bollobás et al. (2007); Britton et al. (2006); Norros and Reittu (2006). Given $N$ nodes, hidden variable models are defined as follows. Associate to each node a hidden variable $h$ drawn from a given probability distribution function

[TABLE]

for some constant $C$ . Next join each pair of vertices independently according to a given probability $p(h,h^{\prime})$ with $h$ and $h^{\prime}$ the hidden variables associated to the two nodes. Many networks can be embedded in this hidden-variable framework, but particular attention goes to the case in which the hidden variables have themselves the structure of the degrees of a real-world network. In that case the hidden-variable model puts soft constraints on the degrees, which is typically easier to analyze than hard constraints as in the configuration model Clauset et al. (2009); Newman (2003a); Vázquez et al. (2002); Dhara et al. (2016). Chung and Lu Chung and Lu (2002) introduced the hidden variable model in the form

[TABLE]

so that the expected degree of a node equals its hidden variable.

We now discuss the structural and natural cutoff, because both will play a crucial role in the description of the clustering spectrum. The structural cutoff is defined as the largest possible upper bound on the degrees required to guarantee single edges, while the natural cutoff characterizes the maximal degree in a sample of $N$ vertices. For scale-free networks with exponent $\tau\in(2,3]$ the structural cutoff scales as $\sqrt{N}$ while the natural cutoff scales as $N^{1/(\tau-1)}$ , which gives rise to structural negative correlations and possibly other finite-size effects. If one wants to avoid such effects, then the maximal value of the product $hh^{\prime}$ should never exceed $N\langle h\rangle$ , which can be guaranteed by the assumption that the hidden degree $h$ is smaller than the structural cutoff $h_{s}=\sqrt{N\langle h\rangle}$ . While this restricts $p(h,h^{\prime})$ in (2) within the interval $[0,1]$ , banning degrees larger than the structural cutoff strongly violates the reality of scale-free networks, where degrees all the way up to the natural cutoff $(N\langle h\rangle)^{1/(\tau-1)}$ need to be considered. We therefore work with (although many asymptotically equivalent choices are possible; see van der Hofstad et al. (2017) and Appendix A)

[TABLE]

putting no further restrictions on the range of the hidden variables (and hence degrees).

In this paper, we shall work with $c(h)$ , the local clustering coefficient of a randomly chosen vertex with hidden variable $h$ . However, when studying local clustering in real-world data sets, we can only observe $\bar{c}(k)$ , the local clustering coefficient of a vertex of degree $k$ . In Appendix C we show that the approximation $\bar{c}(h)\approx c(h)$ is highly accurate. We start from the explicit expression for $c(h)$ Boguñá and Pastor-Satorras (2003), which measures the probability that two randomly chosen edges from $h$ are neighbors, i.e.,

[TABLE]

with $p(h^{\prime}|h)$ the conditional probability that a randomly chosen edge from an $h$ -vertex is connected to an $h^{\prime}$ -vertex and $p(h,h^{\prime})$ as in (3). The goal is now to characterize the $c(h)$ -curve (and hence the $\bar{c}(k)$ -curve).

III Universal clustering spectrum

The asymptotic evaluation of the double integral (4) in the large- $N$ regime reveals three different ranges, defined in terms of the scaling relation between the hidden variable $h$ and the network size $N$ . The three ranges together span the entire clustering spectrum as shown in Fig. 3. The detailed calculations are deferred to Appendix A.

The first range pertains to the smallest-degree nodes, i.e., vertices with a hidden variable that does not exceed $N^{\beta(\tau)}$ with $\beta(\tau)=\frac{\tau-2}{\tau-1}$ . In this case we show that

[TABLE]

In particular, here the local clustering does not depend on the degree and in fact corresponds with the large- $N$ behavior of the global clustering coefficient van der Hofstad et al. (2017); Colomer-de Simon and Boguñá (2012). Note that the interval $[0,\beta(\tau)]$ diminishes when $\tau$ is close to 2, a possible explanation for why the flat range associated with Range I is hard to recognize in some of the real-world data sets.

Range II considers nodes with hidden variables (degrees) above the threshold $N^{\beta(\tau)}$ , but below the structural cutoff $\sqrt{N}$ . These nodes start experiencing structural correlations, and close inspection of the integral (4) yields

[TABLE]

This range shows relatively slow, logarithmic decay in the clustering spectrum, and is clearly visible in the ten data sets.

Range III considers hidden variables above the structural cutoff, when the restrictive effect of degree-degree correlations becomes more evident. In this range we find that

[TABLE]

hence power-law decay with a power-law exponent $\alpha=2(3-\tau)$ . Such power-law decay has been observed in many real-world networks Vázquez et al. (2002); Ravasz and Barabási (2003); Serrano and Boguñá (2006b); Catanzaro et al. (2004); Leskovec (2008); Krioukov et al. (2012), where most networks were found to have the power-law exponent close to one. The asymptotic relation (7) shows that the exponent $\alpha$ decreases with $\tau$ and takes values in the entire range $(0,2)$ . Table 1 contains estimated values of $\alpha$ for the ten data sets.

IV Energy minimization

We now explain why the clustering spectrum splits into three ranges, using an argument that minimizes the energy needed to create triangles among nodes with specific hidden variables.

In all three ranges for $h$ , there is one type of ‘most likely’ triangle, as shown in Fig. 4. This means that most triangles containing a vertex $v$ with hidden variable $h$ are triangles with two other vertices $v^{\prime}$ and $v^{\prime\prime}$ with hidden variables $h^{\prime}$ and $h^{\prime\prime}$ of specific sizes, depending on $h$ . The probability that a triangle is present between $v$ , $v^{\prime}$ and $v^{\prime\prime}$ can be written as

[TABLE]

While the probability that such a triangle exists among the three nodes thus increases with $h^{\prime}$ and $h^{\prime\prime}$ , the number of such nodes decreases with $h^{\prime}$ and $h^{\prime\prime}$ because vertices with higher $h$ -values are rarer. Therefore, the maximum contribution to $c(h)$ results from a trade-off between large enough $h^{\prime},h^{\prime\prime}$ for likeliness of occurrence of the triangle, and $h^{\prime},h^{\prime\prime}$ small enough to have enough copies. Thus, having $h^{\prime}>N\langle h\rangle/h$ is not optimal, since then the probability that an edge exists between $v$ and $v^{\prime}$ no longer increases with $h^{\prime}$ . This results in the bound

[TABLE]

Similarly, $h^{\prime}h^{\prime\prime}>N\langle h\rangle$ is also suboptimal, since then further increasing $h^{\prime}$ and $h^{\prime\prime}$ does not increase the probability of an edge between $v^{\prime}$ and $v^{\prime\prime}$ . This gives as a second bound

[TABLE]

In Ranges I and II, $h<\sqrt{N\langle h\rangle}$ , so that $N\langle h\rangle/h>\sqrt{N\langle h\rangle}$ . In this situation we reach bound (10) before we reach bound (9). Therefore, the maximum contribution to $c(h)$ comes from $h^{\prime}h^{\prime\prime}\approx N$ , where also $h^{\prime},h^{\prime\prime}<N\langle h\rangle/h$ because of the bound (9). Here the probability that the edge between $v^{\prime}$ and $v^{\prime\prime}$ exists is large, while the other two edges have a small probability to be present, as shown in Fig. 4a. Note that for $h$ in Range I, the bound (9) is superfluous, since in this regime $N\langle h\rangle/h>h_{c}$ , while the network does not contain vertices with hidden variables larger than $h_{c}$ . This bound indicates the minimal values of $h^{\prime}$ such that an $h$ -vertex is guaranteed to be connected to an $h^{\prime}$ -vertex. Thus, vertices in Range I are not even guaranteed to have connections to the highest degree vertices, hence they are not affected by the single-edge constraints. Therefore the value of $c(h)$ in Range I is independent of $h$ .

In Range III, $h>\sqrt{N\langle h\rangle}$ , so that $N\langle h\rangle/h<\sqrt{N\langle h\rangle}$ . Therefore, we reach bound (9) before we reach bound (10). Thus, we maximize the contribution to the number of triangles by choosing $h^{\prime},h^{\prime\prime}\approx N\langle h\rangle/h$ . Then the probability that the edge from $v$ to $v^{\prime}$ and from $v$ to $v^{\prime\prime}$ is present is large, while the probability that the edge between $v^{\prime}$ and $v^{\prime\prime}$ exists is small, as illustrated in Fig. 4b.

V Convergence rate

We next ask how large networks should be, or become, before they reveal the features of the universal clustering spectrum. In other words, while the results in this paper are shown for the large- $N$ limit, for what finite $N$ -values can we expect to see the different ranges and clustering decay? To bring networks of different sizes $N$ on a comparable footing, we consider

[TABLE]

for $0\leq t\leq\tfrac{1}{\tau-1}$ . The slope of $\sigma_{N}(t)$ can be interpreted as a measure of the decay of $c(h)$ at $h=(N\langle h\rangle)^{t}$ , and all curves share the same right end of the spectrum; see Appendix B for more details. Figure 5 shows this rescaled clustering spectrum for synthetic networks generated with the hidden variable model with $\tau=2.25$ . Already $10^{4}$ vertices reveal the essential features of the spectrum: the decay and the three ranges. Increasing the network size further to $10^{5}$ and $10^{6}$ nodes shows that the spectrum settles on the limiting curve. Here we note that the real-world networks reported in Figs. 1 and 2 are also of order $10^{5}$ - $10^{6}$ nodes, see Table 1.

Figure 5 also brings to bear a potential pitfall when the goal is to obtain statistically accurate estimates for the slope of $c(h)$ . Observe the extremely slow convergence to the limiting curve for $N=\infty$ ; a well documented property of certain clustering measures Boguñá et al. (2009); Colomer-de Simon and Boguñá (2012); Janssen and van Leeuwaarden (2015); van der Hofstad et al. (2017). In Appendix B we again use the integral expression (4) to characterize the limiting curve for $N=\infty$ and the rate of convergence as function of $N$ , and indeed extreme $N$ -values are required for statistically reliable slope estimates for e.g. $t$ -values of $\tfrac{1}{2}$ and $\tfrac{1}{\tau-1}$ ; this is also apparent from visual inspection of Fig. 5. Therefore, the estimates in Table 1 only serve as indicative values of $\alpha$ . Finally, observe that Range II disappears in the limiting curve, due to the rescaling in (11), but again only for extreme $N$ -values. Because this paper is about structure rather than statistical estimation, the slow convergence in fact provides additional support for the persistence of Range II in Figs. 1 and 2.

Table 1 also shows that the relation $\alpha=-2(3-\tau)$ is inaccurate for the real-world data sets, in turn affecting the theoretical boundaries of the three regimes indicated in Fig. 2. One explanation for this inaccuracy is that the real-world networks might not follow pure power-law distributions, as measured by the goodness of fit criterion in Table 1, and visualized in Appendix D. Furthermore, real-world networks are usually highly clustered and contain community structures, whereas the hidden variable model is locally tree-like. These modular structure may explain, for example, why the power-law decay of the hidden variable model is less pronounced in the three social networks of Fig. 2. It is remarkable that despite these differences between hidden variable models and real-world networks, the global shape of the $c(k)$ curve of the hidden variable model is still visible in these heavy-tailed real-world networks.

VI Discussion

The hidden variable model gives rise to single-edge networks in which pairs of vertices can only be connected once. Hierarchical modularity and the decaying clustering spectrum have been contributed to this restriction that no two vertices have more than one edge connecting them Pastor-Satorras et al. (2001); Maslov et al. (2004); Park and Newman (2003); Newman (2002, 2003b). The physical intuition is that the single-edge constraint leads to far fewer connections between high-degree vertices than anticipated based on randomly assigned edges. We have indeed confirmed this intuition, not only through analytically revealing the universal clustering curve, but also by providing an alternative derivation of the three ranges based on energy minimization and structural correlations.

We now show that the clustering spectrum revealed using the hidden variable model, also appears for a second widely studied null model. This second model cannot be the Configuration Model (CM), which preserves the degree distribution by making connections between vertices in the most random way possible Bollobás (1980); Newman et al. (2001). Indeed, because of the random edge assignment, the CM has no degree correlations, leading in the case of scale-free networks with diverging second moment to uncorrelated networks with non-negligible fractions of self-loops (a vertex joined to itself) and multiple connections (two vertices connected by more than one edge). This picture changes dramatically when self-loops and multiple edges are avoided, a restriction mostly felt by the high-degree nodes, who can no longer establish multiple edges among each other.

We therefore consider the Erased Configuration Model (ECM) that takes a sample from the CM and then erases all the self-loops and multiple edges. While this removes some of the edges in the graph, thus violating the hard constraint, only a small proportion of the edges is removed, so that the degree of vertex $j$ in ECM is still close to $D_{j}$ (van der Hofstad, 2017, Chapter 7). In the ECM, the probability that a vertex with degree $D_{i}$ is connected to a vertex with degree $D_{j}$ can be approximated by $1-{\rm e}^{-D_{i}D_{j}/\langle D\rangle N}$ (van der Hofstad et al., 2005, Eq.(4.9)). Therefore, we expect the ECM and the hidden variable model to have similar properties (see e.g. van der Hofstad et al. (2017)) when we choose

[TABLE]

Figure 6 illustrates how both null models generate highly similar spectra, which provides additional support for the claim that the clustering spectrum is a universal property of simple scale-free networks. The ECM is more difficult to deal with compared to hidden variable models, since edges in ECM are not independent. In particular, we expect that these dependencies vanish for the $k\mapsto\bar{c}(k)$ curve. Establishing the universality of the $k\mapsto\bar{c}(k)$ curve for other random graph null models such as ECM, networks with an underlying geometric space Serrano et al. (2008) or hierarchical configuration models Stegehuis et al. (2016) is a major research direction.

The ECM and the hidden variable model are both null models with soft constraints on the degrees. Putting hard constraints on the degrees with the CM, has the nice property that simple graphs generated using this null model are uniform samples of all simple graphs with the same degree sequence. Dealing with such uniform samples is notoriously hard when the second moment of the degrees is diverging, for example since the CM will yield many edges between high-degree vertices. This makes sampling uniform graphs difficult Milo et al. (2003); Viger and Latapy (2005); Genio et al. (2010). Thus, the joint requirement of hard degree and single-edge constraints, as in the CM, presents formidable technical challenges. Whether our results for the $k\mapsto\bar{c}(k)$ curve for soft-constraint models also carry over to these uniform simple graphs is a challenging open problem.

In this paper we have investigated the presence of triangles in the hidden variable model. We have shown that by first conditioning on the node degree, there arises a unique ‘most likely’ triangle with two other vertices of specific degrees. We have not only explained this insight heuristically, but it is also reflected in the elaborate analysis of the double integral for $c(h)$ in Appendix A. As such, we have introduced an intuitive and tractable mathematical method for asymptotic triangle counting. It is likely that the method carries over to counting other motifs, such as squares, or complete graphs of larger sizes. For any given motif, and first conditioning on the node degree, we again expect to find specific configuration that are most likely. Further mathematical challenges need to be overcome, though, because we expect that the ‘most likely’ configurations critically depend on the precise motif topologies and the associated energy minimization problems.

Acknowledgements.

This work is supported by NWO TOP grant 613.001.451 and by the NWO Gravitation Networks grant 024.002.003. The work of RvdH is further supported by the NWO VICI grant 639.033.806. The work of JvL is further supported by an NWO TOP-GO grant and by an ERC Starting Grant.

Appendix A Derivation for the three ranges

In this appendix, we compute $c(h)$ in (4), and we show that $c(h)$ can be approximated by (5), (6), or (7), depending on the value of $h$ . Throughout the appendix, we assume that $p(h,h^{\prime})=\min(1,hh^{\prime}/h_{s}^{2})$ and $\rho(h)=Ch^{-\tau}$ . Then, the derivation of $c(h)$ in Colomer-de Simón et al. (2013) yields

[TABLE]

Computing $c(h)$ will also allow us to compute

[TABLE]

for $0\leq t\leq\tfrac{1}{\tau-1}$ , where $h_{\text{ref}}\in[0,h_{c}]$ is fixed. We are interested in computing the value of $\sigma_{N}(t)$ for large values of $N$ .

Adopting the standard choices van der Hofstad et al. (2017)

[TABLE]

and setting $h_{\min}=1$ gives

[TABLE]

For ease of notation in the proofs below, we will use

[TABLE]

and

[TABLE]

In this notation, (A) can be succinctly written as

[TABLE]

Because of the four min operators in the expression (A), we have to consider various $h$ -ranges. We compute the value of $c(h)$ in these three ranges one by one.

Range I: $h<h_{s}^{2}/h_{c}$ .

We now show that in this range

[TABLE]

which proves (5).

This range corresponds to $h<1/(ab)$ with $a$ and $b$ as in (17). In this range, $r(ahx)=ahx$ and $r(ahy)=ahy$ for all $x\in[a,b]$ . This yields for $c(h)$

[TABLE]

For the denominator we compute

[TABLE]

Since $a\ll b$ , this can be approximated as

[TABLE]

We can compute the numerator of (21) as

[TABLE]

The first of these three terms dominates when

[TABLE]

and

[TABLE]

where we have used that $b^{2}=(N\langle h\rangle)^{(3-\tau)/(\tau-1)}$ . Thus, when $\ln(N\langle h\rangle)$ is large compared to $(\tau-1)/(\tau-2)$ and $(\tau-1)(\tau-2)/(\tau-3)^{2}$ , we obtain

[TABLE]

which proves (20).

Range II: $h_{s}^{2}/h_{c}<h<h_{s}$

In this range, we show that

[TABLE]

for some positive constant $M$ , which proves (6).

This range corresponds to $(ab)^{-1}<h<a^{-1}$ . For these values of $h$ , we have $ahx,ahy=1$ for $x,y=(ah)^{-1}\in(1,b)$ and $xy=1$ for $y=1/x\in[a,b]$ when $b^{-1}<x<b$ . Then for the denominator of (19) we compute

[TABLE]

Splitting up the integral in the numerator results in

[TABLE]

where the factors 2 arise by symmetry of the integrand in $x$ and $y$ . Computing these integrals yields

[TABLE]

We have $ah<1<ahb$ and so the leading behavior of $\text{Num}(h)$ is determined by the terms involving $\ln((ah)^{-2})$ in $I_{3}$ and $I_{4}$ , all other terms being bounded. Retaining only these dominant terms, we get

[TABLE]

provided that $ah\to 0$ as $N\to\infty$ . In terms of the variable $t$ in $h=(N\langle h\rangle)^{t}$ , see (11) and (14), this condition holds when we restrict to $t\in[(\tau-2)/(\tau-1),\tfrac{1}{2}-\varepsilon]$ for any $\varepsilon>0$ . Furthermore, from (29),

[TABLE]

Hence, when $ah\to 0$ , we have

[TABLE]

We compute $c(h=1/a)$ asymptotically by retaining only all constant terms between brackets in (31)-(37) since all other terms vanish or tend to 0 as $N\to\infty$ . This gives

[TABLE]

where $P=\frac{1}{(\tau-1)^{2}}+\frac{1}{(3-\tau)^{2}}+\frac{2}{\tau-1}+\frac{2}{3-\tau}$ . Together with (39), we find

[TABLE]

In van der Hofstad et al. (2017), it has been shown that $c(h)$ decreases in $h$ , and then (28) follows from (40) and (42).

Range III: $h_{s}<h<h_{c}$ .

We now show that when $h_{s}<h<h_{c}$ , then

[TABLE]

which proves (7).

This range corresponds to $1/a<h<b/a$ . The denominator of (19) remains the same as in the previous range and is given by (29). Splitting up the integral in the numerator of (19) now results in

[TABLE]

Computing these integrals yields

[TABLE]

A careful inspection of the terms between brackets in (46) and (54) shows that the terms involving $(ah)^{2\tau-6}$ are dominant when $ah\to\infty$ . In terms of the variable $t$ in $h=(N\langle h\rangle)^{t}$ , see (11) and (14), we have that $ah\to\infty$ when we restrict to $t\in[\tfrac{1}{2}+\varepsilon,1/(\tau-1)]$ for any $\varepsilon>0$ . When we retain only these dominant terms, we have, when $ah\to\infty$ ,

[TABLE]

Using (39) again, we get, when $ah\to\infty$ ,

[TABLE]

Furthermore, $c(1/a)$ is given by (42), while $c(h)$ decreases in $h$ . This gives (43).

Other connection probabilities

In van der Hofstad et al. (2017) we have presented a class of functions $r(u)=uf(u)$ , $u\geq 0$ , so that

[TABLE]

has appropriate monotonicity properties. The maximal member $r(u)=\min(u,1)$ of this class yields $p$ in (3) and is quite representative of the whole class, while allowing explicit computation and asymptotic analysis of $c(h)$ as in van der Hofstad et al. (2017) and this paper. Figure 7 shows that other asymptotically equivalent choices such as $r(u)=u/(1+u)$ and $r(u)=1-{\rm e}^{-u}$ have comparable clustering spectra. A minor difference is that the choice $r(u)=\min(1,u)$ for $p$ in (3) forces $c(h)$ to be constant on the range $h\leq N^{\beta(\tau)}$ , while the other two choices show a gentle decrease.

Limiting form of $\sigma_{N}(t)$ and finite-size effects

We consider $\sigma_{N}(t)$ as in (14) with $h_{\text{ref}}=0$ . Using (20), (28) and (43), it is readily seen that

[TABLE]

Hence, some of the detailed information that is present in (20), (28) and (43), disappears when taking the limit as in (58). This is in particular so for the $\ln N$ -factor in (20) and the logarithmic decaying factor $\ln(N^{2}/h)$ in Region II.

Consider $\sigma_{N}(t)$ of (14) with $h_{\text{ref}}=h_{c}$ as is done in Fig. 5. It follows from the detailed form of (20) and (43), that

[TABLE]

where

[TABLE]

We have that $\sigma_{N}(0)\to\gamma$ as $N\to\infty$ , and the right-hand side of (59) exceeds this limit $\gamma$ from $y=1/\beta$ onwards with a maximum excess $\beta/{\rm e}$ for $N\langle h\rangle$ as large as $\exp({\rm e}/\beta)$ . This explains why the excess of $\sigma_{N}(0)$ over its limit value in Fig. 5 with ${\rm e}^{{\rm e}/\beta}=3\times 10^{10}$ when $\tau=9/4$ persists.

Appendix B Exact and asymptotic result for decay rate of $c(h)$ at $h=h_{c}$ and $h=h_{s}$

We let $h_{c}=(N\langle h\rangle)^{1/(\tau-1)}$ , where we assume that $N$ is so large that $h_{c}\leq N$ . This requires $N$ to be of the order $(1/\varepsilon)^{1/\varepsilon}$ , where $\varepsilon=\tau-2$ . We again consider the function $\sigma_{N}(t)$ of (11),

[TABLE]

for $0\leq t\leq\tfrac{1}{\tau-1}$ and $h_{\text{ref}}$ is fixed, so that

[TABLE]

When we fix a $t_{0}$ and linearize $\sigma_{N}(t)$ around $t_{0}$ , we get

[TABLE]

so that $\sigma_{N}^{\prime}(t)=\frac{\text{d}}{\text{dt}}\sigma_{N}(t)$ is a measure for the decay rate of $c(h)$ at $h=h_{0}=(N\langle h\rangle)^{t_{0}}$ .

In this appendix, we compute an exact expression for $\sigma_{N}^{\prime}(t)$ at $t=\tfrac{1}{\tau-1}$ , we compute its limit as $N\to\infty$ and discuss convergence speed, and we show that this limit is a lower bound for $\sigma_{N}^{\prime}(t)$ .

More precisely, we show the following result:

Proposition 1.

Let $a$ and $b$ be as in (17). Then,

[TABLE]

where

[TABLE]

Furthermore,

[TABLE]

for all $N$ .

The limiting value in (69) is consistent with the limiting value of $\sigma_{N}(t)$ that has been found in (58). We assess this convergence result with plots. While these indicate that the limits are only reached for very large $N$ , especially when $\tau$ is close to 2, it can also be seen that the limiting shape of $\sigma_{N}(t)$ already shows up for considerably smaller $N$ .

To start the proof of Proposition 1, note that in the $a,b$ notation of (17),

[TABLE]

where

[TABLE]

with $f(u)=\min(1,u^{-1})$ . Note that $r(u)=uf(u)$ , see (18). We compute

[TABLE]

where the prime on $c$ indicates differentiation with respect to $h$ . With (70) we get

[TABLE]

and we have to evaluate $K(h),K^{\prime}(h),J(h)$ and $J^{\prime}(h)$ at

[TABLE]

Lemma 1.

[TABLE]

with $A,C,D,E$ as in (65)–(68).

From Lemma 1, (73) and (75) we get (64) in Proposition 1.

Proof of Lemma 1. Since $h_{c}=b/a$ ,

[TABLE]

With $f(u)=\min(1,u^{-1})$ we split up the integration range $[a,b]\times[a,b]$ into the four regions $[a,1/b]\times[a,1/b]$ , $[1/b,b]\times[1/b,b],[1/b,b]\times[a,1/b]$ and $[a,1/b]\times[1/b,b]$ , where we observe that $a\leq 1/b\leq 1\leq b$ . We first get

[TABLE]

Next,

[TABLE]

The remaining double integral with $\tau+1$ instead of $\tau$ has been evaluated in (van der Hofstad et al., 2017, Appendix C, (C3)) as

[TABLE]

Finally, the two double integrals over $[1/b,b]\times[a,1/b]$ and $[a,1/b]\times[1/b,b]$ are by symmetry both equal to

[TABLE]

Here we have used that, see (17),

[TABLE]

Now the expression in (76) for $K(h_{c})$ follows.

To evaluate $K^{\prime}(h_{c})$ , we observe by symmetry that

[TABLE]

At $h=h_{c}$ , we have $ah=b$ , and so

[TABLE]

Now $uf^{\prime}(u)=0$ for $0\leq u\leq 1$ and $uf^{\prime}(u)=-f(u)$ for $u\geq 1$ . Hence, splitting up the integration range into the four regions as earlier, we see that those over $[a,1/b]\times[a,1/b]$ and $[a,1/b]\times[1/b,b]$ vanish while those over $[1/b,b]\times[1/b,b]$ and $[1/b,b]\times[a,1/b]$ give rise to the same double integrals as in (80) and (82) respectively. This yields the expression in (76) for $K^{\prime}(h_{c})$ .

The evaluation of $J(h_{c})$ and $J^{\prime}(h_{c})$ is straightforward from (72) with $ah=b$ and a splitting of the integration range $[a,b]$ into $[a,1/b]$ and $[1/b,b]$ . This yields (77), and the proof of Lemma 1 is complete.

We now turn to the limiting behavior of $\sigma_{N}^{\prime}(\tfrac{1}{\tau-1})$ as $N\to\infty$ . For this we write

[TABLE]

in which

[TABLE]

as $N\to\infty$ . Hence, $D/(D+E)\to 0$ as $N\to\infty$ . Furthermore, we write

[TABLE]

and

[TABLE]

where

[TABLE]

Now, using (83), we have

[TABLE]

as $N\to\infty$ . Thus, we get

[TABLE]

and this yields (69).

Note that $D/(D+E)$ approaches 0 much slower than the limit in (93) is reached when $\tau$ is close to 2, compare (88) and (93). Thus, we can concentrate on $D/(D+E)$ , and the relative deviation of $\sigma_{N}^{\prime}(t)$ from $-2(3-\tau)$ is approximately

[TABLE]

We finally turn to the inequality in (69) in Proposition 1. Obviously, we have

[TABLE]

We shall show that

[TABLE]

where

[TABLE]

the asymptotic form of $A$ and $C$ as $N\to\infty$ obtained from (90) and (89) by deleting $F$ and $(ab)^{3-\tau}$ , respectively. The function

[TABLE]

is decreasing in $x\geq 0$ , and so it suffices to show that

[TABLE]

We have from (89) that

[TABLE]

and from (90) and (91) that

[TABLE]

Using that $(ab)^{3-\tau}=b^{-2(\tau-2)}$ , see (92), we see that the inequality $C_{\text{as}}/C\leq A_{\text{as}}/A$ in (99) is equivalent to

[TABLE]

Using that $(1-u)^{2}-(1-u)=-u(1-u)$ and dividing through by $u=b^{-2(\tau-2)}$ , we see that (102) is equivalent to

[TABLE]

With $y=\ln(b^{2})\geq 0$ , we write (103) as

[TABLE]

Taylor development of $K(y)$ at $y=0$ yields

[TABLE]

Furthermore,

[TABLE]

Therefore, $K(0)=K^{\prime}(0)=0$ , while $K^{\prime\prime}(y)>0$ for $y>0$ . This gives $K(y)>0$ when $y>0$ , as required.

Similar to Proposition 1, we can derive the following result for $\sigma_{N}^{\prime}(\tfrac{1}{2})$ :

Proposition 2.

[TABLE]

where

[TABLE]

Furthermore,

[TABLE]

for all $N$ .

Figure 8 shows the values of $\sigma_{N}^{\prime}(\tfrac{1}{2})$ and $\sigma_{N}^{\prime}(\tfrac{1}{\tau-1})$ for finite-size networks together with its limiting value. For example, when $\tau=2.25$ , Fig. 8a shows that $N$ needs to be of the order $10^{16}$ for the slope to be ‘close’ to its limiting value -1.5. When for example $N=10^{6}$ we see that the slope is much smaller: approximately -1.1. This makes statistical estimation of the true underling power-law exponent $\alpha$ extremely challenging, especially for the relevant regime $\tau$ close to $2$ , because enormous amounts of data should be available to get sufficient statistical accuracy. Most data sets, even the largest available networks used in this paper, are simply not large enough to have sufficiently many samples from the large-degree region to get a statistically accurate estimate of the power-law part. This also explains why based on smaller data sets it is common to assume that $\alpha$ is roughly one Vázquez et al. (2002); Ravasz and Barabási (2003); Serrano and Boguñá (2006b); Catanzaro et al. (2004); Leskovec (2008); Krioukov et al. (2012). Comparing Fig. 8a and Fig. 8b shows that the convergence to the limiting value is significantly faster at the point $t=\tfrac{1}{2}$ than at the point $t=\tfrac{1}{\tau-1}$ .

Appendix C From hidden variables to degrees

In this paper, we focus on computing $c(h)$ , the local clustering coefficient of a randomly chosen vertex with hidden variable $h$ . However, when studying local clustering in real-world data sets, we can only observe $\bar{c}(k)$ , the local clustering coefficient of a vertex of degree $k$ . In this appendix, we show that for the hidden variable model, the difference between these two methods of computing the clustering coefficient is small and asymptotically negligible. We consider

[TABLE]

We define $\bar{c}(k)$ as the average clustering coefficient over all vertices of degree $k$ . By Colomer-de Simon and Boguñá (2012), the probability that a vertex with hidden variable $h$ has degree $k$ equals

[TABLE]

Then, by Colomer-de Simon and Boguñá (2012),

[TABLE]

where $\bar{c}(k)=0$ for $k<2$ because a vertex with degree less than 2 cannot be part of a triangle. Here

[TABLE]

is the probability that a randomly chosen vertex has degree $k$ .

First we consider the case where $h>N^{\frac{\tau-2}{\tau-1}}$ . The Chernoff bound gives for the tails of the Poisson distribution that

[TABLE]

Let $k(h)$ be the degree of a node with hidden variable $h$ . Then, for any $M>1$

[TABLE]

and for any $\delta\in(0,1)$ ,

[TABLE]

Because ${\rm e}^{x-1}/x^{x}<1$ for $x\neq 1$ , (119) and (120) tend to zero as $h\to\infty$ . Therefore, for $h$ large,

[TABLE]

with high probability. Therefore, when $k$ is large,

[TABLE]

Thus, $c(h)$ is very similar to $\bar{c}(k)$ .

On the other hand, for $h\ll h_{s}^{2}/h_{c}$ ,

[TABLE]

which is small by the assumption on $h$ . Thus,

[TABLE]

Furthermore, $c(h)=c(0)$ in this regime of $h$ . This results in

[TABLE]

Therefore, $\bar{c}(h)\approx c(h)$ also when $h$ is small.

Figure 9 shows that indeed the difference between $\bar{c}(k)$ and $c(k)$ is small. When $\tau$ approaches 2, the difference becomes larger. We see that for small values of $k$ , $\bar{c}(k)$ and $c(k)$ are not very close. This is due to the fact that (113) does not take into account that a vertex with hidden variable $h$ may have less than 2 neighbors, so that its local clustering is zero. In van der Hofstad et al. (2017) we show how to adjust (19) to account for this.

Appendix D Degree distributions

Figure 10 shows the degree distributions of all ten networks of Table 1.

Bibliography50

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Albert et al. (1999) R. Albert, H. Jeong, and A.-L. Barabási, Nature 401 , 130 (1999) . · doi ↗
2Faloutsos et al. (1999) M. Faloutsos, P. Faloutsos, and C. Faloutsos, in ACM SIGCOMM Computer Communication Review , Vol. 29 (ACM, 1999) pp. 251–262.
3Jeong et al. (2000) H. Jeong, B. Tombor, R. Albert, Z. N. Oltvai, and A.-L. Barabási, Nature 407 , 651 (2000).
4Vázquez et al. (2002) A. Vázquez, R. Pastor-Satorras, and A. Vespignani, Phys. Rev. E 65 , 066130 (2002) . · doi ↗
5van der Hofstad et al. (2007) R. van der Hofstad, G. Hooghiemstra, and D. Znamenski, Electron. J. Probab. 12 , 703 (2007) . · doi ↗
6Newman et al. (2001) M. E. J. Newman, S. H. Strogatz, and D. J. Watts, Phys. Rev. E 64 , 026118 (2001) . · doi ↗
7Janson (2009) S. Janson, Electron. J. Probab. 14 , 86 (2009) . · doi ↗
8Pastor-Satorras and Vespignani (2001) R. Pastor-Satorras and A. Vespignani, Phys. Rev. Lett. 86 , 3200 (2001) . · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Clustering spectrum of scale-free networks

Abstract

pacs:

I Introduction

II Hidden variables

III Universal clustering spectrum

IV Energy minimization

V Convergence rate

VI Discussion

Acknowledgements.

Appendix A Derivation for the three ranges

Range I: h<hs2/hch<h_{s}^{2}/h_{c}h<hs2​/hc​.

Range II: hs2/hc<h<hsh_{s}^{2}/h_{c}<h<h_{s}hs2​/hc​<h<hs​

Range III: hs<h<hch_{s}<h<h_{c}hs​<h<hc​.

Other connection probabilities

Limiting form of σN(t)\sigma_{N}(t)σN​(t) and finite-size effects

Appendix B Exact and asymptotic result for decay rate of c(h)c(h)c(h) at h=hch=h_{c}h=hc​ and h=hsh=h_{s}h=hs​

Proposition 1**.**

Lemma 1**.**

Proposition 2**.**

Appendix C From hidden variables to degrees

Appendix D Degree distributions

Range I: $h<h_{s}^{2}/h_{c}$ .

Range II: $h_{s}^{2}/h_{c}<h<h_{s}$

Range III: $h_{s}<h<h_{c}$ .

Limiting form of $\sigma_{N}(t)$ and finite-size effects

Appendix B Exact and asymptotic result for decay rate of $c(h)$ at $h=h_{c}$ and $h=h_{s}$

Proposition 1.

Lemma 1.

Proposition 2.