Multiscale unfolding of real networks by geometric renormalization

Guillermo García-Pérez; Marián Boguñá; M. Ángeles Serrano

arXiv:1706.00394·cond-mat.dis-nn·July 4, 2018

TL;DR

This paper introduces a geometric renormalization group for complex networks, revealing their multiscale structure and self-similarity, which enables better understanding, modeling, and navigation of large-scale networks.

Contribution

It develops a novel geometric renormalization method to analyze and unfold real networks across multiple scales, highlighting their self-similar properties and practical applications.

Findings

01

Real networks exhibit geometric scaling consistent with underlying models.

02

Multiscale unfolding reveals coexisting scales and their interactions.

03

The approach improves network modeling and navigation in hyperbolic space.

Tables

Table 1. Overview of the considered real-world networks. Details for each dataset can be found in Appendix A.

Name	Type	Nodes	$N$	$γ$	$β$	$⟨ k ⟩$	$⟨ c ⟩$
Internet	Technological	Autonomous systems	23748	2.17	1.44	4.92	0.61
Metabolic	Biological	Metabolites	1436	2.6	1.3	6.57	0.54
Music	Script	Chords	2476	2.27	1.1	16.66	0.82
Airports	Transportation	World airports	3397	1.88	1.7	11.32	0.63
Proteome	Biological	Proteins	4100	2.25	1.001	6.52	0.09
Words	Script	Words	7377	2.25	1.01	11.99	0.47

Equations

$$\mathcal{M}(T,G)\overset{\mathbb{F}_{r}}{\longrightarrow}\mathcal{M}^{\prime}(T^{\prime},G^{\prime}). \tag{1}$$

$$\mathcal{M}^{(l+1)}(T^{(l+1)},G^{(l+1)})=\mathbb{F}_{r}\left[\mathcal{M}^{(l)}(T^{(l)},G^{(l)})\right]. \tag{2}$$

$$\kappa_{i}^{(l+1)}=\left(\sum_{j=1}^{r}\left(\kappa_{j}^{(l)}\right)^{\beta}\right)^{1/\beta}, \tag{3}$$

$$\theta_{i}^{(l+1)}=\left[\frac{\sum_{j=1}^{r}\left(\theta_{j}^{(l)}\kappa_{j}^{(l)}\right)^{\beta}}{\sum_{j=1}^{r}\left(\kappa_{j}^{(l)}\right)^{\beta}}\right]^{1/\beta}, \tag{4}$$

$$\nu=\frac{2}{\gamma-1}-1, \tag{5}$$

$$\nu=\frac{2}{\beta}-1. \tag{6}$$

$$p_{ij}=\frac{1}{1+\chi_{ij}^{\beta}}=\frac{1}{1+\left(\frac{d_{a,ij}}{\mu\kappa_{i}\kappa_{j}}\right)^{\beta}}, \tag{7}$$

$$R_{\mathbb{H}^{2}}=2\ln\left(\frac{2R}{\mu\kappa_{0}^{2}}\right),$$

$$r_{i}=R_{\mathbb{H}^{2}}-2\ln\frac{\kappa_{i}}{\kappa_{0}},$$

$$p_{ij}=\frac{1}{1+e^{\frac{\beta}{2}\left(x_{ij}-R_{\mathbb{H}^{2}}\right)}},$$

$$d_{\mathbb{H}^{2}}=\operatorname{acosh}\left(\cosh r_{i}\cosh r_{j}-\sinh r_{i}\sinh r_{j}\cos\Delta\theta_{ij}\right).$$

$$q_{ij}^{(l)}=\frac{p_{ij,new}^{(l)}}{p_{ij}^{(l)}}.$$

$$P\left\{a_{ij,new}^{(l)}=1\right\}=p_{ij}^{(l)}q_{ij}^{(l)}=p_{ij,new}^{(l)},$$

$$P\left\{a_{ij,new}^{(l)}=0\right\}=1-p_{ij}^{(l)}+p_{ij}^{(l)}\left(1-q_{ij}^{(l)}\right)=1-p_{ij,new}^{(l)},$$

$$\mu_{new}^{(l)}=\frac{\langle k^{(0)}\rangle}{\langle k^{(l)}\rangle}\mu^{(l)}.$$

$$H=-\sum_{i<j}J_{ij}a_{ij}s_{i}s_{j},$$

$$\dot{\theta}_{i}=\omega_{i}+\sigma\sum_{i<j}a_{ij}\sin\left(\theta_{j}(t)-\theta_{i}(t)\right),$$

$$\frac{m+\alpha}{r_{2}}=\frac{m}{r_{2}}+\frac{\alpha}{r_{2}}=\left\lfloor\frac{m}{r_{2}}\right\rfloor+\frac{m\bmod r_{2}}{r_{2}}+\frac{\alpha}{r_{2}},$$

$$\left\lfloor\frac{m+\alpha}{r_{2}}\right\rfloor=\left\lfloor\frac{m}{r_{2}}\right\rfloor\Leftrightarrow\frac{m\bmod r_{2}}{r_{2}}+\frac{\alpha}{r_{2}}<1,$$

$$\frac{m\bmod r_{2}+\alpha}{r_{2}}\leq\frac{r_{2}-1+\alpha}{r_{2}}=1+\frac{\alpha-1}{r_{2}}<1.$$

$$p_{ij}^{\prime}=1-\prod_{e=1}^{r^{2}}\left(1-p_{e}\right),$$

$$p_{e}=\frac{1}{1+\left(\frac{R\Delta\theta_{e}}{\mu(\kappa_{m}\kappa_{n})_{e}}\right)^{\beta}}.$$

$$p_{ij}^{\prime}=1-\prod_{e=1}^{r^{2}}\frac{1}{1+\left(\frac{R\Delta\theta_{e}}{\mu(\kappa_{m}\kappa_{n})_{e}}\right)^{-\beta}}=1-\frac{1}{\prod_{e=1}^{r^{2}}\left[1+\left(\frac{\mu(\kappa_{m}\kappa_{n})_{e}}{R\Delta\theta_{e}}\right)^{\beta}\right]}=1-\frac{1}{1+\Phi_{ij}^{\prime}}=\frac{1}{1+\frac{1}{\Phi_{ij}^{\prime}}}$$

$$\Phi_{ij}^{\prime}=\sum_{e=1}^{r^{2}}\left(\frac{\mu(\kappa_{m}\kappa_{n})_{e}}{R\Delta\theta_{e}}\right)^{\beta}+\sum_{e=1}^{r^{2}-1}\sum_{f=e+1}^{r^{2}}\left(\frac{\mu(\kappa_{m}\kappa_{n})_{e}}{R\Delta\theta_{e}}\right)^{\beta}\left(\frac{\mu(\kappa_{m}\kappa_{n})_{f}}{R\Delta\theta_{f}}\right)^{\beta}+\dots$$

$$\Phi_{ij}^{\prime}\approx\left(\frac{\mu}{R\Delta\theta}\right)^{\beta}\sum_{e=1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta}+\left(\frac{\mu}{R\Delta\theta}\right)^{2\beta}\sum_{e=1}^{r^{2}-1}\sum_{f=e+1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta}(\kappa_{m}\kappa_{n})_{f}^{\beta}+\dots$$

$$\Phi_{ij}^{\prime}\approx\left(\frac{\mu}{R\Delta\theta}\right)^{\beta}\sum_{e=1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta}.$$

$$p_{ij}^{\prime}\approx\frac{1}{1+\left(\frac{R\Delta\theta}{\mu}\right)^{\beta}\frac{1}{\sum_{e=1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta}}},$$

$$\left(\frac{R\Delta\theta}{\mu}\right)^{\beta}\frac{1}{\sum_{e=1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta}}=\left(\frac{R^{\prime}\Delta\theta_{ij}^{\prime}}{\mu^{\prime}\kappa_{i}^{\prime}\kappa_{j}^{\prime}}\right)^{\beta^{\prime}}.$$

$$\left(\kappa_{i}^{\prime}\kappa_{j}^{\prime}\right)^{\beta}=\sum_{e=1}^{r^{2}}(\kappa_{m}\kappa_{n})_{e}^{\beta},$$

$$\kappa_{i}^{\prime}=\left(\sum_{j=1}^{r}\kappa_{j}^{\beta}\right)^{1/\beta}.$$

Full text

Multiscale unfolding of real networks by geometric renormalization

Guillermo García-Pérez

Departament de Física de la Matèria Condensada, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain

Universitat de Barcelona Institute of Complex Systems (UBICS), Universitat de Barcelona, Barcelona, Spain

Marián Boguñá

Departament de Física de la Matèria Condensada, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain

Universitat de Barcelona Institute of Complex Systems (UBICS), Universitat de Barcelona, Barcelona, Spain

M. Ángeles Serrano

Departament de Física de la Matèria Condensada, Universitat de Barcelona, Martí i Franquès 1, 08028 Barcelona, Spain

Universitat de Barcelona Institute of Complex Systems (UBICS), Universitat de Barcelona, Barcelona, Spain

ICREA, Pg. Lluís Companys 23, E-08010 Barcelona, Spain

Abstract

Multiple scales coexist in complex networks. However, the small-world property makes them strongly entangled, which turns the elucidation of length scales and symmetries into a daunting challenge. Here, we define a geometric renormalization group for complex networks and use the technique to investigate networks as viewed at different scales. We find that real networks embedded in a hidden metric space show geometric scaling, in agreement with the renormalizability of the underlying geometric model. This allows us to unfold real scale-free networks in a self-similar multilayer shell which unveils the coexisting scales and their interplay. The multiscale unfolding offers a basis for a new approach to explore critical phenomena and universality in complex networks, and affords us immediate practical applications, like high-fidelity smaller-scale replicas of large networks and a multiscale navigation protocol in hyperbolic space which boosts the success of single-layer versions.

I Introduction

Symmetries permeate reality and our theories to understand it. From very simple to very subtle, all of them denote invariance under a transformation, and thus similarity or even exact correspondence between different parts of a system, or between the system and itself when observed at different scales of length or of some other variable. As paradigmatic examples, fractals are geometric objects showing physical scale invariance and self-similarity Mandelbrot (1961). Moreover, these properties can also apply to phenomenological behaviours like the dynamics of systems near the critical points of phase transitions Stanley (1971).

In complex networks, multiple scales coexist but they are so entangled that the definition of self-similarity and scale-invariance has been limited by the lack of a valid source of geometric length scale transformations. Previous efforts to study these symmetries are based on topology and include coarse-graining to preserve the large-scale behaviour of random walks Gfeller and De Los Rios (2007), or box-covering procedures based on shortest path lengths between nodes Song et al. (2005); Goh et al. (2006); Song et al. (2006); Kim et al. (2007); Radicchi et al. (2008); Rozenfeld et al. (2010). The latter revealed that certain real networks have finite fractal dimensions and exhibit self-similarity, although scaling in the topological properties was not observed beyond the degree distribution and the maximum and average degrees. However, the collection of shortest paths, albeit a well-defined metric, is a poor source of length-based scaling factors in networks due to the small-world Watts and Strogatz (1998) or even ultrasmall-world Cohen and Havlin (2003) property, and the problem remained controversial. Other studies have faced the multiscale structure of network models in a somewhat more geometric way Newman and Watts (1999); Boettcher (2011), but their findings cannot be directly applied to real-world networks.

The development in recent years of plausible models of complex networks based on an underlying metric space Serrano et al. (2008); Boguñá et al. (2010a) now opens the door to a proper geometric definition of self-similarity and scale invariance and to an unfolding of the different scales present in the connectivity structure of real networks. Hidden metric space network models couple the topology of a network to an underlying geometry through a universal connectivity law which combines popularity and similarity dimensions Serrano et al. (2008); Krioukov et al. (2010); Papadopoulos et al. (2012), such that more popular and similar nodes have more chance to interact. Naturally, the geometrization of networks provides a reservoir of distance scales, so that we can borrow concepts and techniques from the renormalization group in statistical physics Kadanoff (2000); Wilson (1975), which has been used to study systems where widely different length scales are present simultaneously. By recursive averaging over short-distance degrees of freedom, the renormalization group has successfully explained, for instance, the universality properties of critical behavior in phase transitions Wilson (1983).

In this work, we introduce a geometric renormalization group for complex networks (RGN). The method is based on a geometric embedding of the networks to construct renormalized versions of their structure by coarse-graining neighbouring nodes into supernodes and defining a new map which progressively selects longer range connections by identifying relevant interactions at each scale. The RGN technique is inspired by the block spin renormalization group devised by L. P. Kadanoff (2000).

II Evidence of geometric scaling in real networks

The map of a complex network embedded in a hidden metric space, $\mathcal{M}(T,G)$, contains information about both its topology $T$ and geometry $G$ (in terms of the positions of the nodes in the hidden metric space). Given $\mathcal{M}(T,G)$, we define a geometric renormalization operator $\mathbb{F}_{r}$ of resolution $r$ which coarse-grains the original network by a factor $r$ and defines a new topology $T^{\prime}$ and a new geometry $G^{\prime}$, which together form the renormalized map $\mathcal{M}^{\prime}$:

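$$\mathcal{M}(T,G)\overset{\mathbb{F}_{r}}{\longrightarrow}\mathcal{M}^{\prime}(T^{\prime},G^{\prime}). \tag{1}$$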

The transformation zooms out by changing the minimum length scale from that of the original network to a larger value. This operation can be iterated starting from the original network at $l=0$ ,

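$$\mathcal{M}^{(l+1)}(T^{(l+1)},G^{(l+1)})=\mathbb{F}_{r}\left[\mathcal{M}^{(l)}(T^{(l)},G^{(l)})\right]. \tag{2}$$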

In the limit $N\rightarrow\infty$ , it can be applied up to any desired scale of observation, whereas it is bounded to $\mathcal{O}(\log N)$ iterations in systems with a finite number of nodes $N$ .

The simplest hidden metric space that can embed a network is a one-dimensional sphere on which nodes have specific angular positions $\{\theta_{i};i=1,\cdots,N\}$. In this space, the transformation proceeds by, first, defining non-overlapping blocks of consecutive nodes of size $r$ along the circle and, second, coarse-graining the blocks into supernodes, regardless of whether they are connected or not to each other. Each supernode is then placed within the angular region defined by the corresponding block so that the order of nodes in the original embedding is preserved in the renormalization process. All the links between some node in one supernode and some node in the other, if any, are renormalized into a single link between the two supernodes. Figure 1 illustrates the process. This coarse-graining procedure is not restricted to equal size blocks and can be defined in different ways as long as the angular distance between the nodes inside the blocks is smaller than the distance between nodes in different blocks. For instance, one could divide the circle in equally sized sectors of a certain arc length such that they contain on average a constant number of nodes. The geometric renormalization operator has abelian semigroup structure with respect to the composition, meaning that a certain number of iterations of a given resolution are equivalent to a single transformation of higher resolution, as shown in Fig. 1. [Footnote 1: For instance, in Fig. 1 the same transformation with $r=4$ leads from $l=0$ to $l=2$ in a single step. Whenever the number of nodes is not divisible by $r$, the last supernode in a layer contains fewer than $r$ nodes, as in the example at $l=1$; however, the RGN equations are valid for uneven supernode sizes as well. Notice that the set of transformations $\mathbb{F}_{r}$ does not include an inverse element to reverse the process.]
Finally, the set of renormalized network layers $l$, each $r^{l}$ times smaller than the original one, forms a multiscale shell of the network.
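The coarse-graining step described above can be sketched in a few lines of Python. This is only a minimal illustration under stated assumptions: the function name `coarse_grain`, the dictionary-based inputs and the choice to label supernodes by consecutive integers are hypothetical conveniences, and the sketch handles only the topology (the placement of supernodes within their angular region is left out).

```python
def coarse_grain(thetas, edges, r=2):
    """One geometric coarse-graining step on the circle (illustrative sketch).

    thetas: dict mapping node -> angular position in [0, 2*pi)
    edges:  iterable of (u, v) pairs of the current layer
    Returns (block_of, super_edges): the node -> supernode map and the
    set of links between distinct supernodes.
    """
    # Order nodes by angle so that each block holds consecutive nodes.
    ordered = sorted(thetas, key=thetas.get)
    # The node at position m joins block floor(m / r); when the number of
    # nodes is not divisible by r, the last block holds fewer than r nodes.
    block_of = {node: m // r for m, node in enumerate(ordered)}
    super_edges = set()
    for u, v in edges:
        a, b = block_of[u], block_of[v]
        if a != b:
            # Any number of links between two blocks becomes a single link.
            super_edges.add((min(a, b), max(a, b)))
    return block_of, super_edges
```

Because blocks are built from the angular ordering with an integer division, two steps with `r=2` group the nodes exactly as one step with `r=4`, which is the semigroup property mentioned in the text.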

In this work, we apply the RGN to six different real scale-free networks from very different domains: technology (Internet), transportation (Airports), biology (Cell metabolism and Proteome) and scripts (Music and Words); see Appendix A for details. Many real networks can be embedded in the one-dimensional sphere using the $\mathbb{S}^{1}$ model Serrano et al. (2008), which places nodes on a circle and connects every pair with a probability that decreases with their distance along the circle, as a measure of their similarity, and increases with the product of their hidden degrees $\{\kappa_{i}\}$, as a measure of their popularity (see Appendix A). The hidden degrees are well approximated by the observed degrees in the network Boguñá and Pastor-Satorras (2003); Serrano et al. (2008), and the embedding method uses statistical inference techniques to identify the angular coordinates which maximize the likelihood that the topology of the real network is reproduced by the model Boguñá et al. (2010b); Papadopoulos et al. (2015). Once the hidden degrees and coordinates of the real scale-free networks considered in our study are known, we apply the coarse-graining by defining blocks of $r=2$ consecutive nodes on the circle, and place the supernodes within the coordinates of their corresponding nodes with the only restriction of preserving the original ordering. We iterate the process so that each coarse-graining step halves the size of the system.
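The connection rule of the $\mathbb{S}^{1}$ model can be made concrete with a short Python sketch. It assumes the connection probability of Eq. (7), with the distance along the circle written as $R\Delta\theta$ as in the expression for $p_{e}$ in the equation list; the function names and argument order are illustrative choices, not part of the original description.

```python
import math

def angular_separation(theta_i, theta_j):
    """Smallest angle between two positions on the circle, in [0, pi]."""
    d = abs(theta_i - theta_j) % (2.0 * math.pi)
    return min(d, 2.0 * math.pi - d)

def connection_probability(theta_i, theta_j, kappa_i, kappa_j, mu, R, beta):
    """p_ij = 1 / (1 + chi^beta), with chi = R * dtheta / (mu * kappa_i * kappa_j)."""
    chi = R * angular_separation(theta_i, theta_j) / (mu * kappa_i * kappa_j)
    return 1.0 / (1.0 + chi ** beta)
```

The probability decreases with angular distance (similarity) and increases with the product of hidden degrees (popularity); when the rescaled distance `chi` equals one, the probability is exactly one half.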

The resulting topological features of the renormalized networks are shown in Fig. 2 (see also Fig. 6 in Appendix B). We observe that the degree distributions, the degree-degree correlations (as measured by the average nearest-neighbour degree), and the clustering spectra all show self-similar behaviour, with the curves for the different renormalized layers collapsing when the degrees in each layer are rescaled by the average degree of that layer. Also, for every layer $l$ we obtained a partition into communities, $P^{(l)}$, using the Louvain method Blondel et al. (2008); Fig. 2 bottom shows their modularities $Q^{(l)}$. We also defined the partition induced by $P^{(l)}$ on the original network, $P^{(l,0)}$, obtained by considering that two nodes $i$ and $j$ of the original network are in the same community in $P^{(l,0)}$ if and only if the supernodes of $i$ and $j$ in layer $l$ belong to the same community in $P^{(l)}$. Both the modularity $Q^{(l,0)}$ of $P^{(l,0)}$ and the normalized mutual information $nMI^{(l,0)}$ between the partitions $P^{(0)}$ and $P^{(l,0)}$ are shown in Fig. 2 bottom. Strikingly, the community structure is preserved along the flow to the extent of allowing us to find high-modularity partitions of the original network from much smaller versions of it. This property suggests a new and efficient multiscale community detection algorithm Arenas et al. (2008); Ronhovde and Nussinov (2009); Ahn et al. (2010).
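As an illustration of how the induced partition $P^{(l,0)}$ can be built, the following Python sketch composes the node-to-supernode maps of successive layers and then pulls the layer-$l$ communities back to the original nodes. The helper names and the dictionary representation are hypothetical; community detection itself (the Louvain method in the text) is assumed to have been run separately on layer $l$.

```python
def compose_block_maps(block_maps):
    """Compose per-layer maps [layer 0 -> 1, layer 1 -> 2, ...] into layer 0 -> l."""
    first, *rest = block_maps
    result = dict(first)
    for layer_map in rest:
        result = {node: layer_map[s] for node, s in result.items()}
    return result

def induced_partition(supernode_of, community_of_supernode):
    """Give each original node the community of its supernode in layer l."""
    return {node: community_of_supernode[s] for node, s in supernode_of.items()}
```

By construction, two original nodes share a community in the induced partition if and only if their supernodes share a community in layer $l$, which matches the definition given above.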

III Geometric renormalization of the S1 model

The self-similarity exhibited by real-world networks can be understood in terms of their congruency with the underlying hidden metric space $\mathbb{S}^{1}$ model. As we show analytically (see Appendix C for details), the model is renormalizable in a geometric sense, which means that real scale-free networks with a geometric structure (i.e., those that admit a good embedding) necessarily display the same scaling behaviour.

To see why the $\mathbb{S}^{1}$ model exhibits this self-similarity, we need to consider the renormalization transformation of the geometric layout as well, that is, of hidden degrees, angular positions, $\mu$ , $R$ and $\beta$ . As we show in Appendix C, by assigning a new hidden degree $\kappa_{i}^{(l+1)}$ to supernode $i$ in layer $l+1$ as a function of the hidden degrees of the nodes it contains in layer $l$ according to

$$\kappa_{i}^{(l+1)}=\left(\sum_{j=1}^{r}\left(\kappa_{j}^{(l)}\right)^{\beta}\right)^{1/\beta}, \tag{3}$$

as well as an angular coordinate $\theta_{i}^{(l+1)}$ given by

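$$\theta_{i}^{(l+1)}=\left[\frac{\sum_{j=1}^{r}\left(\theta_{j}^{(l)}\kappa_{j}^{(l)}\right)^{\beta}}{\sum_{j=1}^{r}\left(\kappa_{j}^{(l)}\right)^{\beta}}\right]^{1/\beta}, \tag{4}$$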

and by rescaling the global parameters as $\mu^{(l+1)}=\mu^{(l)}/r$, $R^{(l+1)}=R^{(l)}/r$ and $\beta^{(l+1)}=\beta^{(l)}$, the renormalized networks remain maximally congruent with the hidden metric space model. This means that the probability $p_{ij}^{(l+1)}$ for two supernodes $i$ and $j$ to be connected in layer $l+1$ (which, according to the RGN procedure, is given by the probability for at least one link to exist between some node in $i$ and some node in $j$ in layer $l$) maintains the original form of Eq. (7), as shown in Fig. 3A. This applies both to the model and to real networks as long as they admit a good embedding; see also Fig. 7 in Appendix B. In addition, notice that the transformation of the geometric layout also has the abelian semigroup structure.
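The transformation of the geometric layout can be sketched in Python as follows. This is a minimal illustration assuming Eqs. (3) and (4) as listed in the equation section, non-negative angles, and blocks that do not straddle the $0/2\pi$ cut of the circle; the function names are hypothetical.

```python
def renormalize_block(kappas, thetas, beta):
    """Hidden degree and angle of the supernode formed by one block.

    kappas, thetas: hidden degrees and angles of the nodes in the block.
    """
    weight = sum(k ** beta for k in kappas)
    # Eq. (3): beta-norm of the hidden degrees in the block.
    kappa_new = weight ** (1.0 / beta)
    # Eq. (4): weighted power mean of the angles, with weights kappa^beta.
    theta_new = (sum((t * k) ** beta for t, k in zip(thetas, kappas)) / weight) ** (1.0 / beta)
    return kappa_new, theta_new

def rescale_globals(mu, R, beta, r):
    """Global parameters of the next layer: mu/r, R/r, and beta unchanged."""
    return mu / r, R / r, beta
```

Since Eq. (4) is a weighted power mean, the new angle always falls between the smallest and largest angle in the block, which is why the ordering of nodes on the circle survives the transformation.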

Since the networks remain congruent with the $\mathbb{S}^{1}$ model, hidden degrees $\kappa^{(l)}$ remain proportional to observed degrees $k^{(l)}$ , which allows us to explore the degree distribution of the renormalized layers analytically. It can be shown that, if the original distribution of hidden degrees is a power law with characteristic exponent $\gamma$ , the hidden degree distribution in the renormalized layers is also a power law with the same exponent asymptotically, as long as $(\gamma-1)/2<\beta$ (see Appendix C). Interestingly, the global parameter controlling the clustering coefficient, $\beta$ , does not change along the flow, which explains the self-similarity of the clustering spectra. Finally, the transformation for the angles Eq. (4) preserves the ordering of nodes and the heterogeneity in their angular density and, as a consequence, the community structure is preserved in the flow Boguñá et al. (2010b); Serrano et al. (2012); Zuev et al. (2015). The model is therefore renormalizable, and RGN realizations at any scale belong to the same ensemble with a different average degree, which should be rescaled to produce self-similar replicas.

A good approximation of the behaviour of the average degree for very large networks can be calculated by taking into account the transformation of hidden degrees in the RG flow Eq. (3) (see Appendix C for details). We obtain $\langle k\rangle^{(l+1)}=r^{\nu}\langle k\rangle^{(l)}$ , with a scaling factor $\nu$ depending on the connectivity structure of the original network. If $0<\frac{\gamma-1}{\beta}\leq 1$ , the flow is dominated by the exponent of the degree distribution $\gamma$ , and the scaling factor is given by

$$\nu=\frac{2}{\gamma-1}-1, \tag{5}$$

whereas the flow is dominated by the strength of clustering if $1\leq\frac{\gamma-1}{\beta}<2$ , and

$$\nu=\frac{2}{\beta}-1. \tag{6}$$

Therefore, if $\gamma<3$ or $\beta<2$ (phase I in Fig. 3B), then $\nu>0$ and the model flows towards a highly connected graph; the average degree is preserved if $\gamma=3$ and $\beta\geq 2$ or $\beta=2$ and $\gamma\geq 3$ , which indicates that the network is at the edge of the transition between the small-world and non-small-world phases; and $\nu<0$ if $\gamma>3$ and $\beta>2$ , causing the RGN flow to produce sparser networks approaching a unidimensional ring structure as a fixed point (phase II in Fig. 3B). In this case, the renormalized layers eventually lose the small-world property.
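The two regimes of the scaling factor $\nu$ can be evaluated with a small Python sketch. It assumes the expressions $\nu=2/(\gamma-1)-1$ and $\nu=2/\beta-1$ from the equation list together with the regime conditions stated above; the function names are hypothetical, and the prediction for the average degree is the large-network approximation $\langle k\rangle^{(l)}=r^{\nu l}\langle k\rangle^{(0)}$ that follows from iterating the one-step relation.

```python
def scaling_exponent(gamma, beta):
    """Exponent nu in <k>^(l+1) = r^nu <k>^(l) (large-network approximation)."""
    ratio = (gamma - 1.0) / beta
    if 0.0 < ratio <= 1.0:
        # Flow dominated by the degree-distribution exponent gamma.
        return 2.0 / (gamma - 1.0) - 1.0
    if 1.0 < ratio < 2.0:
        # Flow dominated by the strength of clustering, beta.
        return 2.0 / beta - 1.0
    raise ValueError("(gamma - 1) / beta must lie in (0, 2)")

def predicted_average_degree(k0, gamma, beta, r, layer):
    """Average degree after `layer` steps of resolution r, starting from k0."""
    return k0 * r ** (scaling_exponent(gamma, beta) * layer)
```

With the values of Table 1, the Internet ($\gamma=2.17$, $\beta=1.44$) falls in the $\gamma$-dominated regime and the Metabolic network ($\gamma=2.6$, $\beta=1.3$) in the $\beta$-dominated one; both give $\nu>0$. At the boundary $\gamma-1=\beta$ the two expressions coincide.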

In Fig. 3B, several real networks are displayed in the connectivity space. All of them lie in the region having the fully connected network as the fixed point, meaning that the RGN flow progressively selects more and more long range connections as a consequence of their small-worldness (see Appendix C). Furthermore, all of them, except the Internet and the Airports networks, belong to the $\beta$-dominated region. The inset also shows the behaviour of the average degree of every layer $l$, $\langle k^{(l)}\rangle$; as predicted, it grows exponentially in all cases.

Interestingly, global properties of the model, such as those reflected in the eigenvalue spectra of both the adjacency and Laplacian matrices, and quantities like the diffusion time and the restabilization time Mieghem (2011), show a dependence on $\gamma$ and $\beta$ that is consistent with the one displayed by the RGN flow of the average degree; see Figs. 10, 11 and 12 of Appendix C for results on synthetic networks. The $\mathbb{S}^{1}$ model seems to be more sensitive to small changes in degree heterogeneity in the region $0<\frac{\gamma-1}{\beta}\leq 1$, whereas changes in clustering are better reflected when $1\leq\frac{\gamma-1}{\beta}\leq 2$.

IV Applications

The RGN enables us to unfold scale-free complex networks in a self-similar multilayer shell which unveils the coexisting scales and their interplay. Beyond the theoretical implications of the discovery that self-similarity under the RGN flow seems to be a ubiquitous symmetry in real networks, their multiscale unfolding can be exploited in immediate practical applications. Next, we propose two among many others: one which singles out a specific scale and another which exploits multiple scales simultaneously.

IV.1 Mini-me network replicas

The self-similarity unveiled by the RGN in real networks allows the construction of high-fidelity reduced versions that we call Mini-me network replicas. The downscaling of the topology of large real-world complex networks finds useful applications, for instance, in networked communication systems like the Internet, as a reduced testbed to analyze the performance of new routing protocols Papadopoulos et al. (2006); Papadopoulos and Psounis (2007); Yao and Fahmy (2008, 2011). However, the success of such a program depends on the quality of the downscaled version of the original network, which should reproduce not only local properties but also the mesoscopic structure of the network. Mini-me replicas can also be used to perform finite size scaling of critical phenomena taking place on real networks, so that critical exponents could be evaluated starting from a network instance of a single size. The Mini-me networks can be produced at any scale in the range in which self-similarity is preserved. For their construction, we exploit the fact that, under renormalization, a scale-free network remains self-similar and congruent with the underlying geometric model throughout the self-similarity range of the multilayer shell. The idea is to single out a specific scale after a certain number of renormalization steps.

Typically, the renormalized average degree of real networks increases in the flow, since they belong to the small-world phase (see inset in Fig. 3B), meaning that the network layer at the selected scale is more densely connected. To reduce the density to the level of the original network, we apply a pruning of links, see Appendix A. Basically, we readjust parameter $\mu$ , controlling the number of links in the underlying geometric $\mathbb{S}^{1}$ model, so that the expected average degree in the renormalized version is that of the original network, which in turn modifies the connection probability Eq. (7). We keep in the Mini-me network only the links present in the renormalized layer which are consistent with the readjusted connection probability. In this way, we obtain a reduced version of the real network which is statistically equivalent to a very good approximation.
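The pruning step can be sketched in Python as follows. This is a single-pass illustration under stated assumptions: it uses the connection probability of Eq. (7) with distance $R\Delta\theta$, the readjusted parameter $\mu_{new}^{(l)}=\mu^{(l)}\langle k^{(0)}\rangle/\langle k^{(l)}\rangle$, and the retention probability $q_{ij}^{(l)}=p_{ij,new}^{(l)}/p_{ij}^{(l)}$ from the equation list. The function names, the dictionary inputs and the fixed random seed are hypothetical, and any further adjustment of $\mu$ described in Appendix A is not reproduced here.

```python
import math
import random

def connection_probability(dtheta, kappa_i, kappa_j, mu, R, beta):
    """Eq. (7) with the distance along the circle written as R * dtheta."""
    chi = R * dtheta / (mu * kappa_i * kappa_j)
    return 1.0 / (1.0 + chi ** beta)

def prune_links(edges, theta, kappa, mu, R, beta, k_target, k_layer, seed=0):
    """Keep each link of a renormalized layer with probability q = p_new / p_old."""
    rng = random.Random(seed)
    # Readjusted mu so that the expected average degree drops to k_target.
    mu_new = mu * k_target / k_layer
    kept = []
    for i, j in edges:
        d = abs(theta[i] - theta[j]) % (2.0 * math.pi)
        dtheta = min(d, 2.0 * math.pi - d)
        p_old = connection_probability(dtheta, kappa[i], kappa[j], mu, R, beta)
        p_new = connection_probability(dtheta, kappa[i], kappa[j], mu_new, R, beta)
        if rng.random() < p_new / p_old:
            kept.append((i, j))
    return mu_new, kept
```

Because an existing link is present with probability $p_{ij}^{(l)}$ and survives with probability $q_{ij}^{(l)}$, it remains in the replica with probability $p_{ij}^{(l)}q_{ij}^{(l)}=p_{ij,new}^{(l)}$. When the layer is denser than the original network, $\mu_{new}<\mu$ and therefore $q_{ij}^{(l)}\leq 1$, so only link removal is needed.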

To illustrate the high fidelity that Mini-me network replicas can achieve, we use them to reproduce the behaviour of dynamical processes in real networks. We selected three different dynamical processes: the classic ferromagnetic Ising model, the susceptible-infected-susceptible (SIS) epidemic spreading model, and the Kuramoto model of synchronization (see Appendix A for details). We test these dynamics in all the self-similar network layers of the real networks analysed in this work. Results are shown in Fig. 4 and Fig. 13 in Appendix D. Quite remarkably, for all dynamics and all networks, we observe very similar results between the original and Mini-me replicas at all scales. This is particularly interesting as these dynamics have a strong dependence on the mesoscale structure of the underlying networks. This strongly supports our claim that both the micro and meso-scales are preserved in the downscaled replicas, as expected given the self-similarity of the network layers in the RGN flow.

IV.2 Multiscale navigation

Applications that simultaneously exploit more than one or even all the layers in the self-similar multiscale shell are also possible. Next, we introduce a new multiscale navigation protocol for networks embedded in hyperbolic space, which improves single-layer results Boguñá et al. (2010b). To this end, we exploit the quasi-isomorphism between the $\mathbb{S}^{1}$ model and the $\mathbb{H}^{2}$ model in hyperbolic space Krioukov et al. (2009, 2010) to produce a purely geometric representation of the multiscale shell (see Appendix A). In hyperbolic space, each node is characterised by a radial coordinate directly related to its degree, and an angular coordinate identical to that in the circle. The connection probability becomes a decreasing function of the hyperbolic distance between nodes and, therefore, the most likely path connecting two distant nodes is typically the topological shortest path.
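The mapping to hyperbolic space can be illustrated with a short Python sketch based on the expressions for $R_{\mathbb{H}^{2}}$, $r_{i}$, the hyperbolic distance and the connection probability given in the equation list. The function names are hypothetical, and $\kappa_{0}$ is assumed here to be the smallest hidden degree, so that the least popular nodes sit at the boundary of the disk.

```python
import math

def hyperbolic_radius(R, mu, kappa0):
    """Radius of the hyperbolic disk: 2 ln(2R / (mu * kappa0^2))."""
    return 2.0 * math.log(2.0 * R / (mu * kappa0 ** 2))

def radial_coordinate(kappa, kappa0, R_H2):
    """Radial position: larger hidden degrees lie closer to the centre."""
    return R_H2 - 2.0 * math.log(kappa / kappa0)

def hyperbolic_distance(r_i, theta_i, r_j, theta_j):
    """Hyperbolic law of cosines for two points in polar coordinates."""
    dtheta = math.pi - abs(math.pi - abs(theta_i - theta_j) % (2.0 * math.pi))
    arg = (math.cosh(r_i) * math.cosh(r_j)
           - math.sinh(r_i) * math.sinh(r_j) * math.cos(dtheta))
    # Guard against rounding slightly below 1 for nearly coincident points.
    return math.acosh(max(arg, 1.0))

def connection_probability_h2(x_ij, R_H2, beta):
    """p_ij = 1 / (1 + exp(beta/2 * (x_ij - R_H2)))."""
    return 1.0 / (1.0 + math.exp(0.5 * beta * (x_ij - R_H2)))
```

Two nodes at the same angle are separated by the difference of their radii, while two nodes at opposite angles are separated by the sum, which gives a quick sanity check of the distance function.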

The multiscale protocol is based on greedy routing, in which a source node transmitting information or a packet to a target node sends it to its neighbour closest to the destination in the metric space. As performance metrics we consider the success rate (fraction of successful greedy paths), and the stretch of successful paths (ratio between the number of hops in the greedy path and the topological shortest path). Notice that, in general, greedy routing cannot guarantee the existence of a successful greedy path among all pairs of nodes in the network; the packet can get trapped in a loop if sent to an already visited node. In this case, the multiscale protocol can find alternative paths by taking advantage of the increased efficiency of greedy forwarding in the coarse-grained layers. When node $i$ needs to send a packet to some destination node $j$, node $i$ performs a virtual greedy forwarding step in the highest possible layer to find which supernode should be next in the greedy path. Based on this, node $i$ then forwards the packet to its physical neighbour in the real network which guarantees that it will eventually reach such supernode. The process is depicted in Fig. 5A (full details can be found in Appendix A). To guarantee navigation inside supernodes, we require an extra condition in the renormalization process and only consider blocks of connected consecutive nodes. A single node can be left alone forming a supernode by itself, so blocks are of size one or two nodes. Notice that the new requirement does not alter the self-similarity of the renormalized networks forming the multiscale shell (Figs. 14 and 15 in Appendix E) nor the congruency with the hidden metric space (Fig. 16 in Appendix E).
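The single-layer baseline and its two performance metrics can be sketched in Python. This covers only standard greedy routing in one layer, not the multiscale protocol, whose details are in Appendix A. The adjacency-dictionary input, the generic `dist` callable (hyperbolic distance in the paper's setting) and the function names are assumptions made for illustration.

```python
from collections import deque

def greedy_route(adj, dist, source, target):
    """Return the greedy path as a list of nodes, or None if the packet loops."""
    path, visited, current = [source], {source}, source
    while current != target:
        # Forward to the neighbour closest to the target in the metric space.
        nxt = min(adj[current], key=lambda n: dist(n, target), default=None)
        if nxt is None or nxt in visited:
            return None  # trapped: next hop was already visited
        path.append(nxt)
        visited.add(nxt)
        current = nxt
    return path

def shortest_hops(adj, source, target):
    """Topological shortest-path length by breadth-first search."""
    seen, queue = {source: 0}, deque([source])
    while queue:
        u = queue.popleft()
        if u == target:
            return seen[u]
        for v in adj[u]:
            if v not in seen:
                seen[v] = seen[u] + 1
                queue.append(v)
    return None

def stretch(adj, dist, source, target):
    """Hops of the greedy path divided by the shortest-path hops, if successful."""
    path = greedy_route(adj, dist, source, target)
    if path is None:
        return None
    return (len(path) - 1) / shortest_hops(adj, source, target)
```

The success rate is then the fraction of source-target pairs for which `greedy_route` returns a path, and the average stretch is taken over those successful pairs.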

Figure 5B shows the increase of the success rate as the number of layers $L$ used in the navigation process is increased for the different real networks considered in this work. Interestingly, as seen in Fig. 5C, this improvement alters the stretch of successful paths only mildly. The multiscale navigation protocol boosts the success rate by finding paths just slightly longer on average as compared with standard greedy routing in the original network in almost all cases; see inset in Fig. 5C. The improvement comes at the expense of adding information about the supernodes to the knowledge needed for standard greedy routing in single-layered networks. However, the trade-off between improvement and information overload is advantageous, as for many systems the addition of just one or two renormalized layers already produces a notable effect.

V Discussion

Hidden metric space network models Serrano et al. (2008); Krioukov et al. (2010); Papadopoulos et al. (2012) are able to explain non-trivial structural features of real networks (including scale-free degree distributions, clustering, and self-similarity of the nested hierarchy of subgraphs produced by degree pruning Serrano et al. (2011)), as well as fundamental mechanisms like preferential attachment in growing networks Papadopoulos et al. (2012) and the emergence of communities Zuev et al. (2015). Interestingly, the existence of a metric space underlying complex networks allows us to define a geometric renormalization group that reveals the multiscale nature of these systems. Quite strikingly, models of scale-free networks are shown to be self-similar under such renormalization, revealing different structural properties depending on the level of coupling with the metric space and on the degree heterogeneity. The importance of these results, however, stems from the observed self-similarity under geometric renormalization as a ubiquitous symmetry of real-world scale-free networks, which moreover stands as new evidence in favour of the conjecture that hidden metric spaces underlie real networks.

The renormalization group presented in this work is similar in spirit to the topological renormalization studied in Song et al. (2005). However, it has clear advantages. First, the ordering in the construction of the boxes is dictated by the embedding of the original network in the underlying space. Second, the congruency between real scale-free networks and the underlying metric space explains the self-similarity of real systems and reveals a multiscale organization that preserves the mesoscopic structure across different observation scales. In the case of topological renormalization, on the other hand, the lack of an underlying model means that it is not obvious in advance, before applying the transformation, whether the network will be self-similar or whether the mesoscopic structure will be maintained.

From a fundamental point of view, the geometric renormalization group introduced here has proven to be an exceptional tool to unravel the global organization of complex networks across scales and promises to become a standard methodology to analyze real complex networks. It can also help in areas like the study of metapopulation models, in which transportation fluxes or population movements happen both on a local and a global scale Colizza et al. (2007). From a practical point of view, we envision many applications besides the two studied in this paper. Examples include the development of a new community detection method that would use the mesoscopic information encoded in the different observation scales, and the use of downscaled versions of the network to perform finite size scaling. This last application would allow for the determination of critical exponents of real complex networks, a task that is not possible with current methods.

Acknowledgments

We acknowledge support from a James S. McDonnell Foundation Scholar Award in Complex Systems; the ICREA Academia prize, funded by the Generalitat de Catalunya; Ministerio de Economía y Competitividad of Spain projects no. FIS2013-47282-C2-1-P and no. FIS2016-76830-C2-2-P (AEI/FEDER, UE); the Generalitat de Catalunya grant no. 2014SGR608.

Author contributions

G. G.-P., M. B., and M. Á. S. contributed to the design and implementation of the research, to the analysis of the results, and to the writing of the manuscript.

Additional information

Competing financial interests: The authors declare no competing financial interests.

Appendix A Methods

A.1 Real networks data

The real networks analyzed in this paper are:

• The Internet at the Autonomous Systems level. The data was collected by the Cooperative Association for Internet Data Analysis (CAIDA) (Claffy et al., 2009) and corresponds to mid 2009.

• The Airports network. It was obtained from Ref. (kon, 2016; Kunegis, 2013). Directed links represent flights by airlines. We consider the undirected version obtained by keeping bidirectional edges only.

• The one-mode projection onto metabolites of the human metabolic network at the cell level, as used in Ref. (Serrano et al., 2012).

• The human HI-II-14 interactome. This proteome network was obtained from Ref. (Rolland et al., 2014). We removed self-loops.

• The Music network. Nodes are chords (sets of musical notes played in a single beat) and connections represent observed transitions among them in a set of songs, see Ref. (Serrà et al., 2012). The original network is weighted, directed and very dense. Hence, we applied the disparity filter (Serrano et al., 2009) with $\alpha=0.01$ to obtain a sparser network. Finally, we kept bidirectional edges only to construct the undirected network.

• The network of adjacency between words in Darwin’s book “On the Origin of Species”, from Ref. (Milo et al., 2004).

In all cases, we only considered the largest connected components.

A.2 $\mathbb{S}^{1}$ model and transformation to $\mathbb{H}^{2}$

The $\mathbb{S}^{1}$ model Serrano et al. (2008) places the nodes of a network on a one-dimensional sphere (a circle) of radius $R$ and connects every pair $i,j$ with probability

$$p_{ij}=\frac{1}{1+\left(\frac{d_{a,ij}}{\mu\kappa_{i}\kappa_{j}}\right)^{\beta}},\tag{7}$$

where $\mu$ controls the average degree of the network, $\beta$ its clustering, and $d_{a,ij}=R\Delta\theta_{ij}$ is the distance between the nodes separated by an angle $\Delta\theta_{ij}$ ; $R$ is set to $N/2\pi$ , where $N$ is the number of nodes, so that the density of nodes along the circle is equal to 1. The hidden degrees $\kappa_{i}$ and $\kappa_{j}$ are proportional to the degrees of nodes $i$ and $j$ , respectively.

The $\mathbb{S}^{1}$ model is isomorphic to a purely geometric model, the $\mathbb{H}^{2}$ model (Krioukov et al., 2010), in which nodes are placed in a two-dimensional hyperbolic disk of radius

$$R_{\mathbb{H}^{2}}=2\ln\left(\frac{2R}{\mu\kappa_{0}^{2}}\right),\tag{8}$$

where $\kappa_{0}=\min\left\{\kappa_{i}\right\}$ . By mapping every mass $\kappa_{i}$ to a radial coordinate $r_{i}$ according to

$$r_{i}=R_{\mathbb{H}^{2}}-2\ln\frac{\kappa_{i}}{\kappa_{0}},\tag{9}$$

the connection probability, Eq. (7), becomes

$$p_{ij}=\frac{1}{1+e^{\frac{\beta}{2}\left(x_{ij}-R_{\mathbb{H}^{2}}\right)}},\tag{10}$$

where $x_{ij}=r_{i}+r_{j}+2\ln\frac{\Delta\theta_{ij}}{2}$ is a good approximation to the hyperbolic distance between two points with coordinates $(r_{i},\theta_{i})$ and $(r_{j},\theta_{j})$ in the native representation of hyperbolic space. The exact hyperbolic distance $d_{\mathbb{H}^{2}}$ is given by the hyperbolic law of cosines,

$$\cosh d_{\mathbb{H}^{2}}=\cosh r_{i}\cosh r_{j}-\sinh r_{i}\sinh r_{j}\cos\Delta\theta_{ij}.\tag{11}$$
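As a numerical illustration of the isomorphism, the short Python sketch below maps hidden degrees to radial coordinates and compares the two forms of the connection probability. It assumes the usual expressions of the model, namely $p_{ij}=1/[1+(R\Delta\theta_{ij}/\mu\kappa_{i}\kappa_{j})^{\beta}]$, $R_{\mathbb{H}^{2}}=2\ln(2R/\mu\kappa_{0}^{2})$ and $r_{i}=R_{\mathbb{H}^{2}}-2\ln(\kappa_{i}/\kappa_{0})$; the function names and the parameter values in the usage note are hypothetical.

```python
import math


def s1_probability(kappa_i, kappa_j, dtheta, R, mu, beta):
    """Connection probability in the S1 model for angular separation dtheta."""
    return 1.0 / (1.0 + (R * dtheta / (mu * kappa_i * kappa_j)) ** beta)


def hyperbolic_disk_radius(R, mu, kappa_0):
    """Radius of the hyperbolic disk (assumed form, see the lead-in)."""
    return 2.0 * math.log(2.0 * R / (mu * kappa_0 ** 2))


def radial_coordinate(kappa, kappa_0, R_h2):
    """Map a hidden degree to a radial coordinate in the H2 representation."""
    return R_h2 - 2.0 * math.log(kappa / kappa_0)


def approximate_distance(r_i, r_j, dtheta):
    """x_ij = r_i + r_j + 2 ln(dtheta / 2)."""
    return r_i + r_j + 2.0 * math.log(dtheta / 2.0)


def exact_distance(r_i, r_j, dtheta):
    """Hyperbolic law of cosines."""
    arg = (math.cosh(r_i) * math.cosh(r_j)
           - math.sinh(r_i) * math.sinh(r_j) * math.cos(dtheta))
    return math.acosh(max(arg, 1.0))


def h2_probability(r_i, r_j, dtheta, R_h2, beta):
    """Connection probability written in terms of the approximate distance x_ij."""
    x = approximate_distance(r_i, r_j, dtheta)
    return 1.0 / (1.0 + math.exp(0.5 * beta * (x - R_h2)))
```

For instance, with $N=1000$ (so $R=N/2\pi$), $\mu=0.05$, $\kappa_{0}=2$, $\kappa_{i}=5$, $\kappa_{j}=20$ and $\Delta\theta_{ij}=0.3$, both probabilities coincide up to rounding, and `approximate_distance` differs from `exact_distance` by less than $0.05$.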

A.3 Adjusting the average degree of Mini-me network replicas

To reduce the average degree in a renormalized network to the level of the original network, we apply a pruning of links using the underlying metric model with which the networks in all layers are congruent. The procedure is detailed in this section.

The renormalized network in layer $l$ has an average degree $\langle k^{(l)}\rangle$ that is generally larger (in phase I) than the original network’s $\langle k^{(0)}\rangle$ . Moreover, the new network is congruent with the underlying hidden metric space with a parameter $\mu^{(l)}=\mu^{(0)}/r^{l}$ controlling its average degree. The main idea is to decrease the value of $\mu^{(l)}$ to a new value $\mu^{(l)}_{\textrm{new}}$ , which implies that the connection probability of every pair of nodes $(i,j)$ , $p_{ij}^{(l)}$ , decreases to $p_{ij,\textrm{new}}^{(l)}$ . We then prune the existing links by keeping them with probability

$$\frac{p_{ij,\textrm{new}}^{(l)}}{p_{ij}^{(l)}}.\tag{12}$$

Therefore, the probability for a link to exist in the pruned network reads,

$$p_{ij}^{(l)}\,\frac{p_{ij,\textrm{new}}^{(l)}}{p_{ij}^{(l)}}=p_{ij,\textrm{new}}^{(l)},\tag{13}$$

whereas the probability for it not to exist is

$$1-p_{ij}^{(l)}+p_{ij}^{(l)}\left(1-\frac{p_{ij,\textrm{new}}^{(l)}}{p_{ij}^{(l)}}\right)=1-p_{ij,\textrm{new}}^{(l)},\tag{14}$$

that is, the pruned network has a lower average degree and is also congruent with the underlying metric space model with the new value of $\mu^{(l)}_{\textrm{new}}$ . Hence, we only need to find the right value of $\mu^{(l)}_{\textrm{new}}$ so that $\langle k^{(l)}_{\textrm{new}}\rangle=\langle k^{(0)}\rangle$ . In the thermodynamic limit, the average degree of an $\mathbb{S}^{1}$ network is proportional to $\mu$ , so we could simply set

$$\mu^{(l)}_{\textrm{new}}=\frac{\langle k^{(0)}\rangle}{\langle k^{(l)}\rangle}\mu^{(l)}.\tag{15}$$

However, since we consider real-world networks, finite-size effects play an important role. Indeed, we need to correct the value of $\mu^{(l)}_{\textrm{new}}$ in Eq. (15). To this end, we use a correcting factor $c$ , initially set to $c=1$ , and use $\mu^{(l)}_{\textrm{new}}=c\frac{\langle k^{(0)}\rangle}{\langle k^{(l)}\rangle}\mu^{(l)}$ ; for every value of $c$ , we prune the network. If $\langle k^{(l)}_{\textrm{new}}\rangle>\langle k^{(0)}\rangle$ , we give $c$ the new value $c-0.1u\rightarrow c$ , where $u$ is a random variable uniformly distributed between 0 and 1. Similarly, if $\langle k^{(l)}_{\textrm{new}}\rangle<\langle k^{(0)}\rangle$ , $c+0.1u\rightarrow c$ . The process ends when $|\langle k^{(l)}_{\textrm{new}}\rangle-\langle k^{(0)}\rangle|$ is below a given threshold (in our case, we set it to 0.1).
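The iterative correction can be sketched in Python as follows. This is only an illustration under stated assumptions: links are kept with probability $p_{ij,\textrm{new}}^{(l)}/p_{ij}^{(l)}$ computed from the $\mathbb{S}^{1}$ connection probability, `distance(i, j)` is a user-supplied function returning $R\Delta\theta_{ij}$, and the lower bound on $c$ and the iteration cap are safeguards added here that are not part of the description above. All names are hypothetical.

```python
import random


def s1_probability(kappa_i, kappa_j, distance, mu, beta):
    """S1 connection probability for a pair separated by `distance` on the circle."""
    return 1.0 / (1.0 + (distance / (mu * kappa_i * kappa_j)) ** beta)


def prune_to_target_degree(edges, kappa, distance, n_nodes, mu_l, beta,
                           k_target, tol=0.1, max_iter=10000, rng=None):
    """Prune `edges` until the average degree is within `tol` of `k_target`.

    Returns the kept links and the value of mu_new that produced them.
    """
    rng = rng or random.Random()
    k_l = 2.0 * len(edges) / n_nodes          # average degree before pruning
    c = 1.0                                   # correcting factor
    for _ in range(max_iter):
        mu_new = c * (k_target / k_l) * mu_l
        kept = []
        for i, j in edges:
            d = distance(i, j)
            p_old = s1_probability(kappa[i], kappa[j], d, mu_l, beta)
            p_new = s1_probability(kappa[i], kappa[j], d, mu_new, beta)
            if rng.random() < p_new / p_old:  # keep the link
                kept.append((i, j))
        k_new = 2.0 * len(kept) / n_nodes
        if abs(k_new - k_target) < tol:
            return kept, mu_new
        step = 0.1 * rng.random()             # 0.1 * u with u uniform in [0, 1)
        # Assumption: keep c strictly positive so that mu_new stays positive.
        c = max(c - step if k_new > k_target else c + step, 1e-6)
    raise RuntimeError("pruning did not converge within max_iter iterations")
```

Each iteration prunes the original renormalized network afresh with the current $\mu^{(l)}_{\textrm{new}}$, so the returned link set is always a subset of the input links.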

A.4 Simulation of dynamical processes

The Ising model is an equilibrium model of interacting spins Dorogovtsev et al. (2002). Every node $i$ is assigned a variable $s_{i}$ with two possible values $s_{i}=\pm 1$ , and the energy of the system is, in the absence of external field, given by the Hamiltonian

$$\mathcal{H}=-\sum_{i<j}J_{ij}a_{ij}s_{i}s_{j},\tag{16}$$

where $a_{ij}$ are the elements of the adjacency matrix and $J_{ij}$ are coupling constants which we set to one. We start from an initial condition with $s_{i}=1$ for all $i$ and explore the ensemble of configurations using the Metropolis-Hastings algorithm: we randomly select one node and propose a change in its spin, $-s_{i}\rightarrow s_{i}$ . If $\Delta\mathcal{H}\leq 0$ , we accept the change; otherwise, we accept it with probability $e^{-\Delta\mathcal{H}/T}$ , where $T$ is the temperature acting as a control parameter. The order parameter is the absolute magnetization per spin $|m|$ , where $m=\frac{1}{N}\sum_{i}s_{i}$ ; if all spins point in the same direction, $|m|=1$ , whereas $|m|=0$ if half the spins point in each direction.
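A minimal Python sketch of this single-spin-flip scheme is given below. It assumes $J_{ij}=1$ and a Hamiltonian of the form $\mathcal{H}=-\sum_{i<j}a_{ij}s_{i}s_{j}$, so that flipping $s_{i}$ changes the energy by $\Delta\mathcal{H}=2s_{i}\sum_{j}a_{ij}s_{j}$. The choice of averaging $|m|$ over the second half of the sweeps is an assumption made here as a stand-in for the stationary state; the function name is hypothetical.

```python
import math
import random


def metropolis_ising(adj, temperature, sweeps, rng=None):
    """Mean |m| over the second half of `sweeps` Metropolis sweeps (J_ij = 1)."""
    rng = rng or random.Random()
    nodes = list(adj)
    spins = {i: 1 for i in nodes}          # ordered initial condition s_i = 1
    magnetization = len(nodes)             # running value of sum_i s_i
    samples = []
    for sweep in range(sweeps):
        for _ in range(len(nodes)):        # one sweep = N proposed flips
            i = rng.choice(nodes)
            # Energy change of flipping s_i when H = -sum_{i<j} a_ij s_i s_j
            delta_h = 2.0 * spins[i] * sum(spins[j] for j in adj[i])
            if delta_h <= 0 or rng.random() < math.exp(-delta_h / temperature):
                spins[i] = -spins[i]
                magnetization += 2 * spins[i]
        if sweep >= sweeps // 2:           # assumption: discard first half as transient
            samples.append(abs(magnetization) / len(nodes))
    return sum(samples) / len(samples)
```

On a small ring, for example, a very low temperature leaves the ordered initial state essentially untouched, while a temperature much larger than the typical degree yields a small $|m|$.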

In the SIS dynamical model of epidemic spreading Pastor-Satorras and Vespignani (2001), every node $i$ can present two states at a given time $t$ , susceptible ( $n_{i}(t)=0$ ) or infected ( $n_{i}(t)=1$ ). Both infection and recovery are Poisson processes. An infected node recovers with rate 1, whereas infected nodes infect their susceptible neighbours at rate $\lambda$ . We simulate this process using the continuous-time Gillespie algorithm with all nodes initially infected. The order parameter is the prevalence or fraction of infected nodes $\rho(t)=\frac{1}{N}\sum_{i}n_{i}(t)$ .
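The continuous-time simulation can be illustrated with the naive Gillespie sketch below. It recomputes the list of infected-susceptible links at every event, which is simple but inefficient, and it returns the time-weighted mean prevalence over the whole window $[0,t_{\max}]$ including the transient; both choices are simplifications assumed here. The function name and the parameter `t_max` are hypothetical.

```python
import random


def gillespie_sis(adj, lam, t_max, rng=None):
    """Time-weighted mean prevalence over [0, t_max], all nodes initially infected."""
    rng = rng or random.Random()
    infected = set(adj)
    n, t, area = len(adj), 0.0, 0.0
    while infected:
        # Active links: (infected node, susceptible neighbour)
        si_links = [(i, j) for i in infected for j in adj[i] if j not in infected]
        recovery_rate = float(len(infected))   # each infected node recovers at rate 1
        infection_rate = lam * len(si_links)   # each active link transmits at rate lam
        total = recovery_rate + infection_rate
        wait = rng.expovariate(total)          # time to the next event
        if t + wait >= t_max:
            area += (t_max - t) * len(infected) / n
            break
        area += wait * len(infected) / n
        t += wait
        if rng.random() * total < recovery_rate:
            infected.remove(rng.choice(sorted(infected)))
        else:
            infected.add(rng.choice(si_links)[1])
    # If the infection died out, the prevalence is zero for the remaining time.
    return area / t_max
```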

The Kuramoto model is a dynamical model for coupled oscillators. Every node $i$ is described by a natural frequency $\omega_{i}$ and a time-dependent phase $\theta_{i}(t)$ . A node’s phase evolves according to

$$\dot{\theta}_{i}=\omega_{i}+\sigma\sum_{j}a_{ij}\sin\left(\theta_{j}-\theta_{i}\right),\tag{17}$$

where $a_{ij}$ are the adjacency matrix elements and $\sigma$ is the coupling strength. We integrate the equations of motion using Heun’s method. Initially, the phases $\theta_{i}(0)$ and the frequencies $\omega_{i}$ are randomly drawn from the uniform distributions $U(-\pi,\pi)$ and $U(-1/2,1/2)$ respectively, as in Ref. (Moreno and Pacheco, 2004). The order parameter $r(t)=\frac{1}{N}\left|\sum_{j}e^{i\theta_{j}(t)}\right|$ measures the phase coherence of the set of nodes; if all nodes oscillate in phase, $r(t)=1$ , whereas $r(t)\rightarrow 0$ if nodes oscillate in a disordered manner.
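The integration scheme can be sketched in Python as follows, assuming equations of motion of the standard form $\dot{\theta}_{i}=\omega_{i}+\sigma\sum_{j}a_{ij}\sin(\theta_{j}-\theta_{i})$. The time step `dt`, the number of steps and the function name are hypothetical choices, and the sketch returns the order parameter at the final time only rather than a stationary average.

```python
import math
import random


def kuramoto_heun(adj, sigma, dt, steps, rng=None):
    """Integrate the Kuramoto model with Heun's method and return r at the final time."""
    rng = rng or random.Random()
    nodes = list(adj)
    theta = {i: rng.uniform(-math.pi, math.pi) for i in nodes}   # initial phases
    omega = {i: rng.uniform(-0.5, 0.5) for i in nodes}           # natural frequencies

    def velocity(state):
        return {i: omega[i] + sigma * sum(math.sin(state[j] - state[i]) for j in adj[i])
                for i in nodes}

    for _ in range(steps):
        k1 = velocity(theta)
        predictor = {i: theta[i] + dt * k1[i] for i in nodes}    # Euler predictor
        k2 = velocity(predictor)
        theta = {i: theta[i] + 0.5 * dt * (k1[i] + k2[i]) for i in nodes}  # corrector

    re = sum(math.cos(theta[i]) for i in nodes)
    im = sum(math.sin(theta[i]) for i in nodes)
    return math.hypot(re, im) / len(nodes)
```

On a small complete graph with strong coupling the phases lock and the returned value approaches one, whereas for $\sigma=0$ the oscillators drift independently.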

In every realization, we compute an average of the order parameter in the stationary state. In the case of the SIS model, the single-realization mean of prevalence values is weighted by time. The curves presented in this work correspond to statistics over 100 realizations.

A.5 Multiscale navigation

Given a network and its embedding (layer 0), we merge pairs of consecutive nodes only if they are connected, which guarantees navigation inside supernodes; this process generates layer 1. We repeat the process to generate $L$ layers. The multiscale navigation protocol requires every node $i$ to be provided with the following local information:

1. The coordinates $(r_{i}^{(l)},\theta_{i}^{(l)})$ of node $i$ in every layer $l$ .

2. The list of (super)neighbours of node $i$ in every layer as well as their coordinates.

3. Let SuperN $(i,l)$ be the supernode to which $i$ belongs in layer $l$ . If SuperN $(i,l)$ is connected to SuperN $(k,l)$ in layer $l$ , at least one of the (super)nodes in layer $l-1$ belonging to SuperN $(i,l)$ must be connected to at least one of the (super)nodes in layer $l-1$ belonging to SuperN $(k,l)$ ; such a node is called a gateway. For every superneighbour of node SuperN $(i,l)$ in layer $l$ , node $i$ knows which (super)node or (super)nodes in layer $l-1$ are gateways reaching it. Notice that both the gateways and SuperN $(i,l-1)$ belong to SuperN $(i,l)$ in layer $l$ so, in layer $l-1$ , they must either be the same (super)node or different but connected (super)nodes.

4. If SuperN $(i,l-1)$ is a gateway reaching some supernode $s$ , at least one of its (super)neighbours in layer $l-1$ belongs to $s$ ; node $i$ knows which.

This information allows us to navigate the network as follows. Let $j$ be the destination node to which $i$ wants to forward a message, and let node $i$ know $j$ ’s coordinates in all $L$ layers $(r_{j}^{(l)},\theta_{j}^{(l)})$ . In order to decide which of its physical neighbours (i.e., in layer 0) should be next in the message-forwarding process, node $i$ must first check whether it is connected to $j$ ; in that case, it forwards the message directly to $j$ . If it is not, it must:

1. Find the highest layer $l_{max}$ in which SuperN $(i,l_{max})$ and SuperN $(j,l_{max})$ still have different coordinates. Set $l=l_{max}$ .

2. Perform a standard step of greedy routing in layer $l$ : find the closest neighbour of SuperN $(i,l)$ to SuperN $(j,l)$ . This is the current target SuperT $(l)$ .

3. While $l>0$ , look into layer $l-1$ :

   – Set $l=l-1$ .

   – If SuperN $(i,l)$ is a gateway connecting to some (super)node within SuperT $(l+1)$ , node $i$ sets as new current target SuperT $(l)$ its (super)neighbour belonging to SuperT $(l+1)$ closest to SuperN $(j,l)$ .

   – Else node $i$ sets as new target SuperT $(l)$ the gateway in SuperN $(i,l+1)$ connecting to SuperT $(l+1)$ (its (super)neighbour belonging to SuperN $(i,l+1)$ ).

4. In layer $l=0$ , SuperT $(0)$ belongs to the real network and is a neighbour of $i$ , so node $i$ forwards the message to SuperT $(0)$ .
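The construction of one coarse-grained layer for navigation, in which consecutive nodes are merged only if they are connected, can be sketched as follows. The left-to-right pairing along the circle is an assumption made here for concreteness, and the function name and data layout are hypothetical; the sketch covers only the layer construction, not the forwarding protocol or the assignment of supernode coordinates.

```python
def merge_consecutive_connected(order, adj):
    """Build one navigation layer.

    order: node identifiers sorted by angular coordinate.
    adj:   dict mapping each node to the set of its neighbours.
    Returns (blocks, super_adj), where blocks[b] is a tuple with the one or two
    nodes merged into supernode b and super_adj is the supernode adjacency.
    """
    blocks, k = [], 0
    while k < len(order):
        # Merge with the next node along the circle only if the two are linked.
        if k + 1 < len(order) and order[k + 1] in adj[order[k]]:
            blocks.append((order[k], order[k + 1]))
            k += 2
        else:
            blocks.append((order[k],))   # node left alone as its own supernode
            k += 1

    owner = {node: b for b, block in enumerate(blocks) for node in block}
    super_adj = {b: set() for b in range(len(blocks))}
    for u in adj:
        for v in adj[u]:
            if owner[u] != owner[v]:     # links inside a block are integrated out
                super_adj[owner[u]].add(owner[v])
    return blocks, super_adj
```

Applying the function repeatedly to its own output (with the supernodes taken in the order in which they are created) would generate the successive layers.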

Appendix B Evidence of geometric scaling in real networks

The global topological parameters of all six networks are contained in Table 1.

Fig. 2 compares the topological properties of the renormalized networks for three real networks. We show the equivalent results for the Airports, Proteome and Words networks in Fig. 6.

In Fig. 7, we show the empirical connection probabilities of the six real-world networks considered in this paper as well as their renormalized versions.

Appendix C The Geometric Renormalization Group

This section contains the calculations related to the theoretical aspects of the geometric renormalization transformation. In particular, we show the semi-group structure of the transformation, derive the corresponding recurrence relations for the renormalization of the $\mathbb{S}^{1}$ model and calculate the flow of the average degree. We also discuss the connection with statistical mechanics by using the isomorphism between the $\mathbb{S}^{1}$ and the $\mathbb{H}^{2}$ models and, finally, we include some numerical results regarding the relation between global properties of the networks generated by the model and the flow of the average degree.

C.1 The semigroup structure of the coarse-graining step

It is easy to show that the geometric coarse-graining presented in this paper has a semigroup structure. To this end, we need to see that node $i$ is mapped to the same supernode whether we apply the coarse-graining with $r=r_{1}$ first and then a second time with $r=r_{2}$ or just once with $r=r_{1}r_{2}$ . In the first case, the step with $r=r_{1}$ maps $i$ to supernode $m=\left\lfloor i/r_{1}\right\rfloor$ (where $\lfloor x\rfloor$ represents the integer part of $x$ ), and then $m$ is mapped to $n=\left\lfloor m/r_{2}\right\rfloor=\left\lfloor\left\lfloor i/r_{1}\right\rfloor/r_{2}\right\rfloor$ in the second step. In the second case, $i$ is mapped to supernode $s=\left\lfloor i/(r_{1}r_{2})\right\rfloor$ . Notice that $s=\left\lfloor(i/r_{1})/r_{2}\right\rfloor=\left\lfloor(\left\lfloor i/r_{1}\right\rfloor+\alpha)/r_{2}\right\rfloor=\left\lfloor(m+\alpha)/r_{2}\right\rfloor$ , where $\alpha=(i\bmod r_{1})/r_{1}<1$ . Now,

[TABLE]

so

[TABLE]

which is always fulfilled since $\alpha<1$ and

[TABLE]

Thus, $s=n$ , and node $i$ is mapped to the same supernode in both cases. It follows immediately from this result that both processes yield the same final link structure.
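The identity can also be checked by brute force. The sketch below assumes nodes are labelled $i=0,1,\dots,N-1$ in angular order; the function names are hypothetical.

```python
def supernode_index(i, r):
    """Supernode to which node i is mapped by one coarse-graining step of size r."""
    return i // r


def semigroup_holds(n_nodes, r1, r2):
    """True if coarse-graining with r1 and then r2 equals a single step with r1 * r2."""
    return all(
        supernode_index(supernode_index(i, r1), r2) == supernode_index(i, r1 * r2)
        for i in range(n_nodes)
    )
```

For example, `semigroup_holds(1000, 2, 3)` compares the two-step and one-step mappings for every node of a network with 1000 nodes.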

C.2 Selecting long-range connections

As we apply the renormalization transformation, some links are integrated inside the supernodes, so they do not contribute to the topology of the renormalized network. In Fig. 8, we show that links joining nodes separated by a large angular distance $\Delta\theta_{ij}$ require larger values of $r$ to be integrated; in other words, the connections in a renormalized network represent long-range connections in the original graph.

C.3 Geometric renormalization of the S1 model

In this subsection, we derive the RG equations of the $\mathbb{S}^{1}$ model. In order to simplify the notation, all unprimed quantities will refer to layer $l-1$ , whereas primed ones will correspond to layer $l$ . Moreover, we consider the particular case in which all supernodes contain the same number of nodes ( $r$ ) for simplicity, although the following calculations are also valid for supernodes of different sizes.

Consider the probability $p_{ij}^{\prime}$ for two supernodes $i$ and $j$ in layer $l$ to be connected, which is given by the probability for at least one link between a pair of the nodes within the supernodes in layer $l-1$ to exist,

[TABLE]

where $e$ runs over all pairs of nodes $(m,n)$ with $m$ in supernode $i$ and $n$ in supernode $j$ . The term $p_{e}$ is the probability for $m$ and $n$ to be connected in layer $l-1$ ,

[TABLE]

Eq. (21) takes the same functional form as Eq. (22),

[TABLE]

with

[TABLE]

Since the angular distance between the nodes inside each block is generally smaller than the distance between $i$ and $j$ , all the $\Delta\theta_{e}$ are approximately equal ( $\Delta\theta_{e}\approx\Delta\theta$ ), so we can write

[TABLE]

The $\mathbb{S}^{1}$ model assumes a uniform density of nodes $\delta=1$ , which means that $R=\frac{N}{2\pi}$ , whereas $\mu$ is a constant independent of $N$ . Indeed, $\frac{\mu}{R}\ll 1$ , so the first term dominates Eq. (25) in most cases. Thus,

[TABLE]

Introducing this result into Eq. (23),

[TABLE]

we see that, in order for the resulting expression to be congruent with the model, we need a set of equations that transform the parameters according to

[TABLE]

Let us now assume that the angular coordinate of a supernode is some generalised center of mass of the nodes it integrates, so the separation between the two renormalised nodes $\Delta\theta_{ij}^{\prime}$ is approximately equal to the angular separation between the nodes that belong to different blocks, i.e. $\Delta\theta_{ij}^{\prime}\approx\Delta\theta$ ; thus, $\beta^{\prime}=\beta$ . The choice $\delta=1$ leads to $R^{\prime}=\frac{R}{r}$ , that is, to the rescaling step. Setting $\mu^{\prime}=\frac{\mu}{r}$ , Eq. (28) further requires

[TABLE]

which is fulfilled if

$$\kappa_{i}^{\prime}=\left(\sum_{j=1}^{r}\kappa_{j}^{\beta}\right)^{1/\beta}.\tag{30}$$

The transformation of masses preserves the semi-group structure exactly, since

[TABLE]

We should require the transformation of angles to preserve it as well. This can be achieved using the following generalised center of mass

[TABLE]

given that

[TABLE]
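The transformation of the model parameters and of the masses can be illustrated with the short sketch below. It uses $\mu^{\prime}=\mu/r$, $R^{\prime}=R/r$ and $\beta^{\prime}=\beta$ as stated above, and assumes that the mass of a supernode is $\kappa^{\prime}=\left(\sum\kappa^{\beta}\right)^{1/\beta}$ over the nodes it contains, which is the form implied by the relation $\kappa^{\prime\beta}=\sum_{r}\kappa^{\beta}$ used in the next subsection. The transformation of angles is not covered. Function names are hypothetical.

```python
import math


def renormalize_masses(kappas, r, beta):
    """Masses of the supernodes obtained by merging blocks of r consecutive nodes."""
    return [sum(k ** beta for k in kappas[b:b + r]) ** (1.0 / beta)
            for b in range(0, len(kappas), r)]


def renormalize_parameters(mu, R, beta, r):
    """Return (mu', R', beta') after one renormalization step with blocks of size r."""
    return mu / r, R / r, beta


def masses_semigroup_holds(kappas, r1, r2, beta):
    """Check that two steps (r1, then r2) give the same masses as one step r1 * r2."""
    two_steps = renormalize_masses(renormalize_masses(kappas, r1, beta), r2, beta)
    one_step = renormalize_masses(kappas, r1 * r2, beta)
    return all(math.isclose(a, b, rel_tol=1e-9) for a, b in zip(two_steps, one_step))
```

The semigroup check is exact up to rounding when the number of nodes is a multiple of $r_{1}r_{2}$, because raising the intermediate masses to the power $\beta$ recovers the partial sums.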

C.4 RG flow of the average degree

As discussed in the previous subsection, as we renormalize, we move in the space of realizations of the $\mathbb{S}^{1}$ model, always keeping the congruency between the network and the hidden metric space, i.e. Eq. (22). Therefore, we can use the $\mathbb{S}^{1}$ model to compute the average degree $\langle k^{\prime}\rangle$ of the renormalised networks. According to Ref. (Krioukov et al., 2010),

[TABLE]

where $C_{0}$ does not change as we renormalize. We thus need to compute $\langle\kappa^{\prime}\rangle$ , where $\kappa^{\prime}$ is given by Eq. (30) and the original distribution of masses is assumed to be a power-law,

[TABLE]

The strategy to compute $\langle\kappa^{\prime}\rangle$ is as follows: 1. We define $z\equiv\kappa^{\beta}$ and find their distribution $\rho_{z}(z)$ . 2. We then calculate $\hat{\rho}_{z}^{r}(s)$ (where $\hat{\rho}_{z}(s)$ is the Laplace transform of $\rho_{z}(z)$ ); according to the convolution theorem, this is the Laplace transform of the variable $z^{\prime}\equiv\sum_{r}z=\kappa^{\prime\beta}$ . 3. Finally, we compute $\langle\kappa^{\prime}\rangle$ as the $1/\beta$ -th moment of $z^{\prime}$ , that is, $\langle\kappa^{\prime}\rangle=\langle z^{\prime 1/\beta}\rangle$ , from $\hat{\rho}_{z}^{r}(s)$ .

1.

From Eq. (35),

[TABLE]

so

[TABLE]

where $\eta=\frac{\gamma-1}{\beta}+1$ .

2.

If $\gamma<2\beta+1$ , then $\eta<3$ , which means that $z^{\prime}$ and, consequently, $\kappa^{\prime}$ are also power-law distributed since the central limit theorem does not apply (the opposite case corresponds to phase III in Fig. 2B) (Gnedenko and Kolmogorov, 1968). The Laplace transform of Eq. (37) is given by

[TABLE]

where $\Gamma(a,b)$ is the incomplete gamma function,

[TABLE]

From this result, it follows that

[TABLE]

3.

We need to compute

[TABLE]

To do so, consider the integral

[TABLE]

Taking into account that for $\alpha>-1$

[TABLE]

we see that

[TABLE]

Now, setting $C^{\prime}=(-1)^{n}\Gamma(1+\alpha)^{-1}$ and $n-1-\alpha=1/\beta$ , $I=\langle z^{\prime 1/\beta}\rangle$ . However, since $\alpha=n-1-1/\beta>-1$ and $n\in\mathbb{N}$ , the smallest $n$ we can choose is $n=1$ , so $\alpha=-1/\beta$ . Finally, we can write

[TABLE]

where $\hat{\rho}_{z^{\prime}}(s)$ is given in Eq. (40).

Particular case $r=2$

To start solving Eq. (45), let us first take the limit of $N\to\infty\Rightarrow\kappa_{c}\to\infty$ , which means that $\hat{\rho}_{z}(s)$ becomes

[TABLE]

Using the same change of variable as in Eq. (38), we see that

[TABLE]

Let us now evaluate $\hat{\rho}_{z^{\prime}}^{\prime}(s)$ ,

[TABLE]

and introduce this result into Eq. (45),

[TABLE]

We thus need to solve an integral of the form

[TABLE]

In our case, $\nu=2\eta-3-1/\beta=(2\gamma-3)/\beta-1>-1\Leftrightarrow\gamma>3/2$ . Integrating by parts,

[TABLE]

We can find a recurrence relation for the integrals in the last expression,

[TABLE]

Iterating yields

[TABLE]

Introducing this result into Eq. (51),

[TABLE]

Finally, Eq. (49) becomes

[TABLE]

Using this result, Eq. (34) and $\kappa_{0}=\langle\kappa\rangle(\gamma-2)/(\gamma-1)$ we can write an expression for the exponent $\nu$ (defined by the expression $\langle k^{\prime}\rangle=r^{\nu}\langle k\rangle$ ):

[TABLE]

The above result is shown in Fig. 9.

Solution in the power-law approximation

From Eq. (55), we see that the exact solution for large $r$ can be extremely convoluted, thus making the limit $r\to\infty$ inaccessible. However, if we consider that $\rho_{\kappa^{\prime}}(\kappa^{\prime})$ is a power-law (which is a reasonable approximation if $\eta<3$ , as discussed above), the computation of $\langle\kappa^{\prime}\rangle$ becomes simpler. Under this assumption, $z^{\prime}$ are also power-law distributed with exponent $-\eta$ , that is,

[TABLE]

We study two cases separately:

i.

$1<\eta<2$ : In this case, we determine the value of $C^{\prime}$ and, with it, $\langle\kappa^{\prime}\rangle=\kappa^{\prime}_{0}\frac{\gamma-1}{\gamma-2}$ . If the assumption in Eq. (57) is correct, $\hat{\rho}_{z^{\prime}}(s)$ must behave as (Handelsman and Lew, 1974)

[TABLE]

According to Eqs. (40) and (46),

[TABLE]

In the above expression, we see that the term that does not depend on $s$ is given by the product of the $r$ terms with $n=0$ , whereas the term of order $s^{\eta-1}$ is given by the sum of the $r$ products of $s^{\eta-1}$ with the remaining $r-1$ terms with $n=0$ . Thus, we find

[TABLE]

We can now identify $C^{\prime}$ as

[TABLE]

so

[TABLE]

and

[TABLE]

Finally, plugging this result into Eq. (34),

[TABLE]

ii.

$2<\eta<3$ : This case is much simpler, since $\langle z\rangle$ and hence $\langle z^{\prime}\rangle$ are finite and can be easily computed. Indeed, given that $\langle z^{\prime}\rangle=r\langle z\rangle$ , we see that

[TABLE]

This result and Eq. (34) together imply

[TABLE]

Both solutions, Eqs. (64) and (66), are equivalent at $\eta=2$ , since

[TABLE]

Therefore, we can conclude that the network flows towards a fully connected graph if $\gamma<3$ or $\beta<2$ . The lines $\gamma=3$ with $\beta>2$ and $\beta=2$ with $\gamma>3$ are unstable fixed points, whereas $\langle k\rangle\to 0$ if $\gamma>3$ and $\beta>2$ . Notice that this assertion is only valid under the assumption in Eq. (57), which is not true in general. However, we expect it to be a good approximation of the flow’s behaviour as $r\to\infty$ .
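The phase structure described above can be summarised in a few lines of Python. The sketch assumes that, in the power-law approximation, the exponent defined by $\langle k^{\prime}\rangle=r^{\nu}\langle k\rangle$ takes the form $\nu=2/(\gamma-1)-1$ for $1<\eta\leq 2$ and $\nu=2/\beta-1$ for $2<\eta<3$, with $\eta=(\gamma-1)/\beta+1$. These expressions are an assumption chosen to be consistent with the phase boundaries stated in the text and with the continuity at $\eta=2$; the function names are hypothetical.

```python
def flow_exponent(gamma, beta):
    """Assumed exponent nu of <k'> = r**nu <k> in the power-law approximation."""
    ratio = (gamma - 1.0) / beta          # eta - 1
    if ratio >= 2.0:
        raise ValueError("eta >= 3: the power-law approximation does not apply")
    if ratio <= 1.0:                      # 1 < eta <= 2
        return 2.0 / (gamma - 1.0) - 1.0
    return 2.0 / beta - 1.0               # 2 < eta < 3


def flow_direction(gamma, beta):
    """Qualitative behaviour of the average degree under repeated renormalization."""
    nu = flow_exponent(gamma, beta)
    if nu > 0:
        return "towards a fully connected graph"
    if nu < 0:
        return "towards vanishing average degree"
    return "marginal"                     # gamma = 3 or beta = 2 boundary lines
```

With these expressions, $\nu>0$ exactly when $\gamma<3$ or $\beta<2$, and $\nu<0$ exactly when $\gamma>3$ and $\beta>2$, within the range $\eta<3$.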

C.5 Mapping to hyperbolic space and the partition function

In this section, we show how the RGN presented in this work can be described in the formalism of statistical physics. As explained in Appendix A, using the mapping to hyperbolic space, the connection probability, Eq. (22), becomes

[TABLE]

where $x_{mn}=r_{m}+r_{n}+2\ln\frac{\Delta\theta_{mn}}{2}$ is a good approximation to the hyperbolic distance between two points with coordinates $(r_{m},\theta_{m})$ and $(r_{n},\theta_{n})$ in the native representation of hyperbolic space.

Now, let $a_{mn}=1$ if the link between nodes $m$ and $n$ exists and $a_{mn}=0$ otherwise; Eq. (68) can be written as

[TABLE]

which means that, in the $\mathbb{H}^{2}$ model, every pair of nodes represents a fermionic state of energy $x_{mn}/2$ in the grand-canonical ensemble with $R_{\mathbb{H}^{2}}/2$ playing the role of the chemical potential. Indeed, since a network can be represented by the set $\{a_{mn}\}$ , the likelihood of a given network is given by

[TABLE]

that is, by the probability of the corresponding microstate of the gas of non-interacting fermions. The partition function of the system is

[TABLE]

When we apply the renormalization transformation, every node $m$ ( $n$ ) is mapped to a supernode $i$ ( $j$ ). We can rearrange the terms in the partition function according to such mapping as

[TABLE]

The first double product in the above expression corresponds to the partial sum over the links among the nodes within every supernode $i$ (hence, there are $N(r-1)/2$ such terms), whereas the second double product represents the partial sum over the links among nodes in different supernodes $i$ and $j$ ; thus, it contains $(N^{2}-Nr)/2$ terms. According to Eqs. (8) and (9),

[TABLE]

so the rightmost term in Eq. (72) reads

[TABLE]

where $\Phi_{ij}^{\prime}$ is given by Eq. (24). Using Eq. (26) and Eq. (28), which is fulfilled with the RG transformations Eqs. (30) and (31), yields

[TABLE]

The leftmost term in Eq. (72) can be written as

[TABLE]

In the particular case of $r=2$ , we integrate consecutive nodes separated by a typical angular distance $\Delta\theta_{t}\approx 2\pi/N$ . Hence, $R\Delta\theta_{t}\approx 1$ , so

[TABLE]

Defining

[TABLE]

we can write Eq. (72) as

[TABLE]

where $Z^{\prime}=\sum_{\{a_{ij}\}}e^{-\beta H^{\prime}(\{a_{ij}\})}$ .

C.6 Local vs. global properties

In the $\mathbb{S}^{1}$ model, we impose three parameters, $\gamma,\beta$ and $\langle\kappa\rangle$ , all three related to local properties of nodes (degree and clustering). However, the RG flow of observables like the average degree should be related to global properties of the system; indeed, we would expect two networks with similar average degree flows to exhibit similarities at the global scale as well, whereas two networks with very different RG trajectories (even in the same phase, i.e., flowing towards the same fixed point) should be easier to distinguish by looking at their global properties. To check this hypothesis, we have generated synthetic networks with different values of $\gamma$ and $\beta$ and compared the eigenvalues of both the adjacency and Laplacian matrices. The results are shown in Figs. 10, 11 and 12. As we see, the RG analysis of the model allows us to assess the stability of the global properties of networks against perturbations of their local ones, and hence the importance of clustering and degree heterogeneity in a given system.

Appendix D Mini-me network replicas

Appendix E Multiscale navigation networks

This section includes some results showing the topological properties of the networks coarse-grained for navigation; Fig. 14 shows the complementary cumulative degree distributions, whereas Fig. 15 contains their clustering spectra.

We also present the empirical connection probabilities of the networks after the coarse-graining for navigation (in which pairs of nodes are merged together into a supernode only if they are connected) in Fig. 16. Notice that the congruency with the underlying metric space is preserved even if the sizes of the blocks are different.

Bibliography

1. Mandelbrot (1961): B. Mandelbrot, in Proceedings of the Twelve Symposia in Applied Mathematics, Roman Jakobson editor. Structure of Language and its Mathematical Aspects, New York, USA (1961) pp. 190–219.
2. Stanley (1971): H. E. Stanley, Introduction to Phase Transitions and Critical Phenomena (Oxford Univ. Press, Oxford, 1971).
3. Gfeller and De Los Rios (2007): D. Gfeller and P. De Los Rios, Phys. Rev. Lett. 99, 038701 (2007).
4. Song et al. (2005): C. Song, S. Havlin, and H. A. Makse, Nature 433, 392 (2005).
5. Goh et al. (2006): K. I. Goh, G. Salvi, B. Kahng, and D. Kim, Phys. Rev. Lett. 96, 018701 (2006).
6. Song et al. (2006): C. Song, S. Havlin, and H. A. Makse, Nature Physics 2, 275 (2006).
7. Kim et al. (2007): J. S. Kim, K. I. Goh, B. Kahng, and D. Kim, New J. Phys. 9, 177 (2007).
8. Radicchi et al. (2008): F. Radicchi, J. J. Ramasco, A. Barrat, and S. Fortunato, Phys. Rev. Lett. 101, 148701 (2008).
