Symmetric motifs in random geometric graphs
Carl P. Dettmann, Georgie Knight

TL;DR
This paper investigates symmetric motifs in random geometric graphs, analyzing their occurrence probabilities, spectral signatures, and the minimum separation distances across different limits and dimensions.
Contribution
It provides a detailed probabilistic and spectral analysis of symmetric motifs, including their prevalence and the minimum separation distance in various limits.
Findings
Symmetric motifs are prevalent and produce distinct spectral peaks.
Probability of closest nodes being symmetric approaches one in the thermodynamic limit.
In the intensive limit, symmetry probability depends on the dimension.
Abstract
We study symmetric motifs in random geometric graphs. Symmetric motifs are subsets of nodes which have the same adjacencies. These subgraphs are particularly prevalent in random geometric graphs and appear in the Laplacian and adjacency spectrum as sharp, distinct peaks, a feature often found in real-world networks. We look at the probabilities of their appearance and compare these across parameter space and dimension. We then use the Chen-Stein method to derive the minimum separation distance in random geometric graphs which we apply to study symmetric motifs in both the intensive and thermodynamic limits. In the thermodynamic limit the probability that the closest nodes are symmetric approaches one, whilst in the intensive limit this probability depends upon the dimension.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\shortauthorlist
C. P. DETTMANN AND G. KNIGHT
Symmetric motifs in random geometric graphs
Carl P. Dettmann∗
School of Mathematics
University of Bristol
University Walk
Bristol BS8 1TW
UK
∗
and
Georgie Knight
School of Mathematics
Corresponding author: [email protected]
University of Bristol
University Walk
Bristol BS8 1TW
UK
Abstract
We study symmetric motifs in random geometric graphs. Symmetric motifs are subsets of nodes which have the same adjacencies. These subgraphs are particularly prevalent in random geometric graphs and appear in the Laplacian and adjacency spectrum as sharp, distinct peaks, a feature often found in real-world networks. We look at the probabilities of their appearance and compare these across parameter space and dimension. We then use the Chen-Stein method to derive the minimum separation distance in random geometric graphs which we apply to study symmetric motifs in both the intensive and thermodynamic limits. In the thermodynamic limit the probability that the closest nodes are symmetric approaches one, whilst in the intensive limit this probability depends upon the dimension. Random geometric graph, spectrum, motif, Chen-Stein method.
2010 Math Subject Classification: 90B15, 47A10, 05C50, 05C80
1 Introduction
Many physical systems like social networks, biological networks, transport networks and technological infrastructures can be modelled using the graph concept of a set of nodes connected by edges. For an introduction see for example [23, 24]. In the study of complex networks, a popular technique is to randomly generate graphs in such a way as to capture certain features of the topology and dynamics of the real-life systems of interest. Some common examples of these are the Erdos-Renyi random graph [11], the Barabási-Albert scale-free network generator [4] and the Watts-Strogatz small-world network generator [34]. A powerful tool for analysing the topology and dynamics of these networks is their graph spectra [7] [15] [21]. In particular, important subgraphs called symmetric motifs can be seen through the presence of sharp peaks in the spectra indicative of the multiplicity of particular eigenvalues, see figure 1 for an illustration of this in the real-world network of the high-voltage power grid in the Western States of the United States of America [34] and see [19] for an analysis of the symmetric motifs in this particular network. Networks such as this are important for an understanding of efficiency and robustness of power-grids [28].
It is known that the presence of symmetric motifs is important for other real-world complex systems [20] and has been shown to influence synchronisation processes [1], [9] and dynamical stability [3], [10] and is also related to redundancy and network stability [19]. However, the non-spatial random graph generators mentioned above rarely contain these important subgraphs. It is known, however, that they do occur in a spatial random graph model the random geometric graph (RGG). RGGs were first introduced by Gilbert as a way of modelling wireless networks [16]. In a RGG the nodes are distributed throughout a given domain uniformly at random and are connected by an edge when they are within a given range of each other. See [26] and [31] for introductions and see figure 2 for an illustration of a RGG.
RGGs are often used to model networks in which the node location is an important factor, so called spatial networks ( See Ref.[5] for a review). In particular RGGs have been used in modelling wireless networks [27], [17], [29], [14], in the study of epidemics [32], [22], [30], in the study of city development [33], in modelling the vulnerability of infrastructure [35] and biological protein-protein interaction networks [18]. In addition the properties of RGGs such as synchronisation [12],[9], consensus dynamics [13], connectivity [8] and spectral properties of RGGs [25], [6] have all been studied.
Here we study the symmetric motifs in RGGs. Our aim is to understand how properties like density and dimension of the RGG affect their appearance. We consider the probability of finding symmetric nodes as a function of the connection radius and in two high density limit cases. In particular, we analyse the symmetric motifs in the intensive limit of fixed connection radius, and the thermodynamic limit of fixed mean degree. It turns out that the expected number of symmetric motifs is non-trivially dependent on these system parameters.
We will study the binomial model of RGGs. That is, we distribute points representing nodes of a network uniformly on the unit torus. A connection is made between two nodes if their toral distance is less than some given range . This is known as the unit disc connection model, but note there are many other connection models in which the links are random with probability depending on the inter-node distance [8].
In section 2 we will look at the adjacency and Laplacian spectrum of some RGG ensembles and discuss in particular the sharp peaks we find in these spectra. We will explain how the symmetric motifs which give rise to these sharp peaks and study the probability of finding them in RGGs. In section 3 we will then use the Chen-Stein method [2] to derive the scaling of the minimum separation distance in RGGs. The Chen-Stein method is a tool for obtaining a bound on the total variation distance between a stochastic process which contains dependent random variables and a corresponding independent process. Here, we apply it to remove the dependence of inter-node distances, and hence obtain the variation of the symmetric motifs with dimension in the above limits. Section 4 contains a summary.
2 Spectrum
We first look at the spectrum of the graph adjacency matrix and Laplacian matrix . The adjacency matrix is the zero-one adjacency matrix whose entries if there is a connection between nodes and and zero otherwise. The Laplacian matrix where has entries , the degree of vertex . Again see figure 1 for an illustration of the spectrum of and in the real-world network of the high-voltage power grid in the Western States of the United States of America [34].
In [25] the Laplacian spectra of one-dimensional RGGs is studied. The authors show that the ensemble-averaged Laplacian spectrum of RGGs on the circle consist of a continuous part and a discrete part consisting of peaks at integer values. We numerically obtained the Laplacian spectrum for an ensemble of RGGs. The ensemble-averaged spectral density is illustrated in figure 3. In [6] the spectra of the adjacency matrix is studied for RGGs. They find that the ensemble averaged spectral density has a discrete part consisting of a peak at . We numerically obtained the adjacency spectrum of an ensemble of RGGs. This is illustrated in figure 4.
In both [25] and [6] they identify the presence of symmetric nodes as the structural phenomenon which gives rise to the multiplicities in the eigenvalues which characterise the spectral densities. These Motifs or graph orbits are subgraphs whose nodes are invariant under permutation of the indices. That is two nodes are symmetric in this sense when they are connected to the same set of nodes. Eigenvectors localise on these symmetric nodes and give rise to integer eigenvalues. To see this let and be symmetric nodes and be a vector with and , all other entries equal to zero. Considering the adjacency matrix we then have
[TABLE]
if and are connected (called a Type-I symmetry). If they are not connected we get eigenvalue [math] (a Type-II symmetry). For the Laplacian we get
[TABLE]
if and are connected. If they are not connected we get eigenvalue . Here is the degree of the vertices . We note that the Laplacian distinguishes between different degrees in these motifs whilst the adjacency does not. Note that for a set of symmetric nodes, there are independent, orthogonal eigenvectors that lead to a multiplicity of for the eigenvalue.
2.1 Symmetry probability
The symmetry of a node is dependent upon the nodes in its neighbourhood and their respective neighbourhoods. The neighbourhood of a node is defined as , that is the region within the connection range of . Two nodes have a shared neighbourhood which is given by the intersection of their respective neighbourhoods. An additional important concept is the excluded neighbourhood, which for two vertices is the region that is within the range of one of the vertices but not both.
A given node is type-I symmetric when there is a node within its neighbourhood and when the excluded neighbourhood of and is empty. This ensures that and are connected and that they are connected to the same set of nodes. If they are not within the neighbourhood of each other but have an empty excluded neighbourhood then they are type-II symmetric.
We calculated the probabilities of finding type-I and type-II symmetric nodes from ensembles of RGGs. The results are illustrated in figure 5. For one-dimensional RGGs the probability of type-I symmetric nodes quickly approaches a constant value as a function of connection radius whilst type-II quickly approaches zero. For two and three -dimensional RGGs we see entirely different behaviour. Most interestingly there is an optimal value of for finding type-I symmetric nodes.
3 Dimension and limiting probabilities
In order to look closer at the differences across dimension, we have looked at the adjacency spectral density of both and RGGs with similar connection probability. The results are illustrated in figure 6 where we see a noticeable difference in that for the RGGs the peak at is far less prominent.
To understand this property we look at the probability that nearest neighbour vertices are symmetric (these are the most likely to be symmetric). We will analyse the minimum separation distance between nodes in a RGG and derive how it scales as the number of nodes increases.
We consider nodes , uniformly distributed on d-dimensional torus and choose an index set which consists of all pairs of nodes . For each in the index set we let be the indicator random variable of event . That is, the event that the nodes are separated by a distance less than . Note that , where is the volume of the unit ball in dimensions. Furthermore, for each we choose a with such that is a neighbourhood of dependence. That is for and are dependent. For each we let
[TABLE]
Consider the sum
[TABLE]
and note that
[TABLE]
where is the minimum distance of any pair of nodes in the RGG. The expectation of , is given by
[TABLE]
We now apply the Chen-Stein method described in the Introduction to remove the dependency of the inter-node distances. From Theorem in Ref.[2] we have that
[TABLE]
where and are constants given by
[TABLE]
[TABLE]
[TABLE]
From Eq.(3) we have
[TABLE]
Considering we note that the are pairwise independent, this implies , therefore
[TABLE]
Finally we note that is independent of all where , hence . Combining Eqs.(11,12) via Eq.(7) we have the following
[TABLE]
We note that from Eq.(6) we have that is of order one when , that is when . Furthermore in this regime from Eq.(13)
[TABLE]
Hence we see that is order one when . From this we derive that scales like for large .
For , two points separated by a distance have an excluded neighbourhood of length . The probability that they are type-I symmetric with for constant is then
[TABLE]
So the closest nodes will be type-I symmetric in this limit. We obtain the same result in the thermodynamic limit, where the mean degree is kept constant with .
For , two points separated by a distance have an excluded neighbourhood equal to two times the area of a circle with radius , minus four times the area of the circular segment with height , that is
[TABLE]
Using for some constant the probability of type-I symmetry in two dimensions is then
[TABLE]
where we use . In the thermodynamic limit with
[TABLE]
this probability goes to one.
With , two points separated by a distance have an excluded neighbourhood equal to two times the volume of a sphere minus four times the volume of the spherical cap with height , that is
[TABLE]
Using for constant ,
[TABLE]
Hence the probability of seeing no nodes in this excluded neighbourhood is
[TABLE]
In the thermodynamic limit with
[TABLE]
The difference we see is that in one and two dimensions the probability that points separated by the minimal distance are symmetric approaches a constant value in the intensive limit of large whilst in three dimensions this probability goes to zero. In the thermodynamic limit this probability goes to one.
4 Summary
We have looked at the appearance of symmetric motifs in random geometric graphs. These subgraphs are of interest as they are prevalent in many real world networks and random geometric graphs but not in other random graph models. We looked at how the probability of finding symmetric nodes is dependent on the connection radius and the dimension of the random geometric graph. We found that in one dimensional random geometric graphs this probability is close to being independent of the connection radius and the density in that it quickly approaches a constant value as is increased. In two and three dimensions in contrast we found that there is a value of at which the probability attains a maximum value. In the thermodynamic limit we found that the closest nodes will be symmetric almost surely, irrespective of the dimension. Whilst in the intensive limit, in three dimensions this probability goes to zero.
Future work in this direction will look to analytically understand the numerical results we presented here on the probability of finding symmetric nodes as a function of connection radius. In addition, it will be possible to study how generalisations of the hard disc connection function affect the appearance of symmetric nodes.
Acknowledgment
This work was supported by the EPSRC grant number EP/N002458/1 for the project Spatially Embedded Networks. The authors are grateful to an anonymous referee for correcting a few of the equations in Section 3.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Arenas, A., Díaz-Guilera, A. & Pérez-Vicente, C. J. (2006) Synchronization processes in complex networks. Physica D: Nonlinear Phenomena , 224 (1–2), 27 – 34, Dynamics on Complex Networks and Applications.
- 2[2] Arratia, R., Goldstein, L. & Gordon, L. (1990) Poisson Approximation and the Chen-Stein Method. Statistical Science , 5 (4), 403–424.
- 3[3] Aufderheide, H., Rudolf, L. & Gross, T. (2012) Mesoscale symmetries explain dynamical equivalence of food webs. New Journal of Physics , 14 (10), 105014.
- 4[4] Barabási, A. & Albert, R. (1999) Emergence of Scaling in Random Networks. Science , 286 (5439), 509–512.
- 5[5] Barthélemy, M. (2011) Spatial networks. Physics Reports , 499 (1–3), 1 – 101.
- 6[6] Blackwell, P., Edmondson-Jones, M. & Jordan, J. (2006) Spectra of adjacency matrices of random geometric graphs. Unpublished .
- 7[7] Chung, F. (1994) Spectral Graph Theory , no. 92 in CBMS Regional Conference Series. Conference Board of the Mathematical Sciences.
- 8[8] Dettmann, C. P. & Georgiou, O. (2016) Random geometric graphs with general connection functions. Phys. Rev. E , 93 , 032313.
