Hamiltonian System Approach to Distributed Spectral Decomposition in Networks
Konstantin Avrachenkov, Philippe Jacquet, Jithin Sreedharan

TL;DR
This paper introduces a novel Hamiltonian system approach for distributed spectral decomposition in large networks, modeling graph spectra as physical systems and applying symplectic integrators for higher resolution eigenvalue detection.
Contribution
It develops efficient distributed algorithms based on Hamiltonian and Lagrangian dynamics to improve spectral resolution and stability in eigenvalue computations for large symmetric graph matrices.
Findings
Effective detection of closely spaced eigenvalues.
Improved stability in numerical spectral computation.
Validated on real-world large networks.
Abstract
Because of the significant increase in size and complexity of the networks, the distributed computation of eigenvalues and eigenvectors of graph matrices has become very challenging and yet it remains as important as before. In this paper we develop efficient distributed algorithms to detect, with higher resolution, closely situated eigenvalues and corresponding eigenvectors of symmetric graph matrices. We model the system of graph spectral computation as physical systems with Lagrangian and Hamiltonian dynamics. The spectrum of Laplacian matrix, in particular, is framed as a classical spring-mass system with Lagrangian dynamics. The spectrum of any general symmetric graph matrix turns out to have a simple connection with quantum systems and it can be thus formulated as a solution to a Schr\"odinger-type differential equation. Taking into account the higher resolution requirement in the…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9| Notation | Meaning |
|---|---|
| , | Graph, Node set and edge set |
| , | No. of nodes, no. of edges |
| Adjacency matrix | |
| Laplacian matrix | |
| Smallest eigenvalues of in ascending order | |
| Eigenvectors corresponding to | |
| Degree of node without including self loop | |
| Neighbor list of node without including self loop | |
| Sampling interval in time domain | |
| Total time frame and no. of samples | |
| vector with index in continuous and discrete time | |
| th component of a vector |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Hamiltonian System Approach to
Distributed Spectral Decomposition in Networks
Konstantin Avrachenkov
INRIA, France
Email: [email protected]
Philippe Jacquet
Nokia Bell Labs, France
Email: [email protected]
Jithin K. Sreedharan
Purdue University, USA
Email: [email protected]
Abstract
Because of the significant increase in size and complexity of the networks, the distributed computation of eigenvalues and eigenvectors of graph matrices has become very challenging and yet it remains as important as before. In this paper we develop efficient distributed algorithms to detect, with higher resolution, closely situated eigenvalues and corresponding eigenvectors of symmetric graph matrices. We model the system of graph spectral computation as physical systems with Lagrangian and Hamiltonian dynamics. The spectrum of Laplacian matrix, in particular, is framed as a classical spring-mass system with Lagrangian dynamics. The spectrum of any general symmetric graph matrix turns out to have a simple connection with quantum systems and it can be thus formulated as a solution to a Schrödinger-type differential equation. Taking into account the higher resolution requirement in the spectrum computation and the related stability issues in the numerical solution of the underlying differential equation, we propose the application of symplectic integrators to the calculation of eigenspectrum. The effectiveness of the proposed techniques is demonstrated with numerical simulations on real-world networks of different sizes and complexities.
I Introduction
Consider an undirected graph with as the vertex set () and as the edge set (). Let be any symmetric matrix associated with . Broadly speaking, we say that is a graph matrix if it has non-zero elements on the edge set. Due to symmetry, the eigenvalues of are real and can be ranked in ascending order as . We investigate efficient techniques to compute the eigenvalues and the corresponding eigenvectors , in a distributed way.
We define two typical matrices which appear frequently in network analysis. First one is the adjacency matrix in which the individual entries are given by
[TABLE]
Since the focus in this paper is on undirected graphs, . The matrix is also called the unweighted adjacency matrix and one can also define a weighted version in which the weight for an edge is replaced by any weight such that . Another matrix which is found very common in many graph theoretic problems is the Laplacian matrix . Here the matrix is a diagonal matrix with diagonal elements equal to degrees of the nodes .
I-A Applications of graph spectrum
The knowledge of ’s and ’s can be made use of in many ways. For instance, spectral clustering is a prominent solution which exploits the first eigenvectors of the Laplacian matrix for identifying the clusters in a network [1]. Another classical use of Laplacian eigenvalues is in computing the number of spanning trees of a graph which is . Many studies have been conducted on the use of the spectrum of the adjacency matrix such as computing the number of traingles of a netwok (both locally and globally) [2], graph dimensionality reduction and link reduction [3] etc.
Two applications relevant to multi-agent and multi-dimensional systems will be explained in detail in Section VI.
I-B Our basic approach
Let be any symmetric graph matrix. Consider the following Schrödinger-type differential equation
[TABLE]
where is a complex valued dimension vector, which can be interpreted as the wave function of a hypothetical quantum system. The solution of this differential equation with the boundary condition is . Subjecting this solution to the Fourier transform provides a decomposition in terms of eigenvalues and eigenvectors as follows:
[TABLE]
where is the Dirac function shifted by . This follows from the eigen-decomposition of the matrix , . In order to avoid the harmonic oscillations created from finite and discretized version of the above Fourier transform, which will mask the Dirac peaks, the following Gaussian smoothing can be performed. For ,
[TABLE]
The right-hand side of the above expression leads us to a plot at each node with Gaussian peaks at each of the eigenvalues and the amplitude of the peak at th eigenvalue is , proportional to the th component in the eigenvector .
The idea is to estimate the solution of (1) at intervals of time for a total of samples. One way to form an estimate to the left-hand side to (3) is:
[TABLE]
In [4], we use the above approximation with various approaches based on diffusion, gossiping and quantum random walks for distributed computation of the eigenvalues and the eigenvectors. Let us discuss some challenges in this approach.
Issues in the computation
While doing numerical experiments, we have observed that the approaches in [4] work well for larger eigenvalues of the adjacency matrix of a graph but they do not perform that well when one needs to distinguish between the eigenvalues which are very close to each other. One of the main techniques proposed there to solve (1) and to find in (4) are via th order Runge-Kutta method and its implementation as a diffusion process in the network. The -th order Runge-Kutta method has the convergence rate of . We have observed that this is the case while checking the trajectory of the associated differential equation solution; the solution diverges, and it happens when a large number of iterations is required (see Section V).
A larger value for is anticipated from our approximation in (4) due to the following facts. From the theory of Fourier transform and Nyquist sampling, the following conditions must be satisfied:
[TABLE]
where is the maximum resolution we require in the frequency (eigenvalue) domain, which is ideally . This explains that when dealing with graph matrices with larger and require higher resolution, will take large values. For instance, in case of the Laplacian matrix, where the maximum eigenvalue is bounded as with as the maximum degree of the graph and the lower eigenvalues are very close to each other, turns out to be typically a very large value.
In what follows, we propose solutions to the above mentioned issues.
I-C Related works
The general idea of using mechanical oscillatory behaviour for the detection of eigenvalues has appeared in a few previous works, see e.g., [5, 6]. Though the technique in [5] is close to ours, our methods differ by focussing on a Schrödinger-type equation and numerical integrators specific to it. Moreover, we demonstrate the efficiency and stability of the methods in real-world networks of varying sizes, in contrast to a small size synthetic network considered in [5], and our methods can be used to estimate eigenvectors as well.
In comparison to [6] we do not deform the system and we use new symplectic numerical integrators [7, 8]. For the problem of distributed spectral decomposition in networks, one of the first and prominent works appeared in [3]. But their algorithm requires distributed orthonormalization at each step and they solve this difficult operation via random walks. But if the graph is not well-connected (low conductance), this task will take a very long time to converge. Our distributed implementation based on fluid diffusion in the network does not require such orthonormalization.
I-D Contributions
We make the following contributions and significantly improve the algorithms from our previous work:
We observe from our previous studies that the stability in trajectory of the differential equation solver is of significant influence in the eigenvalue-eigenvector technique based on (1). Thus, we resort to geometric integrators to ensure the stability. In particular, by modeling the problem as a Hamiltonian system, we use symplectic integrators (SI) which protect the volume preservation of Hamiltonian dynamics, thus preserve stability and improve accuracy. 2. 2.
We propose algorithms that are easy to design without involving many parameters with interdependence, compared to the algorithms proposed in [4].
In the rest of the paper for clarity of presentation we mostly concentrate on the Laplacian matrix as an example for graph matrix . We design algorithms based on Lagrangian as well as Hamiltonian dynamics, to compute the smallest eigenvalues and the respective eigenvectors of the Laplacian matrix efficiently. For simplicity, in this paper we do not consider Gaussian smoothing (3), but the proposed algorithms can be readily extended to include it.
The paper is organized as follows. In Section II, we explain a mass-spring analogy specific to Laplacian matrix and derive a method to identify the spectrum. Section III focuses on general symmetric matrices and develop techniques based on solving the Schrödinger-type equation efficiently. Section IV details a distributed implementation of the proposed algorithm. Section V contains numerical simulations on networks of different sizes. Section VI contains two relevant applications to multi-agent and multi-dimensional systems. Section VII concludes the paper.
For convenience we summarize the important notation used in this paper in Table I.
II Mechanical spring analogy with Lagrangian dynamics
Consider a hypothetical mechanical system representation of the graph in which unit masses are placed on the vertices and the edges are replaced with mechanical springs of unit stiffness. Using either Lagrangian or Netwonian mechanics, the dynamics of this system is described by the following system of differential equations
[TABLE]
The system has the Hamiltonian function as , being the identity matrix of order .
We note that once we obtain by some identification method the frequencies of the above oscillatory system, the eigenvalues of the Laplacian can be immediately retrieved by the simple formula . This will be made clearer later in this section.
Starting with a random initial vector , we can simulate the motion of this spring system. For the numerical integration, the Leapfrog or Verlet method [9] technique can be applied. Being an example of geometric integrator, Verlet method has several remarkable properties. It has the same computational complexity as the Euler method but it is of second order method (Euler method is employed in [4] as the first order distributed diffusion). In addition, the Verlet method is stable for oscillatory motion and conserves the errors in energy and computations [10, Chapter 4]. It has the following two forms. Let and be the approximation of , similarly for . Here is the step size for integration. First, define
[TABLE]
Then, perform the following iterations
[TABLE]
Equivalently, one can do the updates as
[TABLE]
We name the above algorithm as Order-2 Leapfrog.
Solution of the differential equation (6) subject to the boundary values and is
[TABLE]
where we assume the decomposition of based on spectral theorem, i.e., with as the orthonormal matrix with columns as eigenvectors and as the diagonal matrix formed from the eigenvalues. Further simplification of the above expression along the fact that , for any function which can be expressed in terms of power series, gives
[TABLE]
or th component of is
[TABLE]
Now we have
[TABLE]
Taking the real and positive spectrum will give . The whole operation can be approximated by applying an -point FFT on , and taking real values. (To be exact, there is a phase factor to be multiplied to the th point in FFT approximation, and is given by , where we considered the time interval ).
Note that (6) is different from the original differential equation in [4] where it is containing complex coefficients.
III Hamiltonian Dynamics and Relation with Quantum Random Walk
In [4] we have studied the Schrödinger-type equation of the form (1) with taken as the adjacency matrix and as the wave function. Now let us consider a similar equation with respect to the graph Laplacian
[TABLE]
The solution of this dynamics is closely related to the evolution of continuous time quantum random walk and algorithms are developed in [4] based on this observation.
Now since the matrix is real and symmetric, it is sufficient to use the real-imaginary representation of the wave function , . Substituting this representation into equation (7) and taking real and imaginary parts, we obtain the following system of equations
[TABLE]
or equivalently in the matrix form
[TABLE]
Such a system has the following Hamiltonian function
[TABLE]
Next, the very helpful decomposition
[TABLE]
together with the observation that
[TABLE]
leads us to another modification of the leapfrog method known as symplectic split operator algorithm [7]: Initialize with
[TABLE]
then perform the iterations
[TABLE]
and update
[TABLE]
The above modified leapfrog method belongs to the class of symplectic integrator (SI) methods [8, 10, 11]. We name the above algorithm as order-2 SI.
The Hamiltonian system approach can be implemented in two ways:
Form the complex vector at each of the intervals. Then with and approximates at intervals. A direct application of point FFT with appropriate scaling will give the spectral decomposition as in (2). 2. 2.
Note that the formulation in (8) is equivalent to the following differential equations
[TABLE]
which are similar to the one in (6) except the term . Now on the same lines of analysis as in the previous section, taking the real and positive spectrum of just component will give .
III-A Fourth order integrator
The Hamiltonian in (9) associated to the Schrödinger-type equation has a special characteristic that it is seperable into two quadratic forms, which help to develop higher order integrators. The stage integrator has the following form. Between and intervals, we run for ,
[TABLE]
In order to make th order integrator . For our numerical studies we take the optimized coefficients for order- derived in [12]. We call such algorithm as order-4 SI.
IV Distributed Implementation
The order-2 symplectic integrator algorithm in (10)-(13) can be implemented in a distributed fashion such that each node needs to communicate only to its neighbors. The matrix-vector multiplications in (11) and (12), during one iteration of the algorithm, require diffusion of packets or fluids to the neighbors of every node, and fusion of the received fluids from all the neighbors at each node. Each iteration of the algorithm subsequently has two diffusion-fusion cycles and three synchronization points. A diffusion-fusion cycle consists of packets sent in parallel and hence total number of packets exchanged in one iteration of the algorithm is . Since order-2 SI does not require orthonormalization (unlike classical power iteration and inverse iteration methods for computing eigenelements), and the diffusion-fusion cycle is within one hop neighborhood, time delay of the algorithm will not be too significant. The synchronization points definitely pose some constraints, and demand extra resources. We have also considered an asynchronous version of the distributed algorithm, which will be presented in the extended version of the work.
V Numerical Results
The parameters and are chosen in the numerical studies satisfying the constraints in (5). We assume that the maximum degree is known to us.
Note that if the only purpose is to detect eigenvalues, not to compute the eigenvectors, then instead of taking real part of the FFT in the Hamiltonian solution, it is clearly better to compute the absolute value of the complex quantity to get higher peaks. But in the following simulations we look for eigenvectors as well.
For the numerical studies, in order to show the effectiveness of the distributed implementation, we focus on one particular node and plot the spectrum observed at that node. In the plots, indicates the approximated spectrum at frequency observed on node .
V-A Les Misérables network
In Les Misérables network, nodes are the characters in the well-known novel with the same name and edges are formed if two characters appear in the same chapter. The number of nodes is and number of edges is . We look for the spectral plot at a specific node called Valjean (with node ID ), a character in the associated novel.
The instability of the Euler method is clear from Figure 1(a), whereas Figure 1(b) shows the guaranteed stability of Hamiltonian SI. Here the y-axis represents the absolute value of . (Note the difference in the y-axis scale in the figures). Figure 2 shows the result given by the Lagrangian Leapfrog method from Section II. It can be observed that very few smallest eigenvalues are detected using order-2 Leapfrog compared to the SI technique (order-2) in Figure 3. This demonstrates the superiority of the Hamiltonian system approach. Figure 4 shows order-4 SI with much less number of iterations. The precision in order-4 plot can be significantly improved further by increasing the number of iterations.
V-B Coauthorship graph in network science
The coauthorship graph represents a collaborative network of scientists working in network science as compiled by M. Newman [13]. The numerical experiments are done on the largest connected component with and . Figure 5 displays the order-4 SI simulation and it can be seen that even though the eigenvalues are very close, the algorithm is able to distinguish them clearly.
V-C Enron email network
The nodes in this network are the email ID’s of the employees in a company called Enron and the edges are formed when two employees communicated through email111Data collected from SNAP: http://snap.stanford.edu/data. Since the graph is not connected, we take the largest connected component with nodes and edges. The standard MATLAB procedures for eigenelements computation have difficulty to cope with such network sizes. The node under focus is the highest degree node in that component. Simulation result is shown in Figure 6.
VI Applications
In this section we consider two related applications of the distributed spectrum computation to multi-agent and multi-dimensional (nD) systems.
First, we consider the consensus protocol in the wireless sensor networks [14]. We model a wireless sensor network as a random geometric graph, with nodes corresponding to agents (sensors) located on the unit square of the plane and edges correspond to possible communication links within radius . The consensus protocol is described as follows: let be the value of sensor at time slot ,
[TABLE]
or in the matrix form , where is the set of neighbour nodes of node including the node itself, and is a matrix of edge weights. Let the initial value of sensor be . Then, if the consensus protocol converges, we have
[TABLE]
It has been demonstrated in [14] that performance of the best constant consensus protocol is very good in sparse wireless networks. The optimal weight value is given by [15]
[TABLE]
where denotes the -th largest eigenvalue of the graph Laplacian , . Using our distributed approach for the eigenvalues computation, we can propose a completely distributed, self-tuning, best constant consensus protocol.
We note that in the above example of the consensus protocol, the perfect consensus is actually reached only in the limit. Often, in practice we would like to obtain consensus in finite time. The finite-time consensus problem is very challenging and there is no simple solution for its design (see e.g., [16]). In [17] it has been suggested to use Iterative Learning Control (ICL) to achieve finite-time consensus. As in [17] we assume that each agent can be described by a fairly general Markovian dynamics
[TABLE]
where indicates the time slots, whereas indicates the ILC iterations. Here is the zero input responsive function, is the transfer operator with Markovian parameters and is the control input.
The authors of [17] have proposed the following update rule for the control
[TABLE]
with being elements of the graph adjacency matrix and being the learning gains to be designed, and shown that under such update rule, the system “learns” finite-time consensus, i.e.,
[TABLE]
if the following condition holds
[TABLE]
where is the diagonal matrix of gains, is the diagonal matrix of the response function at time and is the spectral radius of matrix . In fact, if the communication graph is undirected and connected, the condition is always satisfied. However, in practice, it is good to be not too aggressive in learning [18], and hence our distributed procedure can be used to choose gains in such a way so that the value estimated by our algorithm will be not too small and not too close to one.
VII Conclusions and future research
We have proposed a distributed approach for the eigenvalue-eigenvector problem for graph matrices based on Hamiltonian dynamics, symplectic integrators and smoothed Fourier transform. We demonstrate with various network sizes that the proposed approach efficiently scales and finds, with higher resolution, closely situated eigenvalues and associated eigenvectors of graph matrices.
In the future we hope to present asynchronous versions of the introduced algorithms, to extend the proposed approaches to some classes of non-symmetric matrices and to design an automatic or a semi-automatic procedure for the identification of dominant eigenvalues and eigenvectors.
Acknowledgement
This work is supported by INRIA Bell Labs joint lab (ADR Network Science). We also would like to thank Leonid Freidovich for stimulating discussions. This is the author version of the IEEE nDS 2017 article.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] U. Von Luxburg, “A tutorial on spectral clustering,” Statistics and computing , vol. 17, no. 4, pp. 395–416, 2007.
- 2[2] C. Tsourakakis, “Fast counting of triangles in large real networks without counting: Algorithms and laws,” in IEEE ICDM , Dec. 2008.
- 3[3] D. Kempe and F. Mc Sherry, “A decentralized algorithm for spectral analysis,” in STOC , Jun. 2004.
- 4[4] K. Avrachenkov, P. Jacquet, and J. K. Sreedharan, “Distributed spectral decomposition in networks by complex diffusion and quantum random walk,” in IEEE INFOCOM , Apr. 2016.
- 5[5] M. Franceschelli, A. Gasparri, A. Giua, and C. Seatzu, “Decentralized estimation of laplacian eigenvalues in multi-agent systems,” Automatica , vol. 49, no. 4, pp. 1031–1036, 2013.
- 6[6] T. Sahai, A. Speranzon, and A. Banaszuk, “Hearing the clusters of a graph: A distributed algorithm,” Automatica , vol. 48, pp. 15–24, 2012.
- 7[7] S. Blanes, F. Casas, and A. Murua, “Symplectic splitting operator methods for the time-dependent schrödinger equation,” The Journal of chemical physics , vol. 124, no. 23, p. 234105, 2006.
- 8[8] ——, “Splitting and composition methods in the numerical integration of differential equations,” ar Xiv preprint ar Xiv:0812.0377 , 2008.
