Sampling And Reconstruction Of Diffusive Fields On Graphs
Siddartha Reddy, Sundeep Prabhakar Chepuri

TL;DR
This paper presents methods for reconstructing diffusive fields and localizing sources on graphs from limited, time-sampled observations, achieving exact recovery in noiseless cases and robustness with noise, with applications to physical systems.
Contribution
It introduces novel sampling and reconstruction techniques for diffusive fields on graphs, enabling source localization without sparsity constraints and handling noisy data.
Findings
Exact recovery possible with noiseless data
Robust performance of least squares estimators with noise
Application demonstrated on heat diffusion in a metal plate
Abstract
In this paper, the focus is on the reconstruction of a diffusive field and the localization of the underlying driving sources on arbitrary graphs by observing a significantly smaller subset of vertices of the graph uniformly in time. Specifically, we focus on the heat diffusion equation driven by an initial field and an external time-invariant input. When the underlying driving sources are modeled as an initial field or external input, the sources (hence the diffusive field) can be recovered from the subsampled observations without imposing any band-limiting or sparsity constraints. When the diffusion is induced by both the initial field and external input, then the field and sources can be recovered from the subsampled observations, however, by imposing band-limiting constraints on either the initial field or external input. For heat diffusion on graphs, we can compensate for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Sampling and Reconstruction of Diffusive Fields on Graphs
Abstract
In this paper, the focus is on the reconstruction of a diffusive field and the localization of the underlying driving sources on arbitrary graphs by observing a significantly smaller subset of vertices of the graph uniformly in time. Specifically, we focus on the heat diffusion equation driven by an initial field and an external time-invariant input. When the underlying driving sources are modeled as an initial field or external input, the sources (hence the diffusive field) can be recovered from the subsampled observations without imposing any band-limiting or sparsity constraints. When the diffusion is induced by both the initial field and external input, then the field and sources can be recovered from the subsampled observations, however, by imposing band-limiting constraints on either the initial field or external input. For heat diffusion on graphs, we can compensate for the unobserved vertices with the temporal samples at the observed vertices. If the observations are noiseless, then the recovery is exact. Nonetheless, the developed least squares estimators perform reasonably well with noisy observations. We apply the developed theory for localizing and recovering hot spots on a rectangular metal plate with a cavity.
Index Terms— Graph signal processing, graph sampling, heat diffusion, non-bandlimited signals, source localization on graphs.
1 Introduction
Graph signal processing extends tools from classical signal processing to deal with data defined on networks and other irregular domains [1, 2, 3]. We often come across such datasets in many diverse applications such as environmental sensing, traffic monitoring, mapping the human brain [4], cybersecurity [5], and social networks, to list a few.
Similar to how we understand many physical phenomena by a partial differential equation that explains the evolution of a spatiotemporal field and relates it to the inducing sources, we can also understand the temporal evolution of data over a network or an irregular domain using a partial differential equation. For example, the heat equation is often used to model the traffic movement, infection or virus spread, or rumor propagation [6, 5].
In this work, we focus on heat diffusion over networks. Specifically, we are interested in recovering diffusive signals on a graph by sampling a significantly smaller subset of vertices of the graph. This essentially amounts to localizing the underlying sources that drive the diffusion process from the observations that are collected at a few nodes. Oftentimes, the sources (e.g., traffic bottleneck or rumor sources) that induce the diffusion process are highly localized in the network or sparse in the vertex domain and hence are not usually bandlimited. Therefore, we require new sampling and recovery methods for graph signals that do not impose any structural or band-limiting constraints, unlike some of the existing graph sampling methods [7, 8, 9]. Although band-limiting constraints are not needed for recovering the second-order statistics of a signal defined on a graph from the subsampled observations, the framework developed in [10] cannot be used for localizing diffusive sources. Spatio-temporal sampling and reconstruction of diffusive fields on a regular domain under the assumption that the inducing sources are sparse are studied in [11, 12]. Assuming that the underlying sources are known, [13] focuses on estimating the time instance when the sources appear. In contrast, we will assume that the start time of the sources are is known.
In this work, we develop a graph sampling method to recover diffusive fields induced by an initial field and/or an external input that does not vary with time. The main results of this paper are as follows. When the underlying driving sources are modeled as an initial field or external input, we can localize and recover the sources by sampling a significantly smaller subset of vertices of the graph uniformly in time and by using a simple least squares estimator. To do so, we do not impose any constraints on the sources such as sparsity or bandlimitedness. Since we can compensate for the unobserved vertices with the temporal samples at the observed vertices, we can recover the sources without imposing any constraints. However, when the diffusion field is due to both the initial field and external input, to reconstruct the diffusive fields from the subsampled observations, we require either the initial field or external input to be bandlimited. If the observations are noiseless, then the recovery is exact. Nonetheless, the developed estimators perform reasonably well with noisy observations.
Throughout this paper, we will use upper (lower) case boldface letters to denote matrices (column vectors), and we will denote sets using calligraphic letters.
2 Graph signals
Consider an undirected graph with vertices (or nodes), where and represent the vertex set and edge set, respectively. Let us denote the graph Laplacian matrix associated with as . A graph signal is a function with being the value of the function at vertex . Let us collect the function values in a length- vector .
For undirected graphs is real symmetric, and hence admits an eigendecomposition with being the eigenvector matrix collecting the eigenvectors and being the diagonal matrix containing the corresponding eigenvalues . Here, refers to a diagonal matrix with its argument on the main diagonal. The eigenvectors and eigenvalues of provide the notion of frequency in the graph setting [2, 3]. Specifically, forms an orthonormal Fourier-like basis for graph signals with the graph frequencies denoted by . The graph Fourier transform of , denoted by , is given by
[TABLE]
We say that a graph signal is bandlimited, if its graph Fourier transform is sparse (i.e., contains a very few nonzero entries). Due to the uncertainty principle [14], a sparse graph signal is not bandlimited in general.
The frequency content of graph signals may be modified using linear shift-invariant graph filters [15] of the form
[TABLE]
where is the frequency response of the graph filter.
3 Data model
Let us consider a signal in a physical domain and temporal domain . We will assume that obeys the heat equation
[TABLE]
where is the Laplace operator, is the diffusion constant, and is the external time-invariant input. When , represents the initial field distribution. Without loss of generality, from now on we will assume .
To solve such a differential equation on a surface or manifold, the manifold is discretized (e.g., using a Delaunay mesh), and the Laplace operator is replaced with a discrete Laplacian matrix (more specifically, a cotan-Laplacian matrix) denoted by . Thus, approximating (3) to
[TABLE]
where and are signals defined on the graph represented by the Laplacian matrix . The differential equation (4) models heat diffusion on graphs, where the diffusion field is induced by and .
The solution to the non-homogenous differential equation (4) is given by [16]
[TABLE]
where denotes the matrix exponential of , is the initial field distribution at . Here, and are, respectively the graph Fourier transforms of and . From (2), we can see that the diffusive field is obtained by filtering and with graph filters having frequency responses and , respectively.
Let us introduce the vectors and , where
[TABLE]
with , and . We can now express (5) compactly as
[TABLE]
Next, let us sample uniformly in time at instances with step size to obtain the data matrix , which is given by
[TABLE]
where and . Also, let us observe a subset of out of mesh points and denote this subset with , where . By introducing a selection matrix that selects the field values at vertices indicated by , we can mathematically relate the subsampled observations to as
[TABLE]
In what follows, we will develop estimators to recover and/or from .
4 Diffusion field induced by or
In this section, we will develop a simple least squares estimator for reconstructing the diffusion field induced by or from the subsampled data matrix . More importantly, we do not impose any band-limiting constraints on the sources. This means that the sources may be sparse in the vertex domain and can model localized events such as rumor or infection sources in a complex network, traffic accidents in a road network, or diffusion of hot spots on a surface, to list a few .
Consider the case in which the diffusion field (5) is induced by only the initial field and the external input . From (5) and (7), we have
[TABLE]
Vectorizing , we get a system of equations in unknowns given by
[TABLE]
where denotes the Khatri-Rao (i.e., columnwise Kronecker) product, refers to the matrix vectorization operator. Here, we have used the property .
Suppose we choose and such that , and if the matrix has full-column rank, then we can estimate using least squares as
[TABLE]
and localize the sources as
[TABLE]
Using this in (5) allows us to compute the diffusive field at any time and at all the vertices. When the diffusion field is induced by with , the least squares estimator for may be developed along the similar lines.
The rank of the Khatri-Rao product of two matrices and (of appropriate dimensions) with no all-zero column satisfies [17]
[TABLE]
When the sampling time instances and the eigenvalues of are distinct, then the Vandermonde matrix will have full column rank of for and by construction does not have an all-zero column. Therefore, selecting rows of such that there are no all-zero columns ensures that the rank of the matrix will be . In fact, observing only one node uniformly in time might result in the matrix that has full column rank. However, in practice, depending on the observation time window, diffusion constant and the spectrum of , might be ill-conditioned. In such cases, may be designed using sparse sensing (or sensor selection) techniques (e.g., see [18, 19]) to obtain a full column rank matrix .
5 Diffusion field induced by and
In this section, we consider the case in which the diffusion field is induced by and a bandlimited time-invariant input , and provide a simple least squares estimator to recover the underlying sources from the subsampled data matrix . Although we restrict to be bandlimited, we do not impose any band-limiting or other structural constraints on . The heat diffusion equation may be used to understand the movement of traffic in cities. Although the usual traffic movement may be assumed to be a smooth signal on a road network, there could exist a localized traffic bottleneck (e.g., due to an accident), which is a sparse non-bandlimited graph signal. Such diffusive fields may be modeled using (4) with a sparse representing the localized events and a bandlimited representing the usual activity.
Recall that if is bandlimited, then will be sparse.Without loss of generality, let us assume that the first entries of are nonzero. Then, the bandlimited (or smooth) signal may be expressed as a linear combination of the first few eigenvectors as
[TABLE]
where .
Vectorizing in (7), we have
[TABLE]
Substituting (9), we get a linear system of equations in unknowns
[TABLE]
If the matrix has full column rank, which requires , we can use least squares to obtain
[TABLE]
and subsequently localize the underlying sources as
[TABLE]
In the previous subsection, we have seen that by appropriately selecting rows of we may obtain a full column rank matrix as has full-column rank. As a consequence, by appropriately selecting rows of will only increase the rank of by . This means that we have to impose some structural constraint on to recover it uniquely from the subsampled data when the diffusive field is induced by both and . In other words, by sampling in time and observing all the nodes, we can recover unknowns and without any band-limiting constraints.
6 Numerical experiments
In this section, we apply the developed theory of graph sampling for reconstructing diffusive fields induced by hot spots on a metal block with a cavity. We use the partial differential equation toolbox from MATLAB to mesh the surface. The generated mesh with vertices is shown in Fig. 1. We observe vertices uniformly in time in the interval s with step size s and . The sampled vertices are also indicated in Fig. 1. We present results for the following two cases: (i) diffusive field induced by , and (ii) diffusive field induced by and .
As discussed in Section 4, to recover diffusive fields induced by with , we do not require any band-limiting constraints. To demonstrate this, for , we use a very sparse vector with only two non-zero entries at vertices and . Since this initial field distribution is highly localized in the vertex domain, it is not bandlimited. Fig. 2(a) shows the initial field distribution at , and Fig. 2(b) shows the evolution of the diffusive field at vertices , , , and for different time instances. In Fig. 2(c), we can see the exact localization of the hot spots in the noiseless setting using a simple linear least squares estimator, and more importantly, without using any sparsity constraints. In Fig. 3, we consider a noisy setting in which the observations in (8) are corrupted with Gaussian noise having zero mean and variance . We show the normalized root mean squared error (RMSE), averaged over 1000 independent Monte-Carlo experiments, for different values of . Although the error decreases as increases, we can see that increasing beyond a certain value does not lead to better performance. This is because becomes ill-conditioned as increases.
For the case in which the diffusion field is induced due to both and , we use a sparse as before, and a bandlimited with . Fig. 2(d) shows the external time-invariant input, which is smooth on the surface. When , we can see in Fig. 2(e) that the field values do not decay with time as earlier. Fig. 2(f) shows the exact recovery of using a simple linear least squares estimator (the reconstruction of is similar to Fig. 2(c), hence not shown), where we do not impose any sparsity constraints for recovering . As before, gathering more samples in time does not lead to better performance as both and become ill-conditioned, and as a consequence we need to sample more vertices.
7 Concluding remarks
In this paper, we discussed the sampling and recovery of diffusive fields on graphs induced by possibly non-bandlimited sources. When the diffusion field is induced by an initial field or a time-invariant external input, we can localize and recover the sources by sampling a significantly smaller subset of nodes uniformly in time without imposing any band-limiting constraints and by using a simple least squares estimator. For diffusive fields induced due to an initial field and external input, we can exactly recover the sources from noiseless subsampled data when we constrain the external input to be bandlimited. When the observations are noiseless, the recovery is exact. In essence, for diffusion models on graphs, we can compensate for the unobserved vertices with the temporal samples at the observed vertices.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Ortega, P. Frossard, J. Kovačević, J. M. Moura, and P. Vandergheynst, “Graph signal processing: Overview, challenges, and applications,” Proceedings of the IEEE , vol. 106, no. 5, pp. 808–828, 2018.
- 2[2] D. I. Shuman, S. K. Narang, P. Frossard, A. Ortega, and P. Vandergheynst, “The emerging field of signal processing on graphs: Extending high-dimensional data analysis to networks and other irregular domains,” IEEE Signal Process. Mag. , vol. 30, no. 3, pp. 83–98, 2013.
- 3[3] A. Sandryhaila and J. M. Moura, “Big data analysis with signal processing on graphs: Representation and processing of massive data sets with irregular structure,” IEEE Signal Process. Mag. , vol. 31, no. 5, pp. 80–90, 2014.
- 4[4] W. Huang, L. Goldsberry, N. F. Wymbs, S. T. Grafton, D. S. Bassett, and A. Ribeiro, “Graph frequency analysis of brain signals,” IEEE Journ. of Sel. Topics in Signal Process. , vol. 10, no. 7, pp. 1189–1203, 2016.
- 5[5] D. Shah and T. Zaman, “Detecting sources of computer viruses in networks: theory and experiment,” in ACM SIGMETRICS Performance Evaluation Review , vol. 38, no. 1. ACM, 2010, pp. 203–214.
- 6[6] Z. Chen, K. Zhu, and L. Ying, “Detecting multiple information sources in networks under the sir model,” IEEE Transactions on Network Science and Engineering , vol. 3, no. 1, pp. 17–31, 2016.
- 7[7] S. Chen, R. Varma, A. Sandryhaila, and J. Kovačević, “Discrete signal processing on graphs: Sampling theory,” IEEE Trans. Signal Process. , vol. 63, no. 24, pp. 6510–6523, 2015.
- 8[8] S. P. Chepuri, Y. C. Eldar, and G. Leus, “Graph sampling with and without input priors,” in Proc. of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , 2018, pp. 4564–4568.
