Spectral partitioning of time-varying networks with unobserved edges

Michael T. Schaub; Santiago Segarra; Hoi-To Wai

arXiv:1904.11930·cs.SI·April 29, 2019

Spectral partitioning of time-varying networks with unobserved edges

Michael T. Schaub, Santiago Segarra, Hoi-To Wai

PDF

TL;DR

This paper introduces a spectral algorithm for community detection in time-varying networks with unobserved edges, leveraging filtered graph signals and a stochastic blockmodel framework, with proven consistency guarantees.

Contribution

It presents a novel spectral method for blind community detection in dynamic networks modeled by latent SBMs, with theoretical analysis and empirical validation.

Findings

01

The algorithm achieves consistent recovery of latent communities.

02

Numerical experiments demonstrate effectiveness on synthetic and real data.

03

The method handles unobserved edges in time-varying networks.

Abstract

We discuss a variant of `blind' community detection, in which we aim to partition an unobserved network from the observation of a (dynamical) graph signal defined on the network. We consider a scenario where our observed graph signals are obtained by filtering white noise input, and the underlying network is different for every observation. In this fashion, the filtered graph signals can be interpreted as defined on a time-varying network. We model each of the underlying network realizations as generated by an independent draw from a latent stochastic blockmodel (SBM). To infer the partition of the latent SBM, we propose a simple spectral algorithm for which we provide a theoretical analysis and establish consistency guarantees for the recovery. We illustrate our results using numerical experiments on synthetic and real data, highlighting the efficacy of our approach.

Equations46

H (L) = k = 0 \sum T h_{k} L^{k} .

H (L) = k = 0 \sum T h_{k} L^{k} .

y = H (L) w,

y = H (L) w,

A_{ij} ∣ g_{i}, g_{j} \sim Ber (Ω_{g_{i}, g_{j}}) .

A_{ij} ∣ g_{i}, g_{j} \sim Ber (Ω_{g_{i}, g_{j}}) .

E [A ∣ G] = G Ω G^{⊤} .

E [A ∣ G] = G Ω G^{⊤} .

y^{(ℓ)} = H (L^{(ℓ)}) w^{(ℓ)}, ℓ = 1, \dots, m .

y^{(ℓ)} = H (L^{(ℓ)}) w^{(ℓ)}, ℓ = 1, \dots, m .

C_{y}^{m} : = (1/ m) \sum_{ℓ = 1}^{m} (y^{(ℓ)}) (y^{(ℓ)})^{⊤} \vspace - .4 c m

C_{y}^{m} : = (1/ m) \sum_{ℓ = 1}^{m} (y^{(ℓ)}) (y^{(ℓ)})^{⊤} \vspace - .4 c m

C_{y} : = E [y^{(ℓ)} (y^{(ℓ)})^{⊤}],

C_{y} : = E [y^{(ℓ)} (y^{(ℓ)})^{⊤}],

\big{\|}\widehat{\bm{C}}_{y}^{m}-{\bm{C}}_{y}\big{\|}_{2}\leq c_{0}\!~{}(\log\log n)^{2}\left(\frac{n}{m}\right)^{\frac{1}{2}-\frac{2}{q}}\;,

\big{\|}\widehat{\bm{C}}_{y}^{m}-{\bm{C}}_{y}\big{\|}_{2}\leq c_{0}\!~{}(\log\log n)^{2}\left(\frac{n}{m}\right)^{\frac{1}{2}-\frac{2}{q}}\;,

∥ y^{(ℓ)} ∥_{2} = ∥ H (L^{(ℓ)}) w^{(ℓ)} ∥_{2} \leq ∥ H (L^{(ℓ)}) ∥_{2} ∥ w^{(ℓ)} ∥_{2},

∥ y^{(ℓ)} ∥_{2} = ∥ H (L^{(ℓ)}) w^{(ℓ)} ∥_{2} \leq ∥ H (L^{(ℓ)}) ∥_{2} ∥ w^{(ℓ)} ∥_{2},

∥ y^{(ℓ)} ∥_{2} \leq \overset{ˉ}{h} ∥ w^{(ℓ)} ∥_{2} .

∥ y^{(ℓ)} ∥_{2} \leq \overset{ˉ}{h} ∥ w^{(ℓ)} ∥_{2} .

∥ y^{(ℓ)} ∥_{2} \leq c \overset{ˉ}{h} n a . s .,

∥ y^{(ℓ)} ∥_{2} \leq c \overset{ˉ}{h} n a . s .,

∣ ⟨ y^{(ℓ)}, u ⟩ ∣ \leq ∥ y^{(ℓ)} ∥_{2} ∥ u ∥_{2} \leq \overset{ˉ}{h} ∥ w^{(ℓ)} ∥_{2} .

∣ ⟨ y^{(ℓ)}, u ⟩ ∣ \leq ∥ y^{(ℓ)} ∥_{2} ∥ u ∥_{2} \leq \overset{ˉ}{h} ∥ w^{(ℓ)} ∥_{2} .

\begin{split}\big{(}\mathbb{E}[|\langle{\bm{y}}^{(\ell)},{\bm{u}}\rangle|^{q}]\big{)}^{1/q}&\leq\bar{h}\!~{}(\mathbb{E}[\|{\bm{w}}^{(\ell)}\|_{2}^{q}])^{1/q}\leq\bar{h}\!~{}W_{0}\;.\end{split}

\begin{split}\big{(}\mathbb{E}[|\langle{\bm{y}}^{(\ell)},{\bm{u}}\rangle|^{q}]\big{)}^{1/q}&\leq\bar{h}\!~{}(\mathbb{E}[\|{\bm{w}}^{(\ell)}\|_{2}^{q}])^{1/q}\leq\bar{h}\!~{}W_{0}\;.\end{split}

p_{1} := E [H_{ii}^{2}] for all i,

p_{1} := E [H_{ii}^{2}] for all i,

p_{3} := E [H_{ij}^{2}] for i \neq \sim j,

p_{5} := E [H_{ii} H_{j i}] for i \sim j,

p_{7} := E [H_{ii} H_{j i}] for i \neq \sim j,

C_{y} = (c_{3} - c_{1}) I + G (c_{1} c_{2} c_{2} c_{1}) G^{⊤},

C_{y} = (c_{3} - c_{1}) I + G (c_{1} c_{2} c_{2} c_{1}) G^{⊤},

[C_{y}]_{ii}

[C_{y}]_{ii}

\displaystyle=\mathbb{E}\Big{[}\sum_{j}H_{ij}^{2}w_{j}^{2}+\sum_{j,k}H_{ij}w_{j}H_{ik}w_{i}\Big{]}

= j \sum E [H_{ij}^{2}] E [w_{j}^{2}] + j, k \sum E [H_{ij} H_{ik}] E [w_{j}] E [w_{i}]

[C_{y}]_{ii}

[C_{y}]_{ii}

\displaystyle=p_{1}+\Big{(}\frac{n}{2}-1\Big{)}p_{2}+\frac{n}{2}p_{3}=c_{3}.

[C_{y}]_{ij}

[C_{y}]_{ij}

\displaystyle\overset{(a)}{=}\mathbb{E}\Big{[}\sum_{l}H_{il}H_{jl}w_{l}^{2}\Big{]}\overset{(b)}{=}\sum_{l}\mathbb{E}[H_{il}H_{jl}],

[C_{y}]_{ij}

[C_{y}]_{ij}

\displaystyle=2p_{5}+\Big{(}\frac{n}{2}-2\Big{)}p_{4}+\frac{n}{2}p_{6}=c_{1}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Spectral partitioning of time-varying networks with unobserved edges

Abstract

We discuss a variant of ‘blind’ community detection, in which we aim to partition an unobserved network from the observation of a (dynamical) graph signal defined on the network. We consider a scenario where our observed graph signals are obtained by filtering white noise input, and the underlying network is different for every observation. In this fashion, the filtered graph signals can be interpreted as defined on a time-varying network. We model each of the underlying network realizations as generated by an independent draw from a latent stochastic blockmodel (SBM). To infer the partition of the latent SBM, we propose a simple spectral algorithm for which we provide a theoretical analysis and establish consistency guarantees for the recovery. We illustrate our results using numerical experiments on synthetic and real data, highlighting the efficacy of our approach.

**Index Terms— ** graph signal processing, topology inference, stochastic blockmodel, community detection, spectral methods

1 Introduction

Graph-based tools have become prevalent for the analysis of a range of different systems across the sciences [1, 2, 3]. However, while in many applications we abstract the system under investigation as a network of coupled entities, the underlying couplings are often not known. Network inference, the problem of determining the interaction topology of a networked system based on a set of nodal observables, has thus gained significant interest over the last years [4, 5, 6]. A number of notions for network inference have featured in the literature, ranging from estimating ‘functional’ couplings based on statistical association measures such as correlation or mutual information [7], all the way to causal inference [8]. The notion of inference most pertinent to our discussion is what may be called ‘topological’ inference: given a system of dynamical units, we want to infer their direct physical interactions. For example, we would like to infer the adjacency matrix of the network that a distributed system is defined on. This problem has received wide interest in the literature recently, using techniques from optimization, spectral analysis, and statistics [9, 10, 11, 12, 13, 14, 15, 16, 17, 18]. However, in many situations the goal of inferring the exact network of couplings may be unfeasible for various reasons. First, we may not have access to a sufficiently large number of samples to fully identify the network. Second, the network structure itself may be subject to fluctuations over time. Finally, we may be able to observe only some relevant parts of the system.

The described challenges need not be fundamental roadblocks since in a number of cases our ultimate target is not to obtain the exact network structure. Rather, our goal is to extract certain mesoscopic features of the network such as important nodes, motifs, or levels of assortativity. A typical scenario in these lines is the inference of modular structure within the network, i.e., the partitioning of the network into a few blocks, or communities of ‘similar’ nodes according to certain criteria (see [19, 20, 21] for a review on a variety of different approaches). In this context, the so-called stochastic blockmodel and its related variants [21] have become a major tool for solving this problem from a statistical perspective. By assuming that the observed network data has been created according to a prescribed generative model, the problem of detecting modular structure is transformed into an estimation problem in which we aim to infer the latent parameters of the model, based on the observed network.

Inspired by our recent work on blind community detection [22, 23, 24], in this paper we ask the following question [24]:

Can we infer the latent partition of a stochastic blockmodel based solely on the observation of a set of nodal signals on the graph without ever observing the underlying graph itself?

Contributions and outline We present a fresh look on the network inference problem by advocating an inference approach based on a latent generative model of the network, rather than trying to infer the exact network in terms of its adjacency matrix. As we show, this model-based inference procedure that requires only the knowledge of a set of sampled nodal observations can yield surprisingly good results, that are competitive with spectral clustering in which the full network is observed. We complement the presentation of our blind identification algorithm with a theoretical analysis, in which we we show the statistical consistency of our approach using concentration inequalities and recent results from random matrix theory.

In the remainder of this article, we first discuss our problem setup and associated preliminaries in Section 2. Section 3 describes our main theoretical results, which underpin our partition inference scheme. Section 4 provides numerical illustrations of our results both using synthetic and real-world data. We conclude with a brief discussion and an outlook on future directions in Section 5.

2 Problem Formulation

Graphs, graph signals, and graph filters. An undirected graph $\mathcal{G}$ consists of a set $\mathcal{N}$ of $n:=|\mathcal{N}|$ nodes, and a set $\mathcal{E}$ of ${n_{e}:=|\mathcal{E}|}$ edges, corresponding to unordered pairs of elements in $\mathcal{N}$ . By identifying the node set $\mathcal{N}$ with the natural numbers $1,\ldots,n$ , such a graph can be compactly encoded by the symmetric adjacency matrix $\bm{A}$ , such that $A_{ij}=A_{ji}=1$ for all $(i,j)\in\mathcal{E}$ , and $A_{ij}=0$ otherwise. Given a graph with adjacency matrix $\bm{A}$ , the (combinatorial) graph Laplacian is defined as $\bm{L}:=\bm{D}-\bm{A}$ , where $\bm{D}=\text{diag}(\bm{A}\bm{1})$ is the diagonal matrix containing the degrees of each node. We denote the spectral decomposition of the Laplacian by $\bm{L}=\bm{V}\bm{\Lambda}\bm{V}^{\top}$ . It is well known that the Laplacian matrix is positive semi-definite [25].

In this paper, we consider filtered signals defined on the graph as described next. A graph signal is a vector $\bm{y}\in\mathbb{R}^{n}$ that associates to each node in the graph a scalar-valued observable. A graph filter $\bm{\mathcal{H}}$ of order $T$ is a linear map between graph signals that can be expressed as a matrix polynomial in $\bm{L}$ of degree $T$

[TABLE]

Associated with each graph filter, we define the (scalar) generating polynomial $h(\lambda)=\sum_{k=0}^{T}h_{k}\lambda^{k}$ . In this work we are concerned with filtered graph signals that can be expressed as

[TABLE]

where $\bm{w}$ is an excitation signal corresponding to the ‘initial condition’. We assume that it is zero-mean and white, i.e., $\mathbb{E}[\bm{w}\bm{w}^{\top}]=\bm{I}$ , and its entries are bounded almost surely.

Combined with a set of appropriately chosen filter-coefficients, the above signal model can account for a range of interesting signal transformations and dynamics. This includes consensus dynamics [26], random walks and diffusion [27], as well as more complicated dynamics that can be mediated via interactions commensurate with the graph topology described by the Laplacian [28].

Stochastic blockmodel. The stochastic blockmodel (SBM) is a latent variable model that defines a probability measure over the set of unweighted networks of fixed size $n$ . In an SBM, the network is assumed to be divided into $k$ groups of nodes. Each node $i$ in the network is endowed with one latent group label $g_{i}\in\{1,\ldots,k\}$ . Conditioned on these latent class labels, each link $A_{ij}$ of the adjacency matrix $\bm{A}\in\{0,1\}^{n\times n}$ is a Bernoulli random variable that takes value $1$ with probability $\Omega_{g_{i},g_{j}}$ and value [math] otherwise:

[TABLE]

To compactly describe the model, we collect all the link probabilities between the different groups in the symmetric affinity matrix $\bm{\Omega}=[\Omega_{ij}]\in[0,1]^{k\times k}$ . Furthermore we define the partition indicator matrix $\bm{G}\in\{0,1\}^{n\times k}$ with entries $G_{ij}=1$ if node $i$ belongs to group $j$ and $G_{ij}=0$ otherwise. Based on these definitions, we can write the expected adjacency matrix under the SBM as

[TABLE]

Observation model and network model inference. We observe a nodal signal $\bm{y}^{(\ell)}$ on a network at $m$ instances. For each instance, we obtain a sample of the form

[TABLE]

For every $\ell$ , we assume that the Laplacian $\bm{L}^{(\ell)}$ is computed from the adjacency matrix of an independently drawn SBM network with a constant parameter matrix $\bm{\Omega}$ . Moreover, the initial conditions $\bm{w}^{(\ell)}$ are i.i.d. with zero mean and $\mathbb{E}[{\bm{w}}^{(\ell)}({\bm{w}}^{(\ell)})^{\top}]={\bm{I}}$ .

Our goal is now to solve the following problem. {problem} Consider the observation model described by Equation 5. Based solely on the $m$ observations $(\bm{y}^{(1)},\ldots,\bm{y}^{(m)})$ , infer the group structure of the latent SBM generating $\bm{L}^{(\ell)}$ .

To motivate this setup, consider the example of observing fMRI signals of $m$ different patients in resting state [29]. While for similar patients the overall large-scale structure of each patient’s brain network will be similar (the same SBM parameters), the individual details of these networks will be different (each network is a particular realization of the SBM). Moreover, we do not observe the network itself but only node-measurements ( $\bm{y}^{(\ell)}$ ), which will generally correspond to different, unknown independent initial conditions ( $\bm{w}^{(\ell)}$ ). As a second example, we may think of measuring some node activities such as the expression of opinions at $m$ different, sufficiently separated instances of time in some form of social network. Assuming a reasonable stable social fabric, the large scale features of the latent (unobserved) network should be relatively stable, while the individual active links in each observation instance may be different.

3 Algorithm and Theoretical Analysis

Algorithm 1 describes a simple spectral method to solve Problem 2. In a nutshell, given the observations $\{\bm{y}^{(\ell)}\}_{\ell=1}^{m}$ , we compute their sample covariance $\widehat{\bm{C}}_{y}^{m}$ as in (6) and then apply $k$ -means clustering on the leading eigenvectors of $\widehat{\bm{C}}_{y}^{m}$ . For simplicity, we assume here that the number of groups $k$ of the SBM is known. However, $k$ could be estimated as well from the spectral properties of the covariance matrix, e.g., by estimating its effective rank.

To theoretically assess the performance of the proposed method, we present an analysis in three steps. First, we characterize the rate of convergence of the sample covariance to the true covariance ${\bm{C}}_{y}$ (cf. Proposition 1). Second, we determine the structure of the limiting matrix ${\bm{C}}_{y}$ (cf. Proposition 2). Finally, we show that the eigenstructure of ${\bm{C}}_{y}$ contains all the information needed to solve Problem 2 (cf. Proposition 3).

Recall the definition of the covariance matrix

[TABLE]

where the expected value is taken over both sources of randomness, i.e., the excitation signal ${\bm{w}}^{(\ell)}$ as well as the Laplacian $\bm{L}^{(\ell)}$ of the realized graph. Based on this, the following result can be shown.

Proposition 1

Assume that the following conditions hold:

(a)

The spectral norm of the graph filter is uniformly bounded, i.e., $\|\bm{\mathcal{H}}(\bm{L}^{(\ell)})\|_{2}\leq\bar{h}$ for all $\ell$ . 2. (b)

The excitation signal satisfies $\|{\bm{w}}^{(\ell)}\|_{2}\leq c\sqrt{n}$ almost surely, and $(\mathbb{E}[\|{\bm{w}}^{(\ell)}\|_{2}^{q}])^{1/q}\leq W_{0}<\infty$ for some $q\geq 4$ .

Then, for any $m\geq n\geq 4$ , with probability at least $1-\delta$ , one has

[TABLE]

where the constant $c_{0}$ depends on $q$ , $\bar{h}$ , $\delta$ , and $W_{0}$ .

Proof. Observe that the following bound

[TABLE]

combined with condition (a) implies that

[TABLE]

To show that $\widehat{\bm{C}}_{y}^{m}$ converges to its expected value, first we observe from (9) that

[TABLE]

if $\|{\bm{w}}^{(\ell)}\|_{2}\leq c\sqrt{n}$ for some $c$ almost surely. Second, consider any ${\bm{u}}$ such that $\|{\bm{u}}\|_{2}=1$ , we have

[TABLE]

Applying (11), for any $q\geq 1$ , one has

[TABLE]

From (10) and (12), the two conditions in [30, Eq. (2.2)] hold. Invoking [30, Theorem 6.1] shows the desired result in (8). $\blacksquare$

The conditions required by the proposition are mild. For instance, condition (a) holds for graph filters that are low-pass [22]. Indeed, in such a case we have that $\|\bm{\mathcal{H}}(\bm{L}^{(\ell)})\|_{2}\leq h(0)$ , where $h(\cdot)$ is the generating polynomial of the filter $\bm{\mathcal{H}}(\cdot)$ . Condition (b) holds with for $q\geq 4$ when the excitation signal is bounded, e.g., $w_{i}^{(\ell)}$ is i.i.d. and distributed with ${\cal U}[-b,b]$ , $b<\infty$ . The proposition shows that the sampled covariance converges to the true covariance at a rate ${\cal O}(1/m^{\frac{1}{2}-\frac{2}{q}})$ . In particular, the convergence rate is ${\cal O}(\sqrt{1/m})$ in the case of bounded excitation signals, where $q$ can be made arbitrarily large.

Notice that Proposition 1 concerns general covariance matrices and does not use the fact that $\bm{L}^{(\ell)}$ is the Laplacian of a graph drawn from an SBM. In order to derive results about the recovery of the latent communities, we will have to put this assumption into place. For simplicity, we consider in the theoretical considerations that follow a simple planted partition model of size $n$ , in which only two equally sized communities of size $n/2$ exist [21]. Nonetheless, the arguments that follow can be extended to general SBMs.

In our planted partition model, the probability of an edge between two nodes within the same community is governed by the parameter $a$ whereas the probability of a link between two nodes of different communities is described by parameter $b$ . Given two nodes $i$ and $j$ , the expression $i\sim j$ denotes that both nodes lie in the same block of the SBM, whereas $i\not\sim j$ indicates the contrary. Moreover, for simplicity we denote by $\bm{H}=\bm{\mathcal{H}}({\bm{L}}^{(\ell)})$ the (random) matrix representing the filter of interest. We use the following parameters to denote the expected entries of $\bm{H}$ :

[TABLE]

Based on the introduced notation, we characterize the covariance structure of our observed output signals.

Proposition 2

The covariance ${\bm{C}}_{y}$ defined in (7) is given by

[TABLE]

where $\bm{G}\in\{0,1\}^{n\times 2}$ is the partition indicator matrix as defined before (4), and the constants $c_{i}$ are given by $c_{1}=(\frac{n}{2}-2)p_{4}+2p_{5}+\frac{n}{2}p_{6}$ , $c_{2}=2(\frac{n}{2}-1)p_{8}+2p_{7}$ , and $c_{3}=p_{1}+(\frac{n}{2}-1)p_{2}+\frac{n}{2}p_{3}$ .

Proof. Consider first the diagonal entries of ${\bm{C}}_{y}$ , we have that

[TABLE]

Using the fact that $\mathbb{E}[w_{j}^{2}]=1$ and $\mathbb{E}[w_{j}]=0$ , it follows that

[TABLE]

Next, we consider an off-diagonal entry in ${\bm{C}}_{y}$ within a block of the SBM, i.e., for $i\sim j$ but $i\neq j$ we have that

[TABLE]

where (a) follows from $\mathbb{E}[w_{l}w_{k}]=0$ whenever $l\neq k$ , and (b) used that $\mathbb{E}[w_{l}^{2}]=1$ . From the above it then follows that

[TABLE]

Finally, considering $i$ and $j$ in different blocks, we can similarly show that $[{\bm{C}}_{y}]_{ij}=c_{2}$ . By combining this result with (14) and (3), expression (13) readily follows. $\blacksquare$

An important consequence of 2 is the resulting spectral decomposition of ${\bm{C}}_{y}$ and how this eigenstructure relates to the planted (true) communities in the underlying SBM. The following proposition combines the results from Propositions 1 and 2 and justifies (asymptotically) the performance of Algorithm 1 in recovering the true communities.

Proposition 3

Assume that the conditions in Proposition 1 hold, and that $c_{1}>|c_{2}|$ , as defined in Proposition 2. Then, for a large enough number of observations $m$ , Algorithm 1 is guaranteed to recover the two communities of the equisized planted partition model.

Proof. Direct computation from expression (13) reveals that the vector of all ones $\mathbf{1}$ is an eigenvector of ${\bm{C}}_{y}$ with associated eigenvalue $\mu_{1}:=\frac{n}{2}(c_{1}+c_{2})+(c_{3}-c_{1})$ . Similarly, the signed binary vector $\pm\mathbf{1}:={\bm{G}}[1,-1]^{\top}$ whose sign indicates membership to each community is also an eigenvector of ${\bm{C}}_{y}$ but with eigenvalue $\mu_{2}:=\frac{n}{2}(c_{1}-c_{2})+(c_{3}-c_{1})$ . Every other eigenvector is associated with the eigenvalue $\mu:=c_{3}-c_{1}$ . Given that Algorithm 1 keeps the top- $2$ eigenvectors of $\widehat{\bm{C}}_{y}^{m}$ , it follows from the concentration result in Proposition 1 that whenever $\mu_{1}>\mu$ and $\mu_{2}>\mu$ , the eigenvectors selected by our algorithm will be arbitrarily close to $\mathbf{1}$ and $\pm\mathbf{1}$ for large enough $m$ , thus leading to perfect recovery. Hence, we need $c_{1}+c_{2}>0$ and $c_{1}-c_{2}>0$ , from where $c_{1}>|c_{2}|$ follows. $\blacksquare$

The constants $c_{1}$ and $c_{2}$ depend on the parameters $p_{4}$ through $p_{8}$ , which in turn depend on the filter specification $h(\cdot)$ and the probabilities $a$ and $b$ in the considered SBM. Whenever $a=b$ , it can be shown that $c_{1}=c_{2}$ , thus preventing the recovery of the planted true communities, as expected. Given a generic filter for which $c_{1}>|c_{2}|$ if $a\neq b$ , however, even a minimal difference between $a$ and $b$ will result asymptotically in a perfect recovery. This is in contrast with the detectability limit that holds for the SBM recovery problem with an observed network, where the partitions cannot be recovered if $a$ is too close to $b$ [21]. The reason behind the improved resolution here is that in our problem each sample $\bm{y}^{(\ell)}$ corresponds to an (indirect) observations of a different graph drawn from the same SBM, allowing us to detect communities for large enough samples $m$ even in the most adverse scenarios. When inferring an SBM from a single network observation, one cannot (indirectly) leverage such additional graph samples, resulting in a detectability limit [21].

4 Numerical Experiments

Synthetic data. We first examine the claims made in the paper using synthetic data. We draw graphs from an SBM with $n=100$ nodes and $k=2$ communities, with $\Omega_{g_{i},g_{j}}=4\log n/n$ if $g_{i}=g_{j}$ , and $\Omega_{g_{i},g_{j}}=4\gamma\log n/n$ otherwise, parametrized by $\gamma\in(0,1)$ . Note that the smaller $\gamma$ is, the easier it is to detect the communities. Throughout the section, the input signal is i.i.d. and set as ${\bm{w}}^{(\ell)}\sim{\cal U}[-1,1]^{n}$ . The graph filter considered is ${\cal H}({\bm{L}})=({\bm{I}}-\alpha{\bm{L}}^{(\ell)})^{5}$ where $\alpha=1/(4+4\gamma)\log n$ ensures that $\|\bm{\mathcal{H}}(\bm{L}^{(\ell)})\|<1$ for all $\ell$ .

In Fig. 1 we simulate the error rate of the partition inference over different settings of $\gamma$ , against the sample size $m$ using our proposed method. We found that the error rate decays to zero asymptotically as $m\rightarrow\infty$ regardless of the connectivity probability parameter $\gamma$ . Moreover, the error rate is markedly better compared to the application of standard spectral clustering (SC) on a single instance of the graph Laplacian. Note that this holds even if the graph considered for SC is taken from an SBM with $\gamma=0.1$ , in line with our discussion at the end of Section 3.

United States Senate data. We apply the proposed method to rollcall data (available at https://voteview.com) taken from the 110th to 114th congress of the US Senate (corresponding to years 2007 to 2017) consisting of $m=2998$ rollcalls. Using this data we focus on inferring partitions of a network in which the nodes represent the $n=50$ states of USA. To convert the data into real-valued graph signals that agree with our time varying topology model, the $\ell$ th rollcall data is mapped into a sample graph signal ${\bm{y}}^{\ell}\in\mathbb{R}^{50}$ as follows. For each state $i\in\{1,...,50\}$ , we compute $y_{i}^{\ell}\in[-1,1]$ as the average vote value from the two senators of each state, where the vote value counts a ‘Yay’ as $1$ , an absentee or an abstain as [math], and a ‘Nay’ as $-1$ . Note that with the framework of our model, we assume that the community a state belongs to remains invariant since the economic/political situation of the state varies slowly in general, even though senators maybe elected in/out during different periods.

Fig. 2 shows the partitions of the states at different resolution ( $k=2,4$ ) based on the rollcall data from the combined periods of 2007-2017 (Fig. 2a,b) and from the latest period 2015-2017 (Fig. 2c,d), respectively. At a resolution of $k=2$ , the partition result corroborates the common belief about the division between ‘Republican’ (red, e.g., Texas & Arizona) and ‘Democrat’ (blue, e.g., California & Massachusetts) states, with the 2015-2017 data reflecting recent changes in the elected senators for states such as Maine and New Hampshire. We also remark that for $k=4$ , the partitioning result using 2015-2017 data is less conclusive as it changes substantially when we sample a small batch of rollcall data. Such instability is not observed in the 2007-2017 data at the same resolution, where the partition identifies some of the ‘swing’ states such as Michigan and Louisiana.

5 Discussion

Network inference is often a critical step to perform any kind of network analysis. In certain cases, however, we are only interested in extracting some coarser features of the network, e.g., in the form of communities [22, 23, 24, 31]. As we have shown in this manuscript, if we have access to a set of independent samples from a filtered signal defined on the nodes of the network, this task can be achieved even in the absence of any information about the edges. As we have discussed for the system studied here, if the underlying network is time-varying but its latent structure remains stationary, we may even obtain a better partition recovery performance when compared to observing a single full snapshot of the actual network. Characterizing this trade-off and the sample complexity of the corresponding problems in more detail, as well as enlarging the class of latent models and considered graph filters are interesting avenues for future work.

Bibliography31

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Steven H. Strogatz, “Exploring complex networks,” Nature , vol. 410, no. 6825, pp. 268–276, Mar. 2001.
2[2] Mark E. J. Newman, Networks: An Introduction , Oxford University Press, USA, Mar. 2010.
3[3] Matthew O Jackson, Social and economic networks , Princeton university press, 2010.
4[4] Marc Timme and Jose Casadiego, “Revealing networks from dynamics: an introduction,” Journal of Physics A: Mathematical and Theoretical , vol. 47, no. 34, pp. 343001, 2014.
5[5] Patrik D’haeseleer, Shoudan Liang, and Roland Somogyi, “Genetic network inference: from co-expression clustering to reverse engineering,” Bioinformatics , vol. 16, no. 8, pp. 707–726, 2000.
6[6] Ivan Brugere, Brian Gallagher, and Tanya Y Berger-Wolf, “Network structure inference, a survey: Motivations, methods, and applications,” ACM Computing Surveys (CSUR) , vol. 51, no. 2, pp. 24, 2018.
7[7] Jonathan Friedman and Eric J Alm, “Inferring correlation networks from genomic survey data,” P Lo S computational biology , vol. 8, no. 9, pp. e 1002687, 2012.
8[8] Judea Pearl, “Causal inference in statistics: An overview,” Statistics surveys , vol. 3, pp. 96–146, 2009.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Spectral partitioning of time-​varying networks with unobserved edges

Abstract

1 Introduction

2 Problem Formulation

3 Algorithm and Theoretical Analysis

Proposition 1

Proposition 2

Proposition 3

4 Numerical Experiments

5 Discussion

Spectral partitioning of time-varying networks with unobserved edges