A spectral method for bipartizing a network and detecting a large anti-community

A. Concas; S. Noschese; L. Reichel; and G. Rodriguez

arXiv:1812.08408·math.NA·September 21, 2021·J. Comput. Appl. Math.

TL;DR

This paper introduces a spectral method to approximate networks by bipartite structures and detect large anti-communities, aiding in understanding complex network relationships.

Contribution

It presents a novel spectral algorithm that efficiently finds the closest bipartite network and detects large anti-communities within a given network.

Findings

1. Successfully approximates networks by bipartite structures
2. Identifies large anti-communities in networks
3. Provides an efficient optimization-based algorithm

Tables

Table 1: Results for $\xi=10^{-2}$, $\eta=10^{-4}$.

(256,128)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	1.22e-16	1.89e-16	2.33e-03
$ℰ_{B}$	5.46e-04	6.74e-04	3.72e-03
$ℰ_{A}$	2.80e-01	2.79e-01	-
$ℰ_{N}$	1.45e-01	1.58e-01	2.76e-01
$T$	4.94e-02	5.05e-02	3.15e-04
(512,256)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	1.11e-17	1.11e-17	2.98e-03
$ℰ_{B}$	1.13e-04	1.50e-04	3.39e-03
$ℰ_{A}$	4.84e-02	6.27e-02	-
$ℰ_{N}$	3.36e-02	5.96e-02	2.97e-01
$T$	2.77e-01	2.95e-01	4.94e-04
(1024,512)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	7.77e-17	0.00e+00	4.17e-02
$ℰ_{B}$	9.92e-05	2.11e-04	4.75e-03
$ℰ_{A}$	1.06e-01	1.80e-01	-
$ℰ_{N}$	3.62e-02	1.15e-01	2.75e-01
$T$	1.92e+00	1.94e+00	8.67e-04

Table 2: Results for $\xi=10^{-2}$, $\eta=10^{-5}$.

(256,128)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	1.11e-17	7.77e-17	1.68e-06
$ℰ_{B}$	6.68e-04	8.79e-04	3.36e-03
$ℰ_{A}$	2.70e-01	2.68e-01	-
$ℰ_{N}$	1.23e-01	1.49e-01	2.58e-01
$T$	4.39e-02	4.70e-02	1.01e-03
(512,256)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	0.00e+00	2.22e-17	1.40e-04
$ℰ_{B}$	3.05e-05	1.91e-05	8.81e-04
$ℰ_{A}$	3.88e-02	2.38e-02	-
$ℰ_{N}$	1.87e-02	1.93e-02	3.16e-01
$T$	2.72e-01	2.77e-01	5.12e-04
(1024,512)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	0.00e+00	0.00e+00	4.04e-03
$ℰ_{B}$	1.91e-07	1.03e-05	1.07e-03
$ℰ_{A}$	1.73e-04	9.49e-03	-
$ℰ_{N}$	9.77e-05	9.47e-03	3.25e-01
$T$	1.91e+00	1.89e+00	9.52e-04

Table 3: Results for $\xi=10^{-1}$, $\eta=10^{-4}$.

(256,128)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	0.00e+00	0.00e+00	7.71e-02
$ℰ_{B}$	0.00e+00	5.83e-04	1.35e-02
$ℰ_{A}$	2.43e-02	4.24e-02	-
$ℰ_{N}$	0.00e+00	2.58e-02	3.18e-01
$T$	5.56e-02	6.05e-02	3.07e-03
(512,256)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	0.00e+00	0.00e+00	1.44e-01
$ℰ_{B}$	0.00e+00	8.19e-04	8.01e-03
$ℰ_{A}$	8.02e-03	5.19e-02	-
$ℰ_{N}$	0.00e+00	4.47e-02	3.31e-01
$T$	2.77e-01	2.76e-01	1.08e-03
(1024,512)	specbip- $n_{1}$	specbip	red-black
$ℐ_{B}$	0.00e+00	0.00e+00	2.60e-01
$ℰ_{B}$	0.00e+00	1.04e-03	6.54e-03
$ℰ_{A}$	2.33e-03	9.04e-02	-
$ℰ_{N}$	0.00e+00	8.71e-02	3.28e-01
$T$	2.02e+00	2.07e+00	3.99e-03

Equations

$$a_{i,j}=\begin{cases}w_{k}, & \text{if there is an edge } e_{k} \text{ between the nodes } v_{i} \text{ and } v_{j} \text{ with weight } w_{k},\\ 0, & \text{otherwise.}\end{cases}$$

$$b_{s}=\frac{\operatorname{trace}(\exp(-A))}{\operatorname{trace}(\exp(A))}.$$

$$A_{B}=\begin{bmatrix}O_{n_{1}} & C\\ C^{T} & O_{n_{2}}\end{bmatrix},$$

$$\sigma(A_{B})=\{\lambda_{1},\dots,\lambda_{n_{2}},\underbrace{0,\dots,0}_{n_{1}-n_{2}},-\lambda_{n_{2}},\dots,-\lambda_{1}\},$$

$$\mathcal{D}=\operatorname{diag}(D,O_{n_{1}-n_{2}},-D),$$

$$Q=\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix},$$

$$A_{B}=Q\mathcal{D}Q^{T},$$

$$\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix}\operatorname{diag}(D,O_{n_{1}-n_{2}},-D)\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix}^{T}.$$

$$A_{B}=\begin{bmatrix}U_{1} & U_{1}\\ V & -V\end{bmatrix}\begin{bmatrix}D & 0\\ 0 & -D\end{bmatrix}\begin{bmatrix}U_{1} & U_{1}\\ V & -V\end{bmatrix}^{T},$$

$$\biggl(\sum_{i=1}^{\ell}(\alpha_{i}-\beta_{i})^{2}\biggr)^{1/2},$$

$$(\alpha_{1}-\beta_{2})^{2}+(\alpha_{2}-\beta_{1})^{2}\leq(\alpha_{1}-\beta_{1})^{2}+(\alpha_{2}-\beta_{2})^{2}$$

$$\alpha_{2}(\beta_{1}-\beta_{2})\geq\alpha_{1}(\beta_{1}-\beta_{2}).$$

$$\beta_{j}=\begin{cases}\frac{1}{2}(\alpha_{j}-\alpha_{n-j+1}),\quad&j=1,2,\dots,n_{2},\\ 0,\quad&j=n_{2}+1,\dots,n_{1},\\ -\beta_{n-j+1},\quad&j=n_{1}+1,\dots,n,\end{cases}$$

$$\beta_{j}-\beta_{j+1}=\begin{cases}\frac{1}{2}(\alpha_{j}-\alpha_{j+1})+\frac{1}{2}(\alpha_{n-j}-\alpha_{n-j+1}),\quad&1\leq j\leq n_{2}-1,\\ \frac{1}{2}(\alpha_{n_{2}}-\alpha_{n_{1}+1}),\quad&j=n_{2},\\ 0,\quad&n_{2}+1\leq j\leq n_{1}-1,\\ \beta_{n_{2}},\quad&j=n_{1},\\ \beta_{n-j}-\beta_{n-j+1},\quad&n_{1}+1\leq j\leq n-1,\end{cases}$$

$$\begin{cases}\min_{\beta}\Bigl((\alpha_{j}-\beta)^{2}+(\alpha_{n-j+1}+\beta)^{2}\Bigr),\quad&1\leq j\leq n_{2},\\ \min_{\beta}\,(\beta^{2}),\quad&n_{2}+1\leq j\leq n_{1}.\end{cases}$$

$$A_{B}=W_{B}\Lambda_{B}W_{B}^{T},\qquad\Lambda_{B}=\operatorname{diag}(\lambda_{1}^{(B)},\lambda_{2}^{(B)},\dots,\lambda_{n}^{(B)}),$$

$$\lambda_{1}^{(B)}\geq\lambda_{2}^{(B)}\geq\dots\geq\lambda_{n}^{(B)}.$$

$$W_{B}=\begin{bmatrix}U_{1} & U_{2} & U_{1}Z\\ V & O & -VZ\end{bmatrix},\qquad\Lambda_{B}=\begin{bmatrix}D & O & O\\ O & O_{n_{1}-n_{2}} & O\\ O & O & -ZDZ\end{bmatrix},$$

$$Z=\begin{bmatrix}O & & 1\\ & \iddots & \\ 1 & & O\end{bmatrix}\in\mathbb{R}^{n_{2}\times n_{2}}.$$

$$A=W\Lambda W^{T},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\lambda_{2},\dots,\lambda_{n}),$$

$$\lambda_{1}\geq\lambda_{2}\geq\dots\geq\lambda_{n}.$$

$$W=\begin{bmatrix}W_{11} & W_{12} & W_{13}\\ W_{21} & W_{22} & W_{23}\end{bmatrix}.$$

$$\min_{\substack{U_{1}^{T}U_{1}=V^{T}V=\frac{1}{2}I_{n_{2}}\\ U_{2}^{T}U_{2}=\frac{1}{2}I_{n_{1}-n_{2}}}}\left\lVert\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O & -V\end{bmatrix}-\begin{bmatrix}W_{11} & W_{12} & W_{13}Z\\ W_{21} & W_{22} & W_{23}Z\end{bmatrix}\right\rVert_{F},$$

$$\min_{U_{1}^{T}U_{1}=\frac{1}{2}I_{n_{2}}}\left\{\lVert U_{1}-W_{11}\rVert_{F}^{2}+\lVert U_{1}-W_{13}Z\rVert_{F}^{2}\right\},$$

$$\min_{V^{T}V=\frac{1}{2}I_{n_{2}}}\left\{\lVert V-W_{21}\rVert_{F}^{2}+\lVert V+W_{23}Z\rVert_{F}^{2}\right\},$$

$$\min_{U_{2}^{T}U_{2}=\frac{1}{2}I_{n_{1}-n_{2}}}\lVert U_{2}-W_{12}\rVert_{F}^{2}.$$

$$\min_{X_{1}^{T}X_{1}=I_{n_{2}}}\left\{\lVert X_{1}-\sqrt{2}\,W_{11}\rVert_{F}^{2}+\lVert X_{1}-\sqrt{2}\,W_{13}Z\rVert_{F}^{2}\right\}.$$

$$\min_{X^{T}X=I}\lVert X-W\rVert_{F}^{2}.$$

$$\min_{X^{T}X=I}\left\{\operatorname{trace}(X^{T}X)-2\operatorname{trace}(X^{T}W)+\operatorname{trace}(W^{T}W)\right\}.$$

$$\min_{X^{T}X=I}\left\{-\operatorname{trace}(X^{T}W)\right\}.$$

$$\min_{X_{1}^{T}X_{1}=I_{n_{2}}}\left\{-\operatorname{trace}\bigl(X_{1}^{T}(W_{11}+W_{13}Z)\bigr)\right\}.$$

Full text

A spectral method for bipartizing a network and detecting a large anti-community

A. Concas (footnotes 1, 2, 3, 4)

S. Noschese (footnotes 2, 4)

L. Reichel (footnote 5)

G. Rodriguez (footnotes 1, 2, 4)

Footnotes:

1. Partially supported by the Fondazione di Sardegna 2017 research project “Algorithms for Approximation with Applications (Acube)” and the Regione Autonoma della Sardegna research project “Algorithms and Models for Imaging Science (AMIS)” (funded with FSC 2014-2020 resources, Patto per lo Sviluppo della Regione Sardegna).

2. Partially supported by the INdAM-GNCS research project “Metodi numerici per problemi mal posti”.

3. Anna Concas gratefully acknowledges Sardinia Regional Government for the financial support of her Ph.D. scholarship (P.O.R. Sardegna F.S.E. Operational Programme of the Autonomous Region of Sardinia, European Social Fund 2014-2020 - Axis III Education and Formation, Objective 10.5, Line of Activity 10.5.12).

4. Member of the INdAM Research group GNCS.

5. Partially supported by NSF grants DMS-1729509 and DMS-1720259.

Department of Mathematics and Computer Science, University of Cagliari,

Viale Merello, 92, 09123 Cagliari, Italy

Department of Mathematics “Guido Castelnuovo”, Sapienza University of Rome,

P.le A. Moro, 2, I-00185 Roma, Italy

Department of Mathematical Sciences, Kent State University, Kent, OH 44242, USA

Abstract

Relations between discrete quantities such as people, genes, or streets can be described by networks, which consist of nodes that are connected by edges. Network analysis aims to identify important nodes in a network and to uncover structural properties of a network. A network is said to be bipartite if its nodes can be subdivided into two nonempty sets such that there are no edges between nodes in the same set. It is a difficult task to determine the closest bipartite network to a given network. This paper describes how a given network can be approximated by a bipartite one by solving a sequence of fairly simple optimization problems. The algorithm also produces a node permutation which makes the possible bipartite nature of the initial adjacency matrix evident, and identifies the two sets of nodes. We finally show how the same procedure can be used to detect the presence of a large anti-community in a network and to identify it.

Keywords: network analysis, network approximation, bipartization, anti-community

MSC: 65F15, 05C50, 05C82.

Journal: Journal of Computational and Applied Mathematics

1 Introduction

Networks describe how discrete quantities such as genes, people, proteins, or streets are related. They arise in many applications, including genetics, epidemiology, energy distribution, and telecommunication; see, e.g., [7, 17] for discussions on networks and their applications. Networks are represented by graphs $\mathcal{G}=\{\mathcal{V},\mathcal{E},\mathcal{W}\}$ , which are determined by a set of vertices (nodes) $\mathcal{V}=\{v_{i}\}_{i=1}^{n}$ , a set of edges $\mathcal{E}=\{e_{k}\}_{k=1}^{m}$ , and a set of positive weights $\mathcal{W}=\{w_{k}\}_{k=1}^{m}$ . Here $e_{k}=(i_{k},j_{k})$ represents an edge from vertex $v_{i_{k}}$ to vertex $v_{j_{k}}$ . The weight $w_{k}$ is associated with the edge $e_{k}$ ; a large value of $w_{k}>0$ indicates that edge $e_{k}$ is important. For instance, in a road network, the weight $w_{k}$ may be proportional to the amount of traffic on the road that is represented by the edge $e_{k}$ . In this paper, we consider connected undirected graphs without self-loops and multiple edges. In particular, all edges represent “two-way streets,” i.e., if $(i_{k},j_{k})$ is an edge, then so is $(j_{k},i_{k})$ . The weights associated with these edges are assumed to be the same. In unweighted graphs all weights are set to one.

We will represent a graph $\mathcal{G}$ with $n$ nodes by its adjacency matrix $A=[a_{i,j}]_{i,j=1}^{n}$ , where

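$$a_{i,j}=\begin{cases}w_{k}, & \text{if there is an edge } e_{k} \text{ between the nodes } v_{i} \text{ and } v_{j} \text{ with weight } w_{k},\\ 0, & \text{otherwise.}\end{cases}$$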

Since $\mathcal{G}$ is undirected and the weights associated with each direction of an edge are the same, the matrix $A$ is symmetric. The largest possible number of edges of an undirected graph with $n$ nodes without self-loops is $n^{2}-n$ , but typically the actual number of edges, $m$ , of such graphs that arise in applications is much smaller. The adjacency matrix $A$ , therefore, is generally very sparse.
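As a minimal sketch of the definition above, the adjacency matrix of an undirected weighted graph can be assembled from an edge list. The function name `adjacency_matrix`, the 0-based node indices, and the example edges are illustrative assumptions and do not come from the paper.

```python
def adjacency_matrix(n, weighted_edges):
    """Build the symmetric adjacency matrix of an undirected weighted graph.

    `weighted_edges` holds tuples (i, j, w) with 0-based node indices and w > 0.
    """
    A = [[0.0] * n for _ in range(n)]
    for i, j, w in weighted_edges:
        if i == j:
            raise ValueError("self-loops are excluded")
        if w <= 0:
            raise ValueError("weights must be positive")
        A[i][j] = w  # edge from node i to node j
        A[j][i] = w  # same weight in the opposite direction
    return A


# Hypothetical example: a weighted path 0 - 1 - 2 - 3.
edges = [(0, 1, 2.0), (1, 2, 1.0), (2, 3, 0.5)]
A = adjacency_matrix(4, edges)
```

A dense list of lists is used only for readability; for the sparse matrices mentioned above, a sparse storage format would be the natural choice.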

A graph $\mathcal{G}$ is said to be bipartite if the set of vertices $\mathcal{V}$ that make up the graph can be partitioned into two disjoint nonempty subsets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ (with $\mathcal{V}=\mathcal{V}_{1}\cup\mathcal{V}_{2}$ ), such that any edge starting at a vertex in $\mathcal{V}_{1}$ points to a vertex in $\mathcal{V}_{2}$ , and vice versa. This, in particular, excludes the presence of self-loops in a bipartite graph.

Bipartivity is an important structural property. It has been studied also as the $2$ -coloring problem [3]. In fact determining if a graph can be colored with 2 colors is equivalent to determining whether or not the graph is bipartite, and thus testing if a network is bipartite or not is computable in linear time using breadth-first or depth-first search algorithms. It is therefore interesting to determine a bipartite approximation of a non-bipartite graph, or measure the distance of a non-bipartite graph from being bipartite. We say that a splitting of the set of vertices $\mathcal{V}$ of a weighted undirected graph $\mathcal{G}$ into two disjoint nonempty subsets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ (with $\mathcal{V}=\mathcal{V}_{1}\cup\mathcal{V}_{2}$ ), is a best bipartization of $\mathcal{G}$ if the sum of the weights $w_{k}$ associated with edges $e_{k}=(i,j)$ that point from vertices $v_{i}$ in $\mathcal{V}_{\ell}$ ( $\ell=1,2$ ) to vertices $v_{j}$ in the same set $\mathcal{V}_{\ell}$ is minimal. Such edges $e_{k}$ are called “frustrated”, and computing the minimum number of edges whose deletion makes the graph bipartite is an NP-hard optimization problem [25]. We remark that the above definition is analogous to the definition of a best bipartization of an undirected unweighted graph proposed by Estrada and Gómez–Gardeñes [8], where the spectral bipartivity index of a network with adjacency matrix $A$ is defined as

$$b_{s}=\frac{\operatorname{trace}(\exp(-A))}{\operatorname{trace}(\exp(A))}.$$

This measure also can be applied to the weighted graphs considered in the present paper.
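The two notions discussed above, testing bipartivity by 2-coloring and measuring the weight of the frustrated edges of a given splitting, can be sketched as follows. This is an illustration under stated assumptions: the helper names `two_color` and `frustrated_weight` are hypothetical, and the breadth-first search scans rows of a dense matrix, so it is not the linear-time variant that an adjacency-list representation would give.

```python
from collections import deque


def two_color(A):
    """Return a 0/1 coloring if the graph is bipartite, otherwise None."""
    n = len(A)
    color = [None] * n
    for start in range(n):  # loop over connected components
        if color[start] is not None:
            continue
        color[start] = 0
        queue = deque([start])
        while queue:
            u = queue.popleft()
            for v in range(n):
                if A[u][v] == 0:
                    continue
                if color[v] is None:
                    color[v] = 1 - color[u]
                    queue.append(v)
                elif color[v] == color[u]:
                    return None  # an edge joins two nodes of the same color
    return color


def frustrated_weight(A, color):
    """Sum of the weights of edges whose endpoints lie in the same set."""
    n = len(A)
    return sum(A[i][j] for i in range(n) for j in range(i + 1, n)
               if color[i] == color[j])


square = [[0, 1, 0, 1], [1, 0, 1, 0], [0, 1, 0, 1], [1, 0, 1, 0]]
triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]
print(two_color(square))                      # [0, 1, 0, 1]
print(two_color(triangle))                    # None
print(frustrated_weight(triangle, [0, 1, 1])) # 1
```

For the triangle, every splitting leaves at least one frustrated edge, which is why a best bipartization has to be sought by optimization rather than by coloring.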

The problem of discovering approximately bipartite structures in graphs and networks has been considered by various authors. Most popular approaches are based on the eigendecomposition of the Laplacian and signless Laplacian matrices. Other spectral approaches consider the adjacency matrix associated to the graph. In the case of a symmetric bipartite adjacency matrix, the signs of the entries of an eigenvector associated with the smallest eigenvalue can be used to partition the graph, i.e., nodes that correspond to positive entries belong to one set, and nodes that correspond to negative entries belong to the other set; see [21]. In case the smallest eigenvalue is multiple, the splitting of the nodes may vary according to the considered vector in the associated eigenspace. In [16] the presence of $\pm$ pairs in the spectrum of the adjacency matrix of a bipartite graph is exploited in order to identify approximated bipartite structures within protein-protein interaction undirected networks; see also [23] for a spectral approach that can be used to discover approximately bipartite substructures in directed graphs.

We are interested in developing a numerical method for determining a “good” bipartization $({\mathcal{V}}_{1},{\mathcal{V}}_{2})$, i.e., a bipartization for which the sum of the weights $w_{k}$ associated with the frustrated edges $e_{k}=(i,j)$, that is, the edges that join two vertices $v_{i}$ and $v_{j}$ in the same set $\mathcal{V}_{\ell}$ ($\ell=1,2$), is fairly small. The algorithm is approximate, or “heuristic”, in the sense that it does not necessarily produce the best possible bipartization.

As will be made clear below, the same bipartization method may be used to identify large anti-communities. A community is a group of nodes that are highly connected among themselves, but are less connected to the rest of the network, or isolated from it. Conversely, an anti-community is a node set that is loosely connected internally, but has many external connections [9]; see [10], where a spectral method is used to detect communities and anti-communities. Community and anti-community detection in networks is an important problem with applications in various fields, including physics, computer science, and social sciences [5, 15, 18, 19, 24]. Although the identification of communities is predominant in the investigation of meso-scale structures in networks, the detection of the so-called core-periphery structures, whose most popular notion was developed by Borgatti and Everett [4], continues to attract interest in the mathematical community as well; see also [20]. For our purposes, the identification of a single large anti-community can be understood as that of a core-periphery structure in the given network.

This paper is organized as follows. Section 2 discusses some properties of bipartite graphs and Section 3 describes an algorithm for determining a “good” bipartization. An application of the bipartization method to the identification of large anti-communities is discussed in Section 4. Finally, Section 5 presents computed examples and two case studies, while Section 6 contains concluding remarks.

2 Approximating the spectral structure of a bipartite graph

This section discusses some properties of the adjacency matrix for an undirected bipartite graph. Some inequalities that are useful for the design of our bipartization method also will be introduced. The discussion in the first part of the section assumes that the vertices are suitably ordered. Subsequently, we will describe how to achieve such an ordering.

Assume for the moment that the undirected graph $\mathcal{G}=\{\mathcal{V},\mathcal{E},\mathcal{W}\}$ is bipartite, i.e., its vertex set $\mathcal{V}$ can be split into two disjoint nonempty subsets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ with $n_{1}$ and $n_{2}$ nodes, respectively, such that there are no edges between the nodes in $\mathcal{V}_{1}$ and between the nodes in $\mathcal{V}_{2}$ . We may assume that $n_{1}\geq n_{2}$ , otherwise we interchange the sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ .

Let the vertices in the set $\mathcal{V}$ be ordered so that the first $n_{1}$ of them belong to the set $\mathcal{V}_{1}$ and the remaining $n_{2}$ vertices belong to $\mathcal{V}_{2}$ . Then the adjacency matrix for the graph $\mathcal{G}$ is of the form

$$A_{B}=\begin{bmatrix}O_{n_{1}} & C\\ C^{T} & O_{n_{2}}\end{bmatrix},\tag{2.1}$$

where $O_{k}$ denotes the $k\times k$ zero matrix, and $C=[c_{i,j}]\in{\mathbb{R}}^{n_{1}\times n_{2}}$ with $c_{i,j}>0$ if the node $v_{i}$ in $\mathcal{V}_{1}$ is connected to the node $v_{n_{1}+j}$ in $\mathcal{V}_{2}$ ; otherwise $c_{i,j}=0$ .

We adapt to our notation a known result in graph theory; see, e.g., [1, Theorem 3.14].

Proposition 2.1.

Let $\mathcal{G}$ be an unweighted graph with $n$ nodes. Then $\mathcal{G}$ is bipartite and the adjacency matrix can be partitioned as in (2.1) if and only if the spectrum of the adjacency matrix is symmetric with respect to the origin, i.e.,

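$$\sigma(A_{B})=\{\lambda_{1},\dots,\lambda_{n_{2}},\underbrace{0,\dots,0}_{n_{1}-n_{2}},-\lambda_{n_{2}},\dots,-\lambda_{1}\},$$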

for some integers $n_{1}\geq n_{2}$ and non-negative numbers $\lambda_{1}\geq\lambda_{2}\geq\cdots\geq\lambda_{n_{2}}$ . The claim holds true also for weighted graphs, as long as the weights are positive.

Proof.

For the sake of clarity, we give a quick sketch of the proof. The necessary condition is straightforward. The sufficient condition can be proved by noting that, for $k=0,1,\ldots$ , $\operatorname{trace}(A_{B}^{2k+1})=0$ if the spectrum is symmetric. Then, the positivity of the weights implies that $(A_{B}^{2k+1})_{i,i}=0$ , that is, the graph is bipartite since it does not contain odd cycles. ∎
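The key step of the proof sketch, namely that the traces of all odd powers of the adjacency matrix vanish for a bipartite graph and that a nonzero diagonal entry of an odd power reveals an odd cycle, can be checked numerically on small unweighted examples. The helper names below are hypothetical, and exact integer arithmetic is used so that the zeros are exact.

```python
def matmul(X, Y):
    """Plain triple-loop matrix product for small dense matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def odd_power_traces(A, kmax):
    """Return [trace(A^1), trace(A^3), ..., trace(A^(2*kmax+1))]."""
    A2 = matmul(A, A)
    P = [row[:] for row in A]  # current odd power, starting with A^1
    traces = []
    for _ in range(kmax + 1):
        traces.append(sum(P[i][i] for i in range(len(A))))
        P = matmul(P, A2)      # move on to the next odd power
    return traces


square = [[0, 1, 0, 1], [1, 0, 1, 0], [0, 1, 0, 1], [1, 0, 1, 0]]  # bipartite
triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]                        # odd cycle
print(odd_power_traces(square, 2))    # [0, 0, 0]
print(odd_power_traces(triangle, 2))  # [0, 6, 30]
```

The triangle has eigenvalues 2, -1, -1, so the traces of its third and fifth powers are 8 - 1 - 1 = 6 and 32 - 1 - 1 = 30, in agreement with the output.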

Remark 2.2.

Under the assumption of Proposition 2.1, it is immediate to verify that if $\lambda$ is a nonzero eigenvalue of $A_{B}$ and $\mathbf{q}=\left[\begin{smallmatrix}\mathbf{x}\\ \mathbf{y}\end{smallmatrix}\right]$ , with $\mathbf{x}\in{\mathbb{R}}^{n_{1}}$ and $\mathbf{y}\in{\mathbb{R}}^{n_{2}}$ , is an associated eigenvector, then $\left(-\lambda,\left[\begin{smallmatrix}\mathbf{x}\\ -\mathbf{y}\end{smallmatrix}\right]\right)$ is an eigenpair, too. This implies that $\lambda$ is a singular value of the block $C$ in (2.1), while $\mathbf{x}$ and $\mathbf{y}$ are its left and right singular vectors, respectively, if scaled to be of unit length.

Let $n=n_{1}+n_{2}$ with $n_{1}\geq n_{2}\geq 1$ . Then, the above observation gives us the possibility to describe the spectral structure of $A_{B}$ in terms of the singular value decomposition of $C$ ; see also [12, Section 8.6.1]. Let $C=X\widetilde{D}Y^{T}$ be a singular value decomposition of $C$ , where $\widetilde{D}\in{\mathbb{R}}^{n_{1}\times n_{2}}$ has $D=\operatorname{diag}(\lambda_{1},\dots,\lambda_{n_{2}})$ as its upper block, and $X=[X_{1},X_{2}]\in{\mathbb{R}}^{n_{1}\times n_{1}}$ and $Y\in{\mathbb{R}}^{n_{2}\times n_{2}}$ are orthogonal matrices with $X_{1}\in{\mathbb{R}}^{n_{1}\times n_{2}}$ . Introduce the diagonal matrix

$$\mathcal{D}=\operatorname{diag}(D,O_{n_{1}-n_{2}},-D),$$

and the orthogonal matrix

$$Q=\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix},$$

where $U_{1}=\frac{1}{\sqrt{2}}X_{1}$ , $U_{2}=X_{2}$ , and $V=\frac{1}{\sqrt{2}}Y$ , with $U_{1}^{T}U_{1}=V^{T}V=\frac{1}{2}I_{n_{2}}$ and $U_{2}^{T}U_{2}=\frac{1}{2}I_{n_{1}-n_{2}}$ . Then, the spectral factorization

$$A_{B}=Q\mathcal{D}Q^{T},$$

takes the form

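$$\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix}\operatorname{diag}(D,O_{n_{1}-n_{2}},-D)\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O_{n_{2},n_{1}-n_{2}} & -V\end{bmatrix}^{T}.\tag{2.5}$$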

In the special case when $n_{1}=n_{2}$ , the submatrices of (2.3) with $n_{1}-n_{2}$ columns disappear, and the spectral factorization (2.5) simplifies to

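$$A_{B}=\begin{bmatrix}U_{1} & U_{1}\\ V & -V\end{bmatrix}\begin{bmatrix}D & 0\\ 0 & -D\end{bmatrix}\begin{bmatrix}U_{1} & U_{1}\\ V & -V\end{bmatrix}^{T},$$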

with $U_{1}U_{1}^{T}=VV^{T}=\frac{1}{2}I_{n_{1}}$ .

Now, let $A$ be an adjacency matrix of an undirected graph. We would like to approximate the graph by a bipartite one, and therefore seek to approximate $A$ by a matrix of the form $A_{B}$ . We do this in several steps and first show some inequalities that are applicable to diagonal eigenvalue matrices.

Proposition 2.3.

Let $\alpha_{1}\geq\alpha_{2}\geq\dots\geq\alpha_{\ell}$ be a nonincreasing real sequence and let $\beta_{1},\beta_{2},\dots,\beta_{\ell}$ be another real sequence. The distance between these sequences measured in the least squares sense,

$$\biggl(\sum_{i=1}^{\ell}(\alpha_{i}-\beta_{i})^{2}\biggr)^{1/2},\tag{2.6}$$

is minimal if and only if the $\beta_{i}$ are in nonincreasing order, i.e., if $\beta_{1}\geq\beta_{2}\geq\dots\geq\beta_{\ell}$ .

Proof.

Assume that both sequences are in nonincreasing order and that the distance can be reduced by changing the order of the $\beta_{i}$ . Consider the pairs $(\alpha_{1},\beta_{1})$ and $(\alpha_{2},\beta_{2})$ . Then

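$$(\alpha_{1}-\beta_{2})^{2}+(\alpha_{2}-\beta_{1})^{2}\leq(\alpha_{1}-\beta_{1})^{2}+(\alpha_{2}-\beta_{2})^{2}$$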

is equivalent to

$$\alpha_{2}(\beta_{1}-\beta_{2})\geq\alpha_{1}(\beta_{1}-\beta_{2}).$$

Assume $\beta_{1}>\beta_{2}$ . Then $\alpha_{2}\geq\alpha_{1}$ , which is a contradiction unless $\alpha_{1}=\alpha_{2}$ . If the $\beta_{j}$ are ordered arbitrarily, then we can reorder these coefficients pairwise until they form a nonincreasing sequence. Each pairwise swap reduces (2.6). ∎
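Proposition 2.3 can be checked by brute force on a short example: among all orderings of the $\beta_{i}$, the nonincreasing one gives the smallest distance to a nonincreasing sequence $\alpha_{i}$. The sequences below are arbitrary illustrative values, not data from the paper.

```python
from itertools import permutations


def distance(alpha, beta):
    """Least squares distance between two sequences of equal length."""
    return sum((a - b) ** 2 for a, b in zip(alpha, beta)) ** 0.5


alpha = [3.0, 1.5, 0.0, -2.0]  # nonincreasing
beta = [0.5, -1.0, 2.0, 1.0]   # arbitrary order

# Try every ordering of beta and keep the one closest to alpha.
best = min(permutations(beta), key=lambda p: distance(alpha, p))
print(list(best))  # [2.0, 1.0, 0.5, -1.0]
```

Because the entries of both sequences are distinct here, the minimizing ordering is unique and coincides with the sorted one.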

In our application of Proposition 2.3, we let $\alpha_{1}\geq\alpha_{2}\geq\dots\geq\alpha_{n}$ be the eigenvalues of the adjacency matrix $A\in{\mathbb{R}}^{n\times n}$ . The graph associated with this matrix might not be bipartite. We would like the sequence of eigenvalues of the matrix $A_{B}\in{\mathbb{R}}^{n\times n}$ , given by (2.1), to be close to the sequence $\alpha_{1},\alpha_{2},\dots,\alpha_{n}$ and appear in $\pm$ pairs. By Proposition 2.3, we know that the eigenvalues $\beta_{1},\beta_{2},\dots,\beta_{n}$ of $A_{B}$ should be in nonincreasing order, and by Proposition 2.1 they vanish or appear in $\pm$ pairs. We know from (2.5) that at least $n_{1}-n_{2}$ eigenvalues of $A_{B}$ should be zero.

Proposition 2.4.

Let $\{\alpha_{j}\}_{j=1}^{n}$ , with $n=n_{1}+n_{2}$ and $n_{1}\geq n_{2}$ , be a real nonincreasing sequence. Then the sequence $\{\beta_{j}\}_{j=1}^{n}$ with elements

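$$\beta_{j}=\begin{cases}\frac{1}{2}(\alpha_{j}-\alpha_{n-j+1}),\quad&j=1,2,\dots,n_{2},\\ 0,\quad&j=n_{2}+1,\dots,n_{1},\\ -\beta_{n-j+1},\quad&j=n_{1}+1,\dots,n,\end{cases}\tag{2.7}$$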

is the closest sequence to $\{\alpha_{j}\}_{j=1}^{n}$ in the least squares sense consisting of at least $n_{1}-n_{2}$ zeros and nonvanishing entries appearing in $\pm$ pairs.

Proof.

The sequence $\{\beta_{j}\}_{j=1}^{n}$ consists of $n_{1}-n_{2}$ zero values and $n_{2}$ $\pm$ pairs. Indeed, we have

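$$\beta_{j}-\beta_{j+1}=\begin{cases}\frac{1}{2}(\alpha_{j}-\alpha_{j+1})+\frac{1}{2}(\alpha_{n-j}-\alpha_{n-j+1}),\quad&1\leq j\leq n_{2}-1,\\ \frac{1}{2}(\alpha_{n_{2}}-\alpha_{n_{1}+1}),\quad&j=n_{2},\\ 0,\quad&n_{2}+1\leq j\leq n_{1}-1,\\ \beta_{n_{2}},\quad&j=n_{1},\\ \beta_{n-j}-\beta_{n-j+1},\quad&n_{1}+1\leq j\leq n-1,\end{cases}$$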

and it follows that the sequence is nonincreasing. It remains to establish that the $\beta_{j}$ defined by (2.7) are the best possible. Consider the minimization problems

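$$\begin{cases}\min_{\beta}\Bigl((\alpha_{j}-\beta)^{2}+(\alpha_{n-j+1}+\beta)^{2}\Bigr),\quad&1\leq j\leq n_{2},\\ \min_{\beta}\,(\beta^{2}),\quad&n_{2}+1\leq j\leq n_{1}.\end{cases}\tag{2.8}$$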

The solution sequence $\{\beta_{j}\}_{j=1}^{n}$ is given by (2.7). Thus, the $\beta_{j}$ form a nonincreasing sequence consisting of $n_{1}-n_{2}$ zero values and $n_{2}$ $\pm$ pairs. It is the closest such sequence to the sequence $\{\alpha_{j}\}_{j=1}^{n}$ in the sense that it solves the minimization problems (2.8). ∎
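The construction in Proposition 2.4 translates directly into a few lines of code. The sketch below assumes 0-based list indices, so that entry `j` of the list corresponds to index $j+1$ in the proposition; the function name and the sample values are illustrative.

```python
def closest_pm_sequence(alpha, n1, n2):
    """Closest sequence to alpha with n1 - n2 zeros and n2 plus/minus pairs."""
    n = n1 + n2
    if len(alpha) != n or n1 < n2:
        raise ValueError("need len(alpha) == n1 + n2 and n1 >= n2")
    beta = [0.0] * n                  # the middle n1 - n2 entries stay zero
    for j in range(n2):
        # Pair the j-th largest entry with the j-th smallest one.
        beta[j] = 0.5 * (alpha[j] - alpha[n - 1 - j])
        beta[n - 1 - j] = -beta[j]
    return beta


alpha = [3.0, 1.0, 0.5, -0.5, -2.0]   # nonincreasing, n = 5
print(closest_pm_sequence(alpha, 3, 2))  # [2.5, 0.75, 0.0, -0.75, -2.5]
```

Each pair is obtained by averaging a positive entry with the negated entry at the mirrored position, which is the minimizer of the corresponding scalar least squares problem.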

We would like to determine an approximation of the matrix $A$ by a matrix of the form (2.1), where we allow row and column permutations of the latter matrix. Define the spectral factorization

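$$A_{B}=W_{B}\Lambda_{B}W_{B}^{T},\qquad\Lambda_{B}=\operatorname{diag}(\lambda_{1}^{(B)},\lambda_{2}^{(B)},\dots,\lambda_{n}^{(B)}),$$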

where $W_{B}$ is an orthogonal matrix and the eigenvalues are ordered according to

$$\lambda_{1}^{(B)}\geq\lambda_{2}^{(B)}\geq\dots\geq\lambda_{n}^{(B)}.$$

We remark that only the first $n_{1}$ eigenvalues are ordered as in (2.4).

Let us initially assume that the nonzero eigenvalues are distinct. If the eigenvectors are made unique, e.g., by making their first component positive, a comparison with (2.5) shows that

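$$W_{B}=\begin{bmatrix}U_{1} & U_{2} & U_{1}Z\\ V & O & -VZ\end{bmatrix},\qquad\Lambda_{B}=\begin{bmatrix}D & O & O\\ O & O_{n_{1}-n_{2}} & O\\ O & O & -ZDZ\end{bmatrix},$$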

where $Z$ is the flip matrix

$$Z=\begin{bmatrix}O & & 1\\ & \iddots & \\ 1 & & O\end{bmatrix}\in\mathbb{R}^{n_{2}\times n_{2}}.$$

In the presence of multiple nonzero eigenvalues, the corresponding eigenvectors are not uniquely determined, so the spectral factorization (2.9) is only one of several possible distinct factorizations.

Let

$$A=W\Lambda W^{T},\qquad\Lambda=\operatorname{diag}(\lambda_{1},\lambda_{2},\dots,\lambda_{n}),$$

be a spectral factorization of $A$ with an orthogonal eigenvector matrix $W$ and the eigenvalues ordered according to

$$\lambda_{1}\geq\lambda_{2}\geq\dots\geq\lambda_{n}.$$

Partition the eigenvector matrix $W$ conformally with the eigenvector matrix $W_{B}$ of $A_{B}$ , i.e.,

$$W=\begin{bmatrix}W_{11} & W_{12} & W_{13}\\ W_{21} & W_{22} & W_{23}\end{bmatrix}.$$

We would like to approximate the eigenvector matrix $W$ of $A$ by the eigenvector matrix $W_{B}$ of $A_{B}$. This suggests that we solve the minimization problem

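$$\min_{\substack{U_{1}^{T}U_{1}=V^{T}V=\frac{1}{2}I_{n_{2}}\\ U_{2}^{T}U_{2}=\frac{1}{2}I_{n_{1}-n_{2}}}}\left\lVert\begin{bmatrix}U_{1} & U_{2} & U_{1}\\ V & O & -V\end{bmatrix}-\begin{bmatrix}W_{11} & W_{12} & W_{13}Z\\ W_{21} & W_{22} & W_{23}Z\end{bmatrix}\right\rVert_{F},$$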

where $\left\lVert\cdot\right\rVert_{F}$ denotes the Frobenius norm. This problem splits into the three independent problems

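$$\min_{U_{1}^{T}U_{1}=\frac{1}{2}I_{n_{2}}}\left\{\lVert U_{1}-W_{11}\rVert_{F}^{2}+\lVert U_{1}-W_{13}Z\rVert_{F}^{2}\right\},\tag{2.13}$$

$$\min_{V^{T}V=\frac{1}{2}I_{n_{2}}}\left\{\lVert V-W_{21}\rVert_{F}^{2}+\lVert V+W_{23}Z\rVert_{F}^{2}\right\},\tag{2.14}$$

$$\min_{U_{2}^{T}U_{2}=\frac{1}{2}I_{n_{1}-n_{2}}}\lVert U_{2}-W_{12}\rVert_{F}^{2}.\tag{2.15}$$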

Problem (2.13) can be written as

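$$\min_{X_{1}^{T}X_{1}=I_{n_{2}}}\left\{\lVert X_{1}-\sqrt{2}\,W_{11}\rVert_{F}^{2}+\lVert X_{1}-\sqrt{2}\,W_{13}Z\rVert_{F}^{2}\right\}.\tag{2.16}$$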

The following result shows how we can easily solve this problem.

Proposition 2.5.

The solution of problem (2.16) can be determined by computing the singular value decomposition of $W_{11}+W_{13}Z$ and setting all singular values to one.

Proof.

Consider the problem

$$\min_{X^{T}X=I}\lVert X-W\rVert_{F}^{2}.$$

It can be written as

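$$\min_{X^{T}X=I}\left\{\operatorname{trace}(X^{T}X)-2\operatorname{trace}(X^{T}W)+\operatorname{trace}(W^{T}W)\right\}.$$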

The first and last terms are independent of $X$ . Therefore we obtain the equivalent linear minimization problem

$$\min_{X^{T}X=I}\left\{-\operatorname{trace}(X^{T}W)\right\}.$$

Similarly, the linear problem associated to the minimization problem (2.16) is given by

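$$\min_{X_{1}^{T}X_{1}=I_{n_{2}}}\left\{-\operatorname{trace}\bigl(X_{1}^{T}(W_{11}+W_{13}Z)\bigr)\right\}.$$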

Hence, the problem (2.16) is equivalent to determining the closest orthogonal matrix in the Frobenius norm to the matrix $W_{11}+W_{13}Z$ . The solution is given by computing the singular value decomposition $P\Sigma Q^{T}$ of $W_{11}+W_{13}Z$ and setting $X_{1}=PQ^{T}$ ; see [13, Theorem 4.1] for a proof of the latter statement. ∎
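The closest matrix with orthonormal columns, $PQ^{T}$, can also be approximated without an explicit singular value decomposition. The sketch below deliberately swaps in the Newton-Schulz iteration $X\leftarrow X(\frac{3}{2}I-\frac{1}{2}X^{T}X)$, which is not the procedure described in the proof; it is used here only so that the example needs nothing beyond matrix products. It assumes that the input has full column rank, and the function names and the sample matrix are hypothetical.

```python
def matmul(X, Y):
    """Plain triple-loop matrix product for small dense matrices."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]


def transpose(X):
    return [list(col) for col in zip(*X)]


def polar_factor(M, iterations=60):
    """Approximate P Q^T, where M = P Sigma Q^T, by Newton-Schulz steps."""
    fro = sum(x * x for row in M for x in row) ** 0.5
    X = [[x / fro for x in row] for row in M]  # singular values now in (0, 1]
    for _ in range(iterations):
        XXtX = matmul(X, matmul(transpose(X), X))
        # Each singular value s is mapped to 1.5*s - 0.5*s**3, which tends to 1.
        X = [[1.5 * X[i][j] - 0.5 * XXtX[i][j] for j in range(len(X[0]))]
             for i in range(len(X))]
    return X


M = [[2.0, 0.5], [-0.5, 1.0]]  # stands in for W11 + W13 Z
X1 = polar_factor(M)
gram = matmul(transpose(X1), X1)  # should be close to the identity
```

In practice the singular value decomposition named in Proposition 2.5 is the more robust choice, in particular when the matrix is close to rank deficient, since the iteration above then converges slowly.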

The minimization problems (2.14) and (2.15) are solved similarly. This gives the eigenvector matrix in the spectral factorization (2.5).

Remark 2.6.

We note that if $P\Sigma Q^{T}$ denotes the singular value decomposition of $W_{11}+W_{13}Z$ , then we can express its polar decomposition by

$$W_{11}+W_{13}Z=(PQ^{T})(Q\Sigma Q^{T}).$$

Since the first factor $PQ^{T}$ is the minimizer of (2.17), the deviation of $Q\Sigma Q^{T}$ from the identity matrix measures the quality of the approximation.

Remark 2.7.

If some of the nonzero eigenvalues of $A$ in (2.10) are multiple, the corresponding columns of $W_{11}$, $W_{21}$, $W_{13}$, and $W_{23}$ are not uniquely determined. Nevertheless, when approximating $W_{11}+W_{13}Z$ by $X_{1}$, and $W_{21}-W_{23}Z$ by $Y$, any other admissible choice of those columns consists of linear combinations of the original ones, so the columns span the same subspace. Hence, the approximations $X_{1}$ and $Y$ still make the factorization (2.9) valid.

3 A spectral bipartization method

We give here an outline of a spectral bipartization method, based on the above results. It exploits the spectral structure (2.5) of a bipartite graph to determine a node permutation that separates the two sets ${\mathcal{V}}_{1}$ and ${\mathcal{V}}_{2}$, and to construct a bipartite approximation to a connected undirected graph $\mathcal{G}$ that has a perturbed bipartite structure. The algorithm is exact whenever the input is the adjacency matrix of a bipartite graph; however, it must be regarded as “heuristic”, since we were not able to prove a complete convergence result for it beyond the spectrum approximation results of Section 2.

There are three problems at hand: estimating the cardinality of the sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ , suitably ordering the nodes in $\mathcal{G}$ , and, finally, approximating the adjacency matrix by a matrix of the form (2.1). Let $A$ be the adjacency matrix of $\mathcal{G}$ , and assume the spectral factorization

[TABLE]

is available, where $W$ is an orthogonal matrix and the eigenvalues are ordered by increasing absolute value.

The first step of our algorithm consists of finding the cardinality $n_{1}$ and $n_{2}$ of the two disjoint node sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ , unless they are known in advance. We do this by identifying the number of eigenvalues that are approximately zero.

In principle, this could be done by detecting how many eigenvalues have absolute value larger than a fixed tolerance, but this process is extremely sensitive to the choice of the tolerance. In our numerical experimentation, we found it to be more reliable to detect the largest gap between “small” and “large” eigenvalues.

To do this, we compute the ratios

[TABLE]

Then, for suitably chosen constants $R$ and $\tau$ , we consider the index set

[TABLE]

In our experiments, we set $R=10^{2}$ and $\tau=10^{-8}$ .

An index $i$ is in $\mathcal{J}$ if there is a significant gap between $\lambda_{i}$ and $\lambda_{i+1}$ ($\rho_{i}>R$), and $\lambda_{i+1}$ is numerically nonzero ($|\lambda_{i+1}|>\tau$). If the set $\mathcal{J}$ is empty, then we are not able to identify a partition of the nodes, and we consider the cardinality of the sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ to be the same. Otherwise, we let $k$ be the index defined by

[TABLE]

and set

$$n_{1}=\left\lceil\frac{n+k}{2}\right\rceil,\qquad n_{2}=n-n_{1},$$

where $\left\lceil x\right\rceil$ denotes the closest integer to the real number $x$ .

The above approach is clearly not completely robust. It is easy to trick it by constructing particular numerical examples, for example by letting $C$ in (2.1) have singular values that decay to zero exponentially, or by introducing large gaps in the spectrum of the adjacency matrix. Nevertheless, we found the procedure quite accurate on networks stemming from real-world applications; see, e.g., Figures 6 and 8 in Section 5.

In order to avoid overflow, it may be preferable to use the reciprocal ratios $\rho_{i}^{-1}$ . This is not required in our Matlab implementation, given the features of the programming language.
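The cardinality estimate of this first step can be sketched in a few lines of Python. This is a minimal illustration, not the authors' Matlab code. It assumes that the ratios are $\rho_{i}=|\lambda_{i+1}|/|\lambda_{i}|$ , that $k$ is the admissible index with the largest ratio, and that $n_{1}$ is the integer nearest to $(n+k)/2$ . The function name `estimate_cardinalities` is hypothetical.

```python
def estimate_cardinalities(eigenvalues, R=1e2, tau=1e-8):
    """Estimate (n1, n2) from the dominant gap in the sorted |eigenvalues|."""
    lam = sorted(abs(x) for x in eigenvalues)  # nondecreasing absolute values
    n = len(lam)
    candidates = []  # pairs (ratio, number of "small" eigenvalues)
    for i in range(n - 1):
        if lam[i + 1] <= tau:
            continue  # lam[i + 1] is numerically zero: not a valid gap
        # Guard the division: an exactly zero denominator means an infinite ratio.
        ratio = float("inf") if lam[i] == 0.0 else lam[i + 1] / lam[i]
        if ratio > R:
            candidates.append((ratio, i + 1))
    if not candidates:
        n1 = (n + 1) // 2  # no gap found: split the nodes evenly
        return n1, n - n1
    # Assumption: among the admissible gaps, take the one with the largest ratio.
    _, k = max(candidates, key=lambda c: c[0])
    n1 = (n + k + 1) // 2  # nearest integer to (n + k) / 2
    return n1, n - n1


# Star graph with three leaves: eigenvalues are +sqrt(3), -sqrt(3), 0, 0.
print(estimate_cardinalities([3 ** 0.5, -(3 ** 0.5), 0.0, 0.0]))
```

For the star graph the two zero eigenvalues give $k=2$ , hence $n_{1}=3$ and $n_{2}=1$ , which matches the three leaves and the single centre.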

The subsequent step is to find the sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ , and reorder the nodes. Assume that $\mathcal{G}$ is bipartite, but that the adjacency matrix $A$ corresponds to a random ordering of the nodes, so that

[TABLE]

for a permutation matrix $\Pi$ and a matrix $A_{B}$ of the form (2.1). In this case, the spectral factorization (2.4) becomes

[TABLE]

i.e., the rows of the eigenvector matrix are permuted. In order to recover the structure of the eigenvectors, let us partition the eigenvector matrix as in

[TABLE]

with $W_{1},W_{3}\in{\mathbb{R}}^{n\times n_{2}}$ and $W_{2}\in{\mathbb{R}}^{n\times(n_{1}-n_{2})}$ .

Assume first that $n_{1}>n_{2}\geq 1$ and consider the matrix block $W_{2}$ . For (2.9) to be valid, the last $n_{2}$ rows of $W_{2}$ must vanish. Sorting the 1-norms of its rows in descending order concentrates the smallest entries in the lower block of $W_{2}$ . Applying the corresponding permutation $\sigma$ to the rows of $W$ brings this matrix to the form (2.9) and the adjacency matrix to the form (2.1), with the block $C$ possibly permuted.

When $n_{1}=n_{2}$ , the block $W_{2}$ is empty, so we consider the matrix $W_{1}-W_{3}Z$ instead. As its first $n_{1}$ rows should be exactly zero, we sort the 1-norms of its rows in ascending order, and apply the resulting permutation $\sigma$ to the rows of $W$ .

After the reordering, the first $n_{1}$ nodes are in the set $\mathcal{V}_{1}$ , and the remaining $n_{2}$ are in the set $\mathcal{V}_{2}$ . We note that applying the permutation $\sigma$ to the rows and columns of the initial adjacency matrix $A$ highlights the presence in the graph of an approximate bipartite structure.
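The reordering for the case $n_{1}>n_{2}$ can be illustrated on a tiny example. The sketch below is only an illustration under stated assumptions: the null-space block $W_{2}$ is supplied by hand for a star graph whose centre is node 2, rather than computed by an eigensolver, and the function name `partition_from_null_block` is hypothetical.

```python
from math import sqrt


def partition_from_null_block(W2, n1):
    """Sort nodes by decreasing 1-norm of the rows of W2; the first n1 form V1."""
    norms = [sum(abs(x) for x in row) for row in W2]
    sigma = sorted(range(len(W2)), key=lambda i: -norms[i])
    return sigma, sigma[:n1], sigma[n1:]


# Star graph on 4 nodes, centre = node 2, leaves = nodes 0, 1, 3.
# Its null space is spanned by vectors that vanish at the centre and
# sum to zero over the leaves; the columns below are an orthonormal basis.
s2, s6 = 1 / sqrt(2), 1 / sqrt(6)
W2 = [
    [s2, s6],         # node 0 (leaf)
    [-s2, s6],        # node 1 (leaf)
    [0.0, 0.0],       # node 2 (centre): its row vanishes
    [0.0, -2 * s6],   # node 3 (leaf)
]
sigma, V1, V2 = partition_from_null_block(W2, n1=3)
print(sorted(V1), V2)
```

The row of the centre has zero norm, so it is moved to the end of the ordering and forms the set $\mathcal{V}_{2}$ by itself.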

To finally obtain an approximation of the matrix (2.1) by the computed spectral factorization, we first approximate the eigenvector matrix $W_{B}$ by solving problem (2.12), and then approximate the eigenvalues in (2.10) by scalars that appear in $\pm$ pairs using Proposition 2.4. Specifically, we let the $\alpha_{j}$ in the proposition be the eigenvalues (2.11). The $\beta_{j}$ defined in the proposition are the eigenvalues of the matrix $D$ in (2.5), in the same order.

The above procedure, outlined in Algorithm 1, determines the eigenvectors and eigenvalues of a matrix $A_{B}$ with the block structure

$$A_{B}=\begin{bmatrix}0&C\\ C^{T}&0\end{bmatrix},\tag{3.4}$$

where the matrix $C$ has real entries. The matrix $A_{B}$ may have a different number of nonzero entries than $A$ . Moreover, its nonzero entries need not all be positive. We can handle this issue in several ways:

1. Allow $A_{B}$ to be an adjacency matrix for a weighted graph with both positive and negative weights.

2. Allow $A_{B}$ to be an adjacency matrix for a weighted graph with positive weights. We achieve this by replacing the matrix $C$ in (3.4) by the closest matrix, $C_{+}$ , in the Frobenius norm with nonnegative entries. The matrix $C_{+}$ is obtained from $C$ by setting all negative entries to zero.

3. Require $A_{B}$ to represent an unweighted graph. The closest such matrix in the Frobenius norm to the matrix (3.4) is obtained by setting every entry of $C$ to the closest member of the set $\{0,1\}$ .

The last procedure is the one adopted in the numerical experiments presented in Section 5.
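The second and third options act entrywise on the block $C$ , so they are easy to sketch. The following Python fragment is a minimal illustration with hypothetical function names; the tie-breaking rule for entries equal to $0.5$ (rounded up here) is an assumption, since either choice gives the same Frobenius distance.

```python
def nonnegative_part(C):
    """Closest matrix with nonnegative entries: negative entries are set to zero."""
    return [[max(c, 0.0) for c in row] for row in C]


def nearest_binary(C):
    """Closest matrix with entries in {0, 1}; ties at 0.5 are rounded up."""
    return [[1 if c >= 0.5 else 0 for c in row] for row in C]


C = [[0.9, -0.2],
     [0.4, 1.3]]
C_plus = nonnegative_part(C)
C_bin = nearest_binary(C)
print(C_plus, C_bin)
```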

Algorithm 1 can be applied only to small to medium sized problems, i.e., when it is possible to compute a full spectral factorization of $A$ . For larger problems, one may reduce the computational cost by omitting the third step of the algorithm. Indeed, when $n_{1}-n_{2}$ is not too large, a partial spectral factorization may suffice to construct a basis for the null space of $A$ , that is, to obtain the matrix $W_{2}$ . This would allow one to generate the permutation $\sigma$ that brings the adjacency matrix to an almost bipartite form, identifying the two sets ${\mathcal{V}}_{1}$ and ${\mathcal{V}}_{2}$ .

4 Anti-communities

Let us consider a symmetric matrix $A$ of size $n=n_{1}+n_{2}$ with a zero leading square block of size $n_{1}$ . Then, $A$ may be considered the adjacency matrix of a network with an anti-community of $n_{1}$ nodes. The matrix has the form

$$A=\begin{bmatrix}0&C\\ C^{T}&B\end{bmatrix},\tag{4.1}$$

with $C$ of size $n_{1}\times n_{2}$ and $B$ a square matrix of order $n_{2}$ . In the following, we denote by $\mathcal{N}(C)$ the null space of $C$ , by $\mathcal{R}(C)$ its range, and by $B_{|\mathcal{N}(C)}$ the restriction of the submatrix $B$ to $\mathcal{N}(C)$ .

Theorem 4.1.

Let $A$ be as in (4.1) and let $\mathbf{x}=\left[\begin{smallmatrix}\mathbf{x}_{1}\\ \mathbf{x}_{2}\end{smallmatrix}\right]$ be partitioned consistently with $A$ . Then the equation

$$A\mathbf{x}=\mathbf{0}\tag{4.2}$$

has $\nu=\dim{\mathcal{N}(C^{T})}$ linearly independent solutions with $\mathbf{x}_{2}=0$ . Moreover, if

[TABLE]

then there are also $d$ linearly independent solutions to $A\mathbf{x}=\mathbf{0}$ with $\mathbf{x}_{2}\neq 0$ , so that $\dim{\mathcal{N}(A)}=d+\nu$ .

Proof.

Let $k=\operatorname{rank}(C)$ and consider the case $n_{1}>n_{2}=k$ . Let us search for vectors $\mathbf{x}$ such that $A\mathbf{x}=\mathbf{0}$ . Then we have

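$$A\mathbf{x}=\mathbf{0}\quad\Longleftrightarrow\quad\begin{cases}C\mathbf{x}_{2}=\mathbf{0},\\ C^{T}\mathbf{x}_{1}+B\mathbf{x}_{2}=\mathbf{0}.\end{cases}\tag{4.3}$$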

Since $C$ is of full rank and $n_{1}>n_{2}$ , it follows from $C\mathbf{x}_{2}=0$ that $\mathbf{x}_{2}=0$ and, hence, $C^{T}\mathbf{x}_{1}=0$ . The latter implies that $\mathbf{x}_{1}$ is in the null space of $C^{T}$ , which has dimension $n_{1}-n_{2}$ . Thus, the matrix $A$ admits the following linearly independent eigenvectors corresponding to the eigenvalue $\lambda=0$ ,

[TABLE]

where $\mathbf{u}_{i}$ , $i=1,2,\ldots,n_{1}$ , are the left singular vectors of $C$ . Hence, $\lambda=0$ has multiplicity $n_{1}-n_{2}=\dim{\mathcal{N}(C^{T})}$ .

Let us now assume that $k=n_{1}<n_{2}$ . Then $A$ may or may not have zero eigenvalues. Indeed, for $A$ to have a vanishing eigenvalue, the vector $\mathbf{x}_{2}\in{\mathbb{R}}^{n_{2}}$ that appears in (4.3) has to belong to the null space of $C$ , which has dimension $n_{2}-n_{1}$ . Then, there will be zero eigenvalues if and only if the system

[TABLE]

has a solution.

If instead $k=n_{1}=n_{2}$ , i.e., if $C$ is nonsingular, then $\lambda=0$ implies that both $\mathbf{x}_{1}=0$ and $\mathbf{x}_{2}=0$ . Hence, $\mathbf{x}=\left[\begin{smallmatrix}\mathbf{x}_{1}\\ \mathbf{x}_{2}\end{smallmatrix}\right]=0$ , and all the eigenvalues of $A$ are different from zero.

We finally turn to the case when the submatrix $C$ is rank deficient, that is, $k<\min\{n_{1},n_{2}\}$ . The right-hand side of (4.3) is equivalent to

[TABLE]

Let $\mathbf{x}$ be a nontrivial solution of (4.2). When $\mathbf{x}_{2}=\mathbf{0}$ , there has to be a vector $\mathbf{x}_{1}\neq\mathbf{0}$ with $C^{T}\mathbf{x}_{1}=\mathbf{0}$ . Since in this case the null space of $C^{T}$ has dimension $n_{1}-k$ , there are $n_{1}-k$ linearly independent solutions of (4.2) with $\mathbf{x}_{2}=\mathbf{0}$ .

The existence of a solution $\mathbf{x}$ of (4.2) with a nonzero subvector $\mathbf{x}_{2}$ is equivalent to

[TABLE]

This condition does not hold for most matrix pairs $(B,C)$ . ∎

Remark 4.2.

We note that if $B=0$ , then the equation $A\mathbf{x}=\mathbf{0}$ has exactly

$$\dim\mathcal{N}(C)+\dim\mathcal{N}(C^{T})=n_{1}+n_{2}-2\operatorname{rank}(C)$$

linearly independent solutions.
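The dimension counts in Theorem 4.1 and Remark 4.2 can be checked on a small example. The Python sketch below is an illustration only: it computes ranks by Gaussian elimination in exact rational arithmetic, and the helper names `rank` and `anti_community_matrix` are hypothetical. The block $C$ is chosen with full column rank, so that $\mathcal{N}(C)=\{\mathbf{0}\}$ and only solutions with $\mathbf{x}_{2}=\mathbf{0}$ exist.

```python
from fractions import Fraction


def rank(M):
    """Rank of a matrix by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in M]
    r = 0
    ncols = len(rows[0]) if rows else 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue  # no pivot in this column
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def anti_community_matrix(C, B):
    """Assemble [[0, C], [C^T, B]] from an n1-by-n2 block C and an n2-by-n2 block B."""
    n1, n2 = len(C), len(C[0])
    top = [[0] * n1 + list(C[i]) for i in range(n1)]
    bottom = [[C[i][j] for i in range(n1)] + list(B[j]) for j in range(n2)]
    return top + bottom


C = [[1, 0], [1, 0], [0, 1]]          # n1 = 3, n2 = 2, rank(C) = 2
n1, n2 = len(C), len(C[0])

A0 = anti_community_matrix(C, [[0, 0], [0, 0]])   # B = 0 (bipartite case)
A1 = anti_community_matrix(C, [[0, 1], [1, 0]])   # one edge inside the second set

nullity_A0 = (n1 + n2) - rank(A0)
nullity_A1 = (n1 + n2) - rank(A1)
print(nullity_A0, nullity_A1)
```

In both cases the null space has dimension $n_{1}-\operatorname{rank}(C)=1$ : with $B=0$ this agrees with $n_{1}+n_{2}-2\operatorname{rank}(C)$ , and with $B\neq 0$ it equals $\nu=\dim\mathcal{N}(C^{T})$ because $\mathcal{N}(C)$ is trivial.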

Theorem 4.1 shows that if a network has a large anti-community ( $n_{1}>n_{2}$ ), the spectral decomposition $A=WDW^{T}$ has the form

[TABLE]

The structures of $W$ and $D$ are very similar to those of $W_{B}$ and $\Lambda_{B}$ in (2.9), respectively. For this reason, the bipartization algorithm described in Section 3 is able to detect the presence of a large anti-community and to order the nodes so that the adjacency matrix takes the form (4.1). In case a group of nodes is only approximately an anti-community, the algorithm produces an adjacency matrix that approximates (4.1).

To summarize, when $n_{1}>n_{2}$ , if a network is either bipartite or contains a large anti-community, its adjacency matrix has zero eigenvalues; the converse is not true. If $A$ has a multiple zero eigenvalue, then we can recognize the presence of one of the two above cases by observing the structure of the eigenvector matrix.

5 Computed examples

In the following numerical experiments, we fix the integers $n_{1}$ and $n_{2}$ , and construct a random matrix $A$ of the form (2.1), with a sparse block $C$ with density $\xi$ . The matrix is first perturbed, by replacing its (1,1) and (2,2) blocks by sparse matrices of appropriate size and density $\eta$ , and then “scrambled”, by applying the same random permutation to its rows and columns.
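A test matrix of this kind can be generated along the following lines. This is a hedged sketch in plain Python, not the generator used for the reported experiments: each potential edge is drawn independently with probability $\xi$ or $\eta$ , which is one possible reading of “density”, and the function name `make_test_matrix` is hypothetical.

```python
import random


def make_test_matrix(n1, n2, xi, eta, seed=0):
    """Perturbed and scrambled bipartite adjacency matrix, plus the permutation used."""
    rng = random.Random(seed)
    n = n1 + n2
    A = [[0] * n for _ in range(n)]
    # Off-diagonal block C with density xi (the bipartite part).
    for i in range(n1):
        for j in range(n1, n):
            if rng.random() < xi:
                A[i][j] = A[j][i] = 1
    # Perturbation: edges inside each of the two node sets, with density eta.
    for lo, hi in ((0, n1), (n1, n)):
        for i in range(lo, hi):
            for j in range(i + 1, hi):
                if rng.random() < eta:
                    A[i][j] = A[j][i] = 1
    # Scramble: apply the same random permutation to rows and columns.
    perm = list(range(n))
    rng.shuffle(perm)
    S = [[A[perm[i]][perm[j]] for j in range(n)] for i in range(n)]
    return S, perm


S, perm = make_test_matrix(6, 3, xi=0.5, eta=0.0, seed=1)
```

Row `i` of the scrambled matrix corresponds to the original node `perm[i]`, so the returned permutation can be used to undo the scrambling when comparing results.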

We apply the algorithm of Section 3 to the matrix $A$ either by supplying the cardinality of the two sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ (this approach is referred to as specbip- $n_{1}$ ), or letting the method estimate $n_{1}$ and $n_{2}$ from the data; we refer to the latter approach as specbip. Since the block (1,2) of the matrix returned by the method is generally permuted with respect to the initial test matrix, the rows and columns are reordered according to the original sequence of the nodes. The final reordering allows us to compare the resulting matrix $A_{B}$ to the test matrix $A$ .

Our results are compared to the ones obtained by red-black ordering using the MatlabBGL library [11], a Matlab package implementing graph algorithms. A matrix has a red-black ordering if the corresponding graph is bipartite. To find a bipartite ordering, this software uses a breadth-first search algorithm, starting from an arbitrary vertex. The partition of the nodes is determined by forming a group containing all the vertices having even distance from the root, and another group with the vertices at odd distance from the root. This procedure is designed for bipartite networks; it is not meant to produce an approximation when the bipartization is not exact.
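The breadth-first idea behind red-black ordering can be sketched as follows. This is a plain Python illustration of the even/odd distance rule described above, not the MatlabBGL code; the function names are hypothetical and the graph is assumed to be connected.

```python
from collections import deque


def red_black_partition(adj, root=0):
    """Split nodes by parity of their BFS distance from root (connected graph)."""
    dist = {root: 0}
    queue = deque([root])
    while queue:
        u = queue.popleft()
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)
    red = sorted(v for v, d in dist.items() if d % 2 == 0)
    black = sorted(v for v, d in dist.items() if d % 2 == 1)
    return red, black


def conflicting_edges(adj, red):
    """Count edges whose endpoints fall in the same group."""
    red = set(red)
    return sum(1 for u in adj for v in adj[u] if u < v and (u in red) == (v in red))


# Path 0-1-2-3 is bipartite; adding the edge (0, 2) creates a triangle.
path = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2]}
perturbed = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}

red_p, black_p = red_black_partition(path)
red_q, black_q = red_black_partition(perturbed)
print(conflicting_edges(path, red_p), conflicting_edges(perturbed, red_q))
```

On the exact bipartite path no edge joins two nodes of the same group, whereas a single extra edge already leaves one conflicting edge, since the parity rule has no mechanism for choosing the partition that minimizes such conflicts.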

Figure 1 displays the results for a test matrix with $(n_{1},n_{2})=(512,256)$ , sparsity $\xi=10^{-2}$ , and perturbation $\eta=10^{-4}$ . In particular, it reports in the upper row a spy plot of the original test matrix, the perturbed version, with random arcs in the (1,1) and (2,2) blocks, and the permuted matrix that is fed to the bipartization methods. The bottom row shows the reconstructed networks. The specbip- $n_{1}$ approach, which receives the information about the cardinality of the node sets, produces the matrix closest to the original. The general algorithm estimates the cardinalities $(\tilde{n}_{1},\tilde{n}_{2})=(492,276)$ , according to the number of “small” eigenvalues; see Figure 2, where the absolute values of the eigenvalues are displayed in nondecreasing order. This algorithm produces a slightly less accurate approximation than the previous one, which is nevertheless much better than the matrix produced by the red-black ordering.

Figure 3 shows the results for a test matrix similar to the previous one, but with a larger perturbation $\eta=10^{-3}$ . The estimation of $(n_{1},n_{2})$ is inaccurate, but the approximation produced by the specbip methods is quite close to the unperturbed matrix, while the red-black ordering matrix is far from it.

Now, let

[TABLE]

where $E_{11}$ and $E_{22}$ are square matrices of size $n_{1}$ and $n_{2}$ , respectively, and let $|M|$ denote the number of nonzero elements of $M$ . To evaluate the quality of the results, we consider the following three indices

[TABLE]

The first two indices measure the distance of $A_{B}$ from the adjacency matrix of a bipartite graph; see (1.1) for the definition of $b_{s}$ . The third index measures the approximation error with respect to the starting bipartite network (2.1). To better evaluate the error in the bipartition, we introduce the fourth index $\mathcal{E}_{N}=\widetilde{\mathcal{E}}_{N}/n_{1}$ , where $\widetilde{\mathcal{E}}_{N}$ is the number of nodes from the set ${\mathcal{V}}_{1}$ that were incorrectly ascribed to the set ${\mathcal{V}}_{2}$ .
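The fourth index is simple enough to state in code. The sketch below is a minimal Python illustration with a hypothetical function name; it assumes that the true set ${\mathcal{V}}_{1}$ and the computed set ${\mathcal{V}}_{2}$ are given as collections of node labels.

```python
def bipartition_error(true_V1, computed_V2):
    """Fraction of the nodes of V1 that were ascribed to V2."""
    true_V1 = set(true_V1)
    misplaced = true_V1 & set(computed_V2)
    return len(misplaced) / len(true_V1)


# Node 3 belongs to V1 but was placed in V2: one error out of n1 = 4 nodes.
error = bipartition_error(true_V1=[0, 1, 2, 3], computed_V2=[3, 4, 5])
print(error)
```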

Tables 1, 2, and 3 report the average values of the above four quality indices over 10 realizations of the random test networks. Three different pairs $(n_{1},n_{2})$ are considered; each table refers to different densities $\xi$ and $\eta$ ; $T$ stands for the execution time in seconds.

A comparison of the tables shows that the spectral bipartization algorithm is always more accurate than the red-black ordering method. At the same time, it is much slower than the MatlabBGL function, as in our experiments we compute the whole spectrum of the adjacency matrix, without exploiting its sparsity. To be competitive with existing methods for large-scale problems, the spectral method should be modified to perform its task by suitable iterative methods that take advantage of the structure of the adjacency matrix.

From the tables, it can also be observed that knowing in advance the cardinality of the two sets $\mathcal{V}_{1}$ and $\mathcal{V}_{2}$ leads in some cases to a substantial improvement in the quality of the results.

To further investigate the behavior of the bipartition error, we construct a matrix $A$ of the form (2.1), letting $n_{1}=512$ and $n_{2}=256$ , with a sparse random block $C$ having density $\xi=10^{-2}$ . After randomly permuting the rows and columns, we apply our algorithms to this matrix, as well as to those perturbed by replacing the (1,1) and (2,2) blocks by a sparse matrix with density $\eta=10^{-6},10^{-5},\ldots,10^{-3}$ . The graph on the left of Figure 4 shows the value of the bipartization error $\mathcal{E}_{N}$ obtained when the three methods are applied to an unweighted graph; the one on the right corresponds to a weighted graph. All values are averaged over 10 realizations of the random matrices. Both graphs show that the bipartization determined by our approaches is closer to the correct one than that obtained by red-black ordering, with specbip- $n_{1}$ producing slightly better results. The performance of all algorithms degrades as the perturbation becomes less sparse.

In Figure 5, we display the value of $\mathcal{E}_{N}$ for the same examples, for a fixed $\eta=10^{-2}$ , and letting the density $\xi$ of the block $C$ take values in $[10^{-3},1]$ . The red-black ordering method is more accurate than the specbip algorithm for very sparse networks, while providing the correct cardinality of the set $\mathcal{V}_{1}$ to specbip- $n_{1}$ produces the best results.

5.1 The NDyeast network

We illustrate the performance of the spectral bipartization algorithm when applied to the detection of anti-communities by analyzing a case study. The NDyeast network describes the protein interaction network for yeast, each edge representing an interaction between two proteins [14]. The data set is available at [2]. In this section we analyze this network, testing the presence of a bipartization or of a large anti-community.

The NDyeast network has 2114 nodes. There are 74 self-loops (nodes connected only to themselves) and 268 nodes disconnected from the network. The adjacency matrix obtained by removing both the self-loops and the isolated nodes has size $n=1846$ and 149 connected components. They were identified by the getconcomp function from the PQser Matlab toolbox [6].

In the case of a reducible adjacency matrix, the spectral bipartization algorithm should treat each single connected component one at a time. Since most of the components in the NDyeast network are very small, often just 2 or 3 nodes, we consider the only component with more than 10 nodes, which has 1458 nodes. We process the reduced adjacency matrix $A$ with our bipartization method.
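The preprocessing described above (dropping self-loops and isolated nodes, then selecting the largest connected component) can be sketched in plain Python. This is an illustration only and is not the getconcomp function of the PQser toolbox; the function name `largest_component` and the edge-list input format are assumptions.

```python
from collections import deque


def largest_component(edges):
    """Node set of the largest connected component, ignoring self-loops."""
    adj = {}
    for u, v in edges:
        if u == v:
            continue  # drop self-loops; nodes left without edges never enter adj
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    seen, best = set(), set()
    for start in adj:
        if start in seen:
            continue
        component = {start}
        queue = deque([start])
        while queue:  # breadth-first exploration of one component
            u = queue.popleft()
            for v in adj[u]:
                if v not in component:
                    component.add(v)
                    queue.append(v)
        seen |= component
        if len(component) > len(best):
            best = component
    return best


# Node 3 has only a self-loop; nodes 4 and 5 form a small separate component.
component = largest_component([(0, 1), (1, 2), (3, 3), (4, 5)])
print(sorted(component))
```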

The algorithm determines $n_{0}=564$ zero eigenvalues (see Figure 6) and identifies two sets of nodes with cardinalities $n_{1}=1011$ and $n_{2}=447$ .
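These cardinalities are consistent with interpreting the number of zero eigenvalues as the difference $n_{1}-n_{2}$ , as in the full-rank case $n_{1}>n_{2}=\operatorname{rank}(C)$ treated in Section 4. Under that assumption, and with $n=1458$ nodes in the component, a short check gives

$$n_{1}-n_{2}=n_{0}=564,\qquad n_{1}+n_{2}=n=1458\quad\Longrightarrow\quad n_{1}=\frac{1458+564}{2}=1011,\qquad n_{2}=1458-1011=447.$$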

The starting adjacency matrix is displayed in the top-left spy plot of Figure 7. The top-right plot shows the same matrix after the ordering produced by the spectral bipartization algorithm is applied to its rows and columns. This graph clearly displays that there is a large group of nodes in the NDyeast network that do not communicate much among themselves, that is, an anti-community. In the same graph we show the bipartization detected by the algorithm by means of red lines.

Our algorithm can also be applied by supplying the values of $(n_{1},n_{2})$ , rather than estimating them from the number of zero eigenvalues. If we do this by setting $\tilde{n}_{1}=800$ and $\tilde{n}_{2}=658$ , we obtain the bottom-left graph in the same figure. It shows that, in the group of the first 800 proteins, only four directly interact.

The bottom-right graph of Figure 7 displays the result of the red-black ordering method, which does not supply any useful information.

We remark that a data set similar to NDyeast (but different) is available at [2]. It is called simply yeast, it consists of 2361 nodes, and it refers to the paper [22]. By processing this data set with our spectral algorithm, we obtain results very similar to the ones displayed in Figure 7.

5.2 The geom network

We also applied the spectral bipartization algorithm to a weighted graph, namely, the geom network. It is extracted from the Computational Geometry Database geombib by B. Jones (version 2002). Nodes represent authors; the value of the entry $(i,j)$ of the adjacency matrix is the number of papers coauthored by authors $i$ and $j$ . The data set is available at [2].

The geom network has 7343 nodes and 11898 edges. After removing 1185 isolated nodes, the network has 875 connected components, the largest of which has 3621 nodes. We applied the bipartization method to the adjacency matrix associated with this component.

The eigenvalues are displayed in Figure 8: 533 of them are detected as being numerically zero, and the cardinalities of the two node sets are $n_{1}=2077$ and $n_{2}=1544$ . The left graph of Figure 9 reports the spy plot of the original adjacency matrix; the graph on the right shows the matrix reordered by the spectral bipartization algorithm. The graph highlights the presence of an anti-community of about 1000 authors, who did not collaborate with each other when writing papers.

6 Conclusion

This paper describes how an approximate bipartization of a given graph can be determined by solving a sequence of simple optimization problems. The technique can also be applied to detect anti-communities. Computed examples illustrate the performance of the method described.

Acknowledgment

The authors would like to thank the referees for comments.

Bibliography


[1] R. B. Bapat, Graphs and Matrices, Springer, London, 2010.
[2] V. Batagelj and A. Mrvar, Pajek data sets (2006). Available at http://vlado.fmf.uni-lj.si/pub/networks/data/
[3] J. A. Bondy and U. S. R. Murty, Graph Theory with Applications, Macmillan, London, 1976.
[4] S. P. Borgatti and M. G. Everett, Models of core/periphery structures, Social Networks, 21 (1999), pp. 375–395.
[5] L. Chen, Q. Yu, and B. Chen, Anti-modularity and anti-community detecting in complex networks, Inf. Sci., 275 (2014), pp. 293–313.
[6] A. Concas, C. Fenu, and G. Rodriguez, PQser: A Matlab package for spectral seriation, Numer. Algorithms, 80 (2019), pp. 879–902.
[7] E. Estrada, The Structure of Complex Networks: Theory and Applications, Oxford University Press, Oxford, 2011.
[8] E. Estrada and J. Gómez–Gardeñes, Network bipartivity and the transportation efficiency of European passenger airlines, Physica D, 323-324 (2016), pp. 57–63.
