Identifying vital nodes based on reverse greedy method

Tao Ren; Zhe Li; Yi Qi; Yixin Zhang; Simiao Liu; Yanjie Xu; Tao Zhou

arXiv:1907.01388·physics.soc-ph·July 3, 2019

TL;DR

This paper introduces a reverse greedy method for identifying vital nodes in networks. The least important nodes are chosen first, so nodes chosen later rank as more important for maintaining connectivity. On ten real networks, the method outperforms well-known existing methods.

Contribution

The paper proposes a novel reverse greedy approach for vital node identification that demonstrates superior performance over existing methods.

Findings

01. Reverse greedy method outperforms state-of-the-art techniques

02. Method effectively identifies nodes critical for network connectivity

03. Empirical results on ten real networks validate the approach

Abstract

The identification of vital nodes that maintain the network connectivity is a long-standing challenge in network science. In this paper, we propose a so-called reverse greedy method where the least important nodes are preferentially chosen to make the size of the largest component in the corresponding induced subgraph as small as possible. Accordingly, the nodes being chosen later are more important in maintaining the connectivity. Empirical analyses on ten real networks show that the reverse greedy method performs remarkably better than well-known state-of-the-art methods.
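The procedure described in the abstract can be sketched roughly as follows. This is a minimal, unoptimized illustration and not the authors' implementation. It assumes that nodes are added one at a time to an initially empty network, that each step picks the node with the lowest cost (the size of the largest component after the addition plus a small tie-breaking term `eps` times the node's degree), and that the final ranking is the reverse of the addition order. The function name `reverse_greedy_ranking`, the adjacency-dict input, and the default `eps` are hypothetical choices made for this example.

```python
from typing import Dict, List


def reverse_greedy_ranking(adj: Dict[int, List[int]], eps: float = 1e-6) -> List[int]:
    """Return nodes ordered from most to least vital (assumed reading of RG)."""
    parent: Dict[int, int] = {}  # union-find forest over nodes added so far
    size: Dict[int, int] = {}    # component size, valid at each root

    def find(x: int) -> int:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    remaining = set(adj)
    added_order: List[int] = []
    current_max = 0  # size of the largest component built so far

    while remaining:
        best_node, best_cost = None, float("inf")
        for i in sorted(remaining):  # sorted for deterministic tie-breaking
            # Components that node i would merge if it were added now.
            roots = {find(j) for j in adj[i] if j in parent}
            merged = 1 + sum(size[r] for r in roots)
            # Assumed cost: largest component after adding i, plus eps * degree.
            cost = max(current_max, merged) + eps * len(adj[i])
            if cost < best_cost:
                best_node, best_cost = i, cost

        # Add the cheapest node and merge it with its already-added neighbours.
        parent[best_node] = best_node
        size[best_node] = 1
        for j in adj[best_node]:
            if j in parent:
                ra, rb = find(best_node), find(j)
                if ra != rb:
                    if size[ra] < size[rb]:
                        ra, rb = rb, ra
                    parent[rb] = ra
                    size[ra] += size[rb]
        current_max = max(current_max, size[find(best_node)])
        remaining.remove(best_node)
        added_order.append(best_node)

    # Nodes added later are considered more important, so reverse the order.
    return added_order[::-1]


# Usage: in a star graph the hub is added last and therefore ranked first.
star = {0: [1, 2, 3, 4], 1: [0], 2: [0], 3: [0], 4: [0]}
ranking = reverse_greedy_ranking(star)
```

Each step rescans all remaining nodes, so this sketch is only practical for small networks.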

Tables

Table 1: The basic topological features of the ten real networks. $N$ and $E$ are the numbers of nodes and edges, $\langle k \rangle$ is the average degree, $C$ is the clustering coefficient, $r$ is the assortativity coefficient, and $H$ is the degree heterogeneity.

Networks	$N$	$E$	$⟨ k ⟩$	$C$	$r$	$H$
Jazz	198	2742	27.6970	0.6334	0.0202	1.3951
NS	379	914	4.8232	0.7981	-0.0817	1.6630
Email	1133	5451	9.6222	0.2540	0.0782	1.9421
PB	1222	16714	27.3552	0.3600	-0.2213	2.9707
Sex	15810	38540	4.8754	0	-0.1145	5.8276
Facebook	63731	817090	25.6418	0.2532	0.1769	3.4331
USAir	332	2126	12.8072	0.7494	-0.2079	3.4639
Power	4941	6594	2.6691	0.1065	0.0035	1.4504
Router	5022	6258	2.4922	0.0329	-0.1384	5.5031
HepPh	34546	420877	24.3662	0.2962	-0.0063	2.6055
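As a quick consistency check on the table above, the average degree of an undirected network satisfies ⟨k⟩ = 2E/N. The short sketch below recomputes it from the listed N and E values. The names `TABLE1` and `average_degree` are hypothetical and introduced only for this example.

```python
# (N, E, reported average degree) copied from Table 1.
TABLE1 = {
    "Jazz": (198, 2742, 27.6970),
    "NS": (379, 914, 4.8232),
    "Email": (1133, 5451, 9.6222),
    "PB": (1222, 16714, 27.3552),
    "Sex": (15810, 38540, 4.8754),
    "Facebook": (63731, 817090, 25.6418),
    "USAir": (332, 2126, 12.8072),
    "Power": (4941, 6594, 2.6691),
    "Router": (5022, 6258, 2.4922),
    "HepPh": (34546, 420877, 24.3662),
}


def average_degree(n_nodes: int, n_edges: int) -> float:
    """Average degree of an undirected graph: each edge contributes to two nodes."""
    return 2.0 * n_edges / n_nodes


# Largest absolute gap between the recomputed and the reported values.
max_gap = max(abs(average_degree(n, e) - k) for n, e, k in TABLE1.values())
```

The recomputed values should agree with the reported column up to rounding in the fourth decimal place.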

Table 2: The performance, measured by robustness $R$, of the eight ranking methods on ten real networks. The best-performing method for each network, namely the one with the lowest $R$ in the corresponding row, is emphasized in bold. Random removal (Random) serves as the background benchmark to show the improvement achieved by each method. The radius $\ell$ in CI is set to 2, and the feature $f(i)$ in RG is the degree of node $i$.

Networks	Random	BC	CC	DC	H-index	KS	PR	CI	RG
Jazz	0.4808	0.3956	0.4199	0.4409	0.4497	0.4571	0.4262	0.3913	0.3477
NS	0.2752	0.0488	0.1336	0.0540	0.1155	0.1582	0.0524	0.0551	0.0252
Email	0.4442	0.2578	0.2893	0.2519	0.2836	0.2937	0.2395	0.2231	0.1844
PB	0.4615	0.2192	0.2908	0.2286	0.2578	0.2611	0.2155	0.1968	0.1740
Sex	0.3842	0.0841	0.2208	0.0725	0.0981	0.1142	0.0690	0.0604	0.0513
Facebook	0.4545	0.2935	0.3570	0.3137	0.3328	0.3389	0.2893	0.2671	0.2372
USAir	0.4321	0.1129	0.1442	0.1228	0.1498	0.1588	0.1072	0.1105	0.0942
Power	0.2069	0.0656	0.1973	0.0634	0.1090	0.2628	0.0594	0.0489	0.0088
Router	0.3044	0.0142	0.0686	0.0121	0.0136	0.0276	0.0136	0.0140	0.0063
HepPh	0.4765	0.3504	0.4259	0.3664	0.3931	0.4022	0.3371	0.3015	0.2657
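The robustness values in the table above could be computed along the following lines. This is a sketch under an assumption that this excerpt does not spell out: S(Q) is taken to be the size of the largest connected component after removing the top-Q ranked nodes, divided by N. The function name `robustness` and the adjacency-dict input are hypothetical. Instead of removing nodes one by one, the sketch adds them back in reverse order with a union-find structure, which yields the same component sizes.

```python
from typing import Dict, List


def robustness(adj: Dict[int, List[int]], ranking: List[int]) -> float:
    """R = (1/N) * sum_{Q=1..N} S(Q), with S(Q) assumed to be normalized by N."""
    n = len(ranking)
    parent: Dict[int, int] = {}
    size: Dict[int, int] = {}

    def find(x: int) -> int:
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    largest = 0  # largest component among the nodes currently present
    total = 0    # accumulates N * S(Q) over Q = N, N-1, ..., 1

    for node in reversed(ranking):
        # Before re-adding `node`, the network equals the state in which this
        # node and every higher-ranked node have been removed.
        total += largest
        parent[node] = node
        size[node] = 1
        for j in adj[node]:
            if j in parent:
                ra, rb = find(node), find(j)
                if ra != rb:
                    if size[ra] < size[rb]:
                        ra, rb = rb, ra
                    parent[rb] = ra
                    size[ra] += size[rb]
        largest = max(largest, size[find(node)])

    return total / (n * n)


# Usage: on the path 0-1-2, removing the middle node first gives a lower R.
path = {0: [1], 1: [0, 2], 2: [1]}
r_middle_first = robustness(path, [1, 0, 2])
r_end_first = robustness(path, [0, 1, 2])
```

A lower R means the ranking dismantles the network faster, which matches the caption's convention that the lowest value in each row marks the best method.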

Equations

\mathrm{cost}(i, n+1) = G_{n+1}^{\max}(i) + \epsilon f(i),

R = \frac{1}{N} \sum_{Q=1}^{N} S(Q),

DC(i) = \sum_{j} a_{ij},

PR_{i}(t) = s \sum_{j=1}^{N} a_{ji} \frac{PR_{j}(t-1)}{k_{j}} + (1 - s) \frac{1}{N},

CC(i) = \frac{N - 1}{\sum_{j \neq i} d_{ij}},

BC(i) = \sum_{s \neq i, s \neq t, i \neq t} \frac{g_{st}(i)}{g_{st}},

CI(i) = (k_{i} - 1) \sum_{j \in \partial \mathrm{Ball}(i, \ell)} (k_{j} - 1),
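To make the last formula concrete, here is a small sketch of the collective influence (CI) score. It assumes that the frontier of the ball around node i with radius ℓ is the set of nodes at shortest-path distance exactly ℓ from i, found by breadth-first search. The function name `collective_influence` and the adjacency-dict input are hypothetical.

```python
from collections import deque
from typing import Dict, List


def collective_influence(adj: Dict[int, List[int]], i: int, radius: int = 2) -> int:
    """CI(i) = (k_i - 1) * sum of (k_j - 1) over nodes j at distance `radius` from i."""
    dist = {i: 0}
    queue = deque([i])
    frontier: List[int] = []

    while queue:
        u = queue.popleft()
        if dist[u] == radius:
            frontier.append(u)  # on the ball's boundary; do not expand further
            continue
        for v in adj[u]:
            if v not in dist:
                dist[v] = dist[u] + 1
                queue.append(v)

    return (len(adj[i]) - 1) * sum(len(adj[j]) - 1 for j in frontier)


# Usage: a path of seven nodes 0-1-2-3-4-5-6, scored at the middle node.
path7 = {k: [m for m in (k - 1, k + 1) if 0 <= m <= 6] for k in range(7)}
ci_middle = collective_influence(path7, 3, radius=2)
ci_end = collective_influence(path7, 0, radius=2)
```

For the middle node, the frontier is {1, 5} and each frontier node has degree 2, so the score is (2 - 1) * (1 + 1). An end node has degree 1, so its score is zero regardless of the frontier.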

Full text

Identifying vital nodes based on reverse greedy method

Tao Ren