Community detection using low-dimensional network embedding algorithms

Aman Barot; Shankar Bhamidi; Souvik Dhara

arXiv:2111.05267 · cs.SI · November 10, 2021

TL;DR

This paper analyzes how well the network embedding algorithms DeepWalk and node2vec detect communities in large, sparse networks, giving theoretical guarantees and conditions for successful recovery.

Contribution

It offers a rigorous theoretical comparison of DeepWalk and node2vec, revealing conditions under which each algorithm can successfully recover communities in sparse networks.

Findings

01 node2vec outperforms DeepWalk in sparser networks

02 Random walk length affects community recovery success

03 Algorithms may fail in very sparse settings

Abstract

With the increasing relevance of large networks in important areas such as the study of contact networks for the spread of disease, or social networks for their impact on geopolitics, it has become necessary to study machine learning tools that are scalable to very large networks, often containing millions of nodes. One major class of such scalable algorithms is known as network representation learning or network embedding. These algorithms try to learn representations of network functionals (e.g., nodes) by first running multiple random walks and then using the number of co-occurrences of each pair of nodes in observed random walk segments to obtain a low-dimensional representation of nodes in some Euclidean space. The aim of this paper is to rigorously understand the performance of two major algorithms, DeepWalk and node2vec, in recovering communities for canonical network models with…
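The co-occurrence step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the toy graph, the function names `random_walk` and `cooccurrence_counts`, and the parameters `walks_per_node`, `walk_length`, and `window` are all assumptions made for illustration. The walks here are unbiased, which is closer in spirit to DeepWalk; node2vec uses biased second-order walks, which this sketch does not attempt. The final step of turning the counts into a low-dimensional embedding is also omitted.

```python
import random
from collections import defaultdict


def random_walk(adj, start, length, rng):
    """Unbiased random walk that visits up to `length` nodes, starting at `start`."""
    walk = [start]
    while len(walk) < length:
        neighbors = adj[walk[-1]]
        if not neighbors:  # dead end: stop the walk early
            break
        walk.append(rng.choice(neighbors))
    return walk


def cooccurrence_counts(adj, walks_per_node, walk_length, window, seed=0):
    """Count how often each ordered pair of nodes appears within
    `window` steps of each other across all sampled walks."""
    rng = random.Random(seed)
    counts = defaultdict(int)
    for node in adj:
        for _ in range(walks_per_node):
            walk = random_walk(adj, node, walk_length, rng)
            for i, u in enumerate(walk):
                # pair u with the next `window` nodes on the walk
                for v in walk[i + 1 : i + 1 + window]:
                    counts[(u, v)] += 1
                    counts[(v, u)] += 1  # keep the counts symmetric
    return counts


# Hypothetical toy graph: two triangles {0,1,2} and {3,4,5} joined by the edge 2-3.
adj = {
    0: [1, 2],
    1: [0, 2],
    2: [0, 1, 3],
    3: [2, 4, 5],
    4: [3, 5],
    5: [3, 4],
}

counts = cooccurrence_counts(adj, walks_per_node=50, walk_length=10, window=2)
```

In a graph with community structure, pairs inside the same community would typically accumulate larger counts than pairs across communities, and that is the signal an embedding built from these counts could exploit.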

Taxonomy

Topics: Complex Network Analysis Techniques · Advanced Graph Neural Networks · Topological and Geometric Data Analysis

Methods: DeepWalk · node2vec