Community detection using low-dimensional network embedding algorithms
Aman Barot, Shankar Bhamidi, Souvik Dhara

TL;DR
This paper analyzes the effectiveness of network embedding algorithms DeepWalk and node2vec in community detection within large, sparse networks, providing theoretical guarantees and conditions for successful recovery.
Contribution
It offers a rigorous theoretical comparison of DeepWalk and node2vec, revealing conditions under which each algorithm can successfully recover communities in sparse networks.
Findings
node2vec outperforms DeepWalk in sparser networks
Random walk length affects community recovery success
Algorithms may fail in very sparse settings
Abstract
With the increasing relevance of large networks in important areas such as the study of contact networks for spread of disease, or social networks for their impact on geopolitics, it has become necessary to study machine learning tools that are scalable to very large networks, often containing millions of nodes. One major class of such scalable algorithms is known as network representation learning or network embedding. These algorithms try to learn representations of network functionals (e.g.~nodes) by first running multiple random walks and then using the number of co-occurrences of each pair of nodes in observed random walk segments to obtain a low-dimensional representation of nodes on some Euclidean space. The aim of this paper is to rigorously understand the performance of two major algorithms, DeepWalk and node2vec, in recovering communities for canonical network models with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Topological and Geometric Data Analysis
MethodsDeepWalk · node2vec
