Community Detection Guarantees Using Embeddings Learned by Node2Vec

Andrew Davison; S. Carlyle Morgan; Owen G. Ward

arXiv:2310.17712·stat.ML·October 23, 2024·1 cites

Community Detection Guarantees Using Embeddings Learned by Node2Vec

Andrew Davison, S. Carlyle Morgan, Owen G. Ward

PDF

Open Access 1 Video

TL;DR

This paper provides theoretical guarantees for community detection using node2vec embeddings, showing that k-means clustering on these embeddings can reliably recover communities in stochastic block models.

Contribution

It offers the first theoretical analysis of node2vec embeddings, establishing conditions under which community detection via k-means is consistent.

Findings

01

k-means on node2vec embeddings achieves weakly consistent community recovery

02

Empirical validation supports theoretical results

03

Discussion of embeddings' effectiveness for link prediction

Abstract

Embedding the nodes of a large network into an Euclidean space is a common objective in modern machine learning, with a variety of tools available. These embeddings can then be used as features for tasks such as community detection/node clustering or link prediction, where they achieve state of the art performance. With the exception of spectral clustering methods, there is little theoretical understanding for commonly used approaches to learning embeddings. In this work we examine the theoretical properties of the embeddings learned by node2vec. Our main result shows that the use of $k$ -means clustering on the embedding vectors produced by node2vec gives weakly consistent community recovery for the nodes in (degree corrected) stochastic block models. We also discuss the use of these embeddings for node and link prediction tasks. We demonstrate this result empirically, and examine how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Community Detection Guarantees using Embeddings Learned by Node2Vec· slideslive

Taxonomy

TopicsComplex Network Analysis Techniques · Advanced Graph Neural Networks · Functional Brain Connectivity Studies

Methodsnode2vec · Spectral Clustering · k-Means Clustering