Spectral clustering and the high-dimensional stochastic blockmodel
Karl Rohe, Sourav Chatterjee, Bin Yu

TL;DR
This paper analyzes spectral clustering on high-dimensional stochastic blockmodels, providing bounds on misclustering and demonstrating eigenvector convergence, thus advancing understanding of community detection in large, complex networks.
Contribution
It introduces novel asymptotic bounds for spectral clustering in high-dimensional stochastic blockmodels and proves eigenvector convergence under a general latent space model.
Findings
Bounds on misclustered nodes in spectral clustering
Eigenvector convergence to population eigenvectors
First results allowing number of clusters to grow with nodes
Abstract
Networks or graphs can easily represent a diverse set of data sources that are characterized by interacting units or actors. Social networks, representing people who communicate with each other, are one example. Communities or clusters of highly connected actors form an essential feature in the structure of several empirical networks. Spectral clustering is a popular and computationally feasible method to discover these communities. The stochastic blockmodel [Social Networks 5 (1983) 109--137] is a social network model with well-defined communities; each node is a member of one community. For a network generated from the Stochastic Blockmodel, we bound the number of nodes "misclustered" by spectral clustering. The asymptotic results in this paper are the first clustering results that allow the number of clusters in the model to grow with the number of nodes, hence the name…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
