Breaking the Small Cluster Barrier of Graph Clustering
Nir Ailon, Yudong Chen, Xu Huan

TL;DR
This paper demonstrates that small clusters do not prevent the recovery of large clusters in graph clustering, introduces an iterative peeling algorithm, and extends results to partial observation settings.
Contribution
It refines the analysis of trace-norm based recovery to handle small clusters and proposes an iterative peeling strategy for almost complete cluster recovery.
Findings
Small clusters do not hinder large cluster recovery under mild conditions.
An iterative peeling algorithm effectively recovers almost all clusters.
The approach extends to partial observation scenarios with active learning.
Abstract
This paper investigates graph clustering in the planted cluster model in the presence of {\em small clusters}. Traditional results dictate that for an algorithm to provably correctly recover the clusters, {\em all} clusters must be sufficiently large (in particular, where is the number of nodes of the graph). We show that this is not really a restriction: by a more refined analysis of the trace-norm based recovery approach proposed in Jalali et al. (2011) and Chen et al. (2012), we prove that small clusters, under certain mild assumptions, do not hinder recovery of large ones. Based on this result, we further devise an iterative algorithm to recover {\em almost all clusters} via a "peeling strategy", i.e., recover large clusters first, leading to a reduced problem, and repeat this procedure. These results are extended to the {\em partial observation}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComplex Network Analysis Techniques · Topological and Geometric Data Analysis · Advanced Graph Neural Networks
