Robust and computationally feasible community detection in the presence of arbitrary outlier nodes
T. Tony Cai, Xiaodong Li

TL;DR
This paper introduces a robust community detection method under a generalized stochastic block model that includes arbitrary outliers, providing theoretical guarantees and demonstrating practical effectiveness and speed.
Contribution
The paper extends community detection to handle arbitrary outliers in the GSBM, with a convex optimization approach and theoretical guarantees for accurate detection.
Findings
Method is computationally fast and robust to outliers.
Theoretical guarantee for community detection with growing number of clusters.
Numerical results outperform some existing algorithms in robustness and speed.
Abstract
Community detection, which aims to cluster nodes in a given graph into distinct groups based on the observed undirected edges, is an important problem in network data analysis. In this paper, the popular stochastic block model (SBM) is extended to the generalized stochastic block model (GSBM) that allows for adversarial outlier nodes, which are connected with the other nodes in the graph in an arbitrary way. Under this model, we introduce a procedure using convex optimization followed by -means algorithm with . Both theoretical and numerical properties of the method are analyzed. A theoretical guarantee is given for the procedure to accurately detect the communities with small misclassification rate under the setting where the number of clusters can grow with . This theoretical result admits to the best-known result in the literature of computationally feasible…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSpectral Clustering
