Distributed Graph Clustering and Sparsification
He Sun, Luca Zanetti

TL;DR
This paper introduces a simple, distributed graph clustering algorithm that efficiently handles large, structured graphs by combining sampling-based sparsification with a poly-logarithmic convergence time.
Contribution
The paper presents a novel distributed clustering method that is simple, fast, and effective for graphs with strong cluster structures, using a new sampling scheme for sparsification.
Findings
Algorithm finishes in poly-logarithmic rounds.
Recovers a partition close to optimal.
Sampling scheme preserves cluster structure efficiently.
Abstract
Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of algorithmic design methods for graph clustering. Most of these methods, however, are based on complicated spectral techniques or convex optimisation, and cannot be directly applied for clustering many networks that occur in practice, whose information is often collected on different sites. Designing a simple and distributed clustering algorithm is of great interest, and has wide applications for processing big datasets. In this paper we present a simple and distributed algorithm for graph clustering: for a wide class of graphs that are characterised by a strong cluster-structure, our algorithm finishes in a poly-logarithmic number of rounds, and recovers a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
