PASCO (PArallel Structured COarsening): an overlay to speed up graph clustering algorithms
Etienne Lasalle (OCKHAM), R\'emi Vaudaine (OCKHAM), Titouan Vayer (OCKHAM), Pierre Borgnat (Phys-ENS), R\'emi Gribonval (OCKHAM), Paulo Gon\c{c}alves (OCKHAM), M\`arton Karsai (CEU)

TL;DR
PASCO is a novel overlay framework that accelerates large-scale graph clustering by combining parallel coarsening, multiple clustering runs, and optimal transport for partition alignment, significantly improving efficiency and quality.
Contribution
This work introduces PASCO, a new parallel graph coarsening and clustering framework that preserves structure and enhances scalability for large graphs.
Findings
PASCO significantly reduces clustering computation time.
It maintains high structural fidelity in the resulting partitions.
It outperforms existing methods on synthetic and real-world datasets.
Abstract
Clustering the nodes of a graph is a cornerstone of graph analysis and has been extensively studied. However, some popular methods are not suitable for very large graphs: e.g., spectral clustering requires the computation of the spectral decomposition of the Laplacian matrix, which is not applicable for large graphs with a large number of communities. This work introduces PASCO, an overlay that accelerates clustering algorithms. Our method consists of three steps: 1-We compute several independent small graphs representing the input graph by applying an efficient and structure-preserving coarsening algorithm. 2-A clustering algorithm is run in parallel onto each small graph and provides several partitions of the initial graph. 3-These partitions are aligned and combined with an optimal transport method to output the final partition. The PASCO framework is based on two key contributions:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSpectral Clustering
