Spectral Clustering with Side Information
Hendrik Fichtenberger, Michael Kapralov, Ekaterina Kochetkova, Silvio Lattanzi, Davide Mazzali, Weronika Wrzos-Kaminska

TL;DR
This paper introduces a sublinear-time spectral clustering algorithm that effectively combines graph structure and vertex labels to achieve near-optimal misclassification rates, and also provides a polynomial-time method to refine community structures.
Contribution
It presents a novel spectral clustering algorithm that integrates side information for improved accuracy and a graph reweighting technique to enhance community detection.
Findings
Achieves nearly optimal misclassification rate of d7 f8(b5 d7 f0(b4))
Operates in sublinear time relative to the number of vertices
Provides a polynomial-time method to reweight edges for better community structure preservation.
Abstract
In the graph clustering problem with a planted solution, the input is a graph on vertices partitioned into clusters, and the task is to infer the clusters from graph structure. A standard assumption is that clusters induce well-connected subgraphs (i.e. -expanders), and form -sparse cuts. Such a graph defines the clustering uniquely up to misclassification rate, and efficient algorithms for achieving this rate are known. While this vanilla version of graph clustering is well studied, in practice, vertices of the graph are typically equipped with labels that provide additional information on cluster ids of the vertices. For example, each vertex could have a cluster label that is corrupted independently with probability . Using only one of the two sources of information leads to misclassification rate , but…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research · Complex Network Analysis Techniques · Complexity and Algorithms in Graphs
