Approximate Correlation Clustering Using Same-Cluster Queries
Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal

TL;DR
This paper extends semi-supervised same-cluster query frameworks to correlation clustering, providing an efficient approximation algorithm with bounds on query complexity, advancing clustering methods with minimal supervision.
Contribution
It introduces an (1+ε)-approximation algorithm for correlation clustering with polynomial runtime in input size, k, and 1/ε, along with bounds on same-cluster query complexity.
Findings
Achieves (1+ε)-approximation for correlation clustering.
Provides bounds on the number of same-cluster queries needed.
Extends semi-supervised clustering frameworks to correlation clustering.
Abstract
Ashtiani et al. (NIPS 2016) introduced a semi-supervised framework for clustering (SSAC) where a learner is allowed to make same-cluster queries. More specifically, in their model, there is a query oracle that answers queries of the form given any two vertices, do they belong to the same optimal cluster?. Ashtiani et al. showed the usefulness of such a query framework by giving a polynomial time algorithm for the k-means clustering problem where the input dataset satisfies some separation condition. Ailon et al. extended the above work to the approximation setting by giving an efficient (1+\eps)-approximation algorithm for k-means for any small \eps > 0 and any dataset within the SSAC framework. In this work, we extend this line of study to the correlation clustering problem. Correlation clustering is a graph clustering problem where pairwise similarity (or dissimilarity) information is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
