Approximate Correlation Clustering Using Same-Cluster Queries

Nir Ailon; Anup Bhattacharya; Ragesh Jaiswal

arXiv:1712.06865·cs.DS·December 20, 2017

Approximate Correlation Clustering Using Same-Cluster Queries

Nir Ailon, Anup Bhattacharya, Ragesh Jaiswal

PDF

TL;DR

This paper extends semi-supervised same-cluster query frameworks to correlation clustering, providing an efficient approximation algorithm with bounds on query complexity, advancing clustering methods with minimal supervision.

Contribution

It introduces an (1+ε)-approximation algorithm for correlation clustering with polynomial runtime in input size, k, and 1/ε, along with bounds on same-cluster query complexity.

Findings

01

Achieves (1+ε)-approximation for correlation clustering.

02

Provides bounds on the number of same-cluster queries needed.

03

Extends semi-supervised clustering frameworks to correlation clustering.

Abstract

Ashtiani et al. (NIPS 2016) introduced a semi-supervised framework for clustering (SSAC) where a learner is allowed to make same-cluster queries. More specifically, in their model, there is a query oracle that answers queries of the form given any two vertices, do they belong to the same optimal cluster?. Ashtiani et al. showed the usefulness of such a query framework by giving a polynomial time algorithm for the k-means clustering problem where the input dataset satisfies some separation condition. Ailon et al. extended the above work to the approximation setting by giving an efficient (1+\eps)-approximation algorithm for k-means for any small \eps > 0 and any dataset within the SSAC framework. In this work, we extend this line of study to the correlation clustering problem. Correlation clustering is a graph clustering problem where pairwise similarity (or dissimilarity) information is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.