Bipartite Correlation Clustering -- Maximizing Agreements
Megasthenis Asteris, Anastasios Kyrillidis, Dimitris, Papailiopoulos, Alexandros G. Dimakis

TL;DR
This paper introduces a novel approximation algorithm for bipartite correlation clustering with a fixed number of clusters, achieving near-optimal agreements efficiently, and extends results to the unconstrained case with an efficient PTAS.
Contribution
The paper presents a new approximation algorithm for k-BCC with provable guarantees and extends the approach to the unconstrained BCC setting, providing an efficient PTAS.
Findings
Achieves a (1-δ)-approximation for k-BCC with exponential dependence on k and δ^{-1}.
In the unconstrained BCC, a (1-δ)-approximation can be achieved with O(δ^{-1}) clusters.
Provides an efficient PTAS for maximizing agreements in BCC.
Abstract
In Bipartite Correlation Clustering (BCC) we are given a complete bipartite graph with `+' and `-' edges, and we seek a vertex clustering that maximizes the number of agreements: the number of all `+' edges within clusters plus all `-' edges cut across clusters. BCC is known to be NP-hard. We present a novel approximation algorithm for -BCC, a variant of BCC with an upper bound on the number of clusters. Our algorithm outputs a -clustering that provably achieves a number of agreements within a multiplicative -factor from the optimal, for any desired accuracy . It relies on solving a combinatorially constrained bilinear maximization on the bi-adjacency matrix of . It runs in time exponential in and , but linear in the size of the input. Further, we show that, in the (unconstrained) BCC setting, an -approximation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Advanced Clustering Algorithms Research · Bayesian Methods and Mixture Models
