Combinatorial Approximations for Cluster Deletion: Simpler, Faster, and   Better

Vicente Balmaseda; Ying Xu; Yixin Cao; Nate Veldt

arXiv:2404.16131·cs.DS·April 26, 2024·2 cites

Combinatorial Approximations for Cluster Deletion: Simpler, Faster, and Better

Vicente Balmaseda, Ying Xu, Yixin Cao, Nate Veldt

PDF

Open Access 1 Repo

TL;DR

This paper improves approximation guarantees for cluster deletion algorithms, introduces a simple derandomization method, and develops a scalable combinatorial approach for linear programming, enhancing efficiency and practicality.

Contribution

Provides tighter analysis of existing algorithms, introduces a simple derandomization technique, and designs a scalable combinatorial method for linear programming in cluster deletion.

Findings

01

Approximation guarantees improved from 4 to 3.

02

Simple greedy derandomization method introduced.

03

New combinatorial approach for linear programming developed.

Abstract

Cluster deletion is an NP-hard graph clustering objective with applications in computational biology and social network analysis, where the goal is to delete a minimum number of edges to partition a graph into cliques. We first provide a tighter analysis of two previous approximation algorithms, improving their approximation guarantees from 4 to 3. Moreover, we show that both algorithms can be derandomized in a surprisingly simple way, by greedily taking a vertex of maximum degree in an auxiliary graph and forming a cluster around it. One of these algorithms relies on solving a linear program. Our final contribution is to design a new and purely combinatorial approach for doing so that is far more scalable in theory and practice.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

vibalcam/combinatorial-cluster-deletion
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsData Quality and Management · Advanced Clustering Algorithms Research