Connected k-Median with Disjoint and Non-disjoint Clusters
Jan Eube, Kelin Luo, Dorian Reineccius, Heiko R\"oglin, Melanie Schmidt

TL;DR
This paper introduces an approximation algorithm for the connected k-median clustering problem with overlapping clusters in arbitrary graphs, addressing its computational hardness and providing exact solutions for special cases.
Contribution
It presents the first approximation algorithm for overlapping connected k-median clustering in general graphs and analyzes hardness for disjoint clusters.
Findings
Approximation algorithm with O(k^2 log n) ratio.
Hardness result of Ω(n^{1-ε}) for disjoint clusters.
Exact algorithm for tree connectivity graphs.
Abstract
The connected -median problem is a constrained clustering problem that combines distance-based -clustering with connectivity information. The problem allows to input a metric space and an unweighted undirected connectivity graph that is completely unrelated to the metric space. The goal is to compute centers and corresponding clusters such that each cluster forms a connected subgraph of , and such that the -median cost is minimized. The problem has applications in very different fields like geodesy (particularly districting), social network analysis (especially community detection), or bioinformatics. We study a version with overlapping clusters where points can be part of multiple clusters which is natural for the use case of community detection. This problem variant is -hard to approximate, and our main result is an $\mathcal{O}(k^2 \log…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
