Diversity-aware $k$-median : Clustering with fair center representation
Suhas Thejaswi, Bruno Ordozgoiti, Aristides Gionis

TL;DR
This paper introduces a diversity-aware $k$-median clustering problem ensuring fair representation of protected groups in cluster centers, analyzing its computational complexity and proposing algorithms for different cases.
Contribution
It formalizes the diversity-aware $k$-median problem, proves its computational hardness in general, and provides approximation algorithms for the disjoint group case, along with heuristics for intractable scenarios.
Findings
The problem is NP-hard and inapproximable when groups overlap.
Approximation algorithms are effective for disjoint group cases.
Heuristic methods yield high-quality solutions for real-world data.
Abstract
We introduce a novel problem for diversity-aware clustering. We assume that the potential cluster centers belong to a set of groups defined by protected attributes, such as ethnicity, gender, etc. We then ask to find a minimum-cost clustering of the data into clusters so that a specified minimum number of cluster centers are chosen from each group. We thus require that all groups are represented in the clustering solution as cluster centers, according to specified requirements. More precisely, we are given a set of clients , a set of facilities , a collection of facility groups , budget , and a set of lower-bound thresholds , one for each group in . The \emph{diversity-aware -median problem} asks to find a set of facilities in such that $|S \cap…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
