Fair Clustering with Minimum Representation Constraints
Connor Lawless, Oktay Gunluk

TL;DR
This paper introduces a novel approach to fair clustering that ensures minimum group representation in multiple clusters, using a mixed-integer optimization formulation and heuristic algorithms to handle NP-hardness, demonstrated on benchmark datasets.
Contribution
It formulates fair clustering with minimum representation constraints as a mixed-integer nonlinear problem and proposes MiniReL, an alternating minimization algorithm with heuristics for practical large-scale application.
Findings
Fair clustering achieved without increased cost
Heuristic strategies enable scalability to large datasets
Numerical results validate effectiveness on benchmarks
Abstract
Clustering is a well-studied unsupervised learning task that aims to partition data points into a number of clusters. In many applications, these clusters correspond to real-world constructs (e.g., electoral districts, playlists, TV channels), where a group (e.g., social or demographic) benefits only if it reaches a minimum level of representation in the cluster (e.g., 50% to elect their preferred candidate). In this paper, we study the k-means and k-medians clustering problems under the additional fairness constraint that each group must attain a minimum level of representation in at least a specified number of clusters. We formulate this problem as a mixed-integer (nonlinear) optimization problem and propose an alternating minimization algorithm, called MiniReL, to solve it. Although incorporating fairness constraints results in an NP-hard assignment problem within the MiniReL…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Voting Systems · Privacy-Preserving Technologies in Data · Advanced Graph Neural Networks
MethodsSparse Evolutionary Training
