Fair Clustering with Minimum Representation Constraints

Connor Lawless; Oktay Gunluk

arXiv:2409.02963·math.OC·September 25, 2025

Fair Clustering with Minimum Representation Constraints

Connor Lawless, Oktay Gunluk

PDF

Open Access

TL;DR

This paper introduces a novel approach to fair clustering that ensures minimum group representation in multiple clusters, using a mixed-integer optimization formulation and heuristic algorithms to handle NP-hardness, demonstrated on benchmark datasets.

Contribution

It formulates fair clustering with minimum representation constraints as a mixed-integer nonlinear problem and proposes MiniReL, an alternating minimization algorithm with heuristics for practical large-scale application.

Findings

01

Fair clustering achieved without increased cost

02

Heuristic strategies enable scalability to large datasets

03

Numerical results validate effectiveness on benchmarks

Abstract

Clustering is a well-studied unsupervised learning task that aims to partition data points into a number of clusters. In many applications, these clusters correspond to real-world constructs (e.g., electoral districts, playlists, TV channels), where a group (e.g., social or demographic) benefits only if it reaches a minimum level of representation in the cluster (e.g., 50% to elect their preferred candidate). In this paper, we study the k-means and k-medians clustering problems under the additional fairness constraint that each group must attain a minimum level of representation in at least a specified number of clusters. We formulate this problem as a mixed-integer (nonlinear) optimization problem and propose an alternating minimization algorithm, called MiniReL, to solve it. Although incorporating fairness constraints results in an NP-hard assignment problem within the MiniReL…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGame Theory and Voting Systems · Privacy-Preserving Technologies in Data · Advanced Graph Neural Networks

MethodsSparse Evolutionary Training