Fair Clustering Under a Bounded Cost
Seyed A. Esmaeili, Brian Brubach, Aravind Srinivasan, John P., Dickerson

TL;DR
This paper addresses fair clustering by bounding the clustering cost and maximizing fairness, introducing algorithms with guarantees for different fairness objectives and validating them on real datasets.
Contribution
It proposes a new approach to fair clustering by constraining cost and optimizing fairness, with theoretical bounds and practical algorithms.
Findings
Algorithms with provable guarantees for utilitarian and egalitarian fairness.
Fundamental lower bounds on approximation for certain fairness objectives.
Experimental validation on real-world datasets.
Abstract
Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the ''price of fairness,'' can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGame Theory and Voting Systems · Facility Location and Emergency Management · Economic and Environmental Valuation
