Fair Clustering Under a Bounded Cost

Seyed A. Esmaeili; Brian Brubach; Aravind Srinivasan; John P.; Dickerson

arXiv:2106.07239·cs.LG·January 10, 2023·1 cites

Fair Clustering Under a Bounded Cost

Seyed A. Esmaeili, Brian Brubach, Aravind Srinivasan, John P., Dickerson

PDF

Open Access 1 Video

TL;DR

This paper addresses fair clustering by bounding the clustering cost and maximizing fairness, introducing algorithms with guarantees for different fairness objectives and validating them on real datasets.

Contribution

It proposes a new approach to fair clustering by constraining cost and optimizing fairness, with theoretical bounds and practical algorithms.

Findings

01

Algorithms with provable guarantees for utilitarian and egalitarian fairness.

02

Fundamental lower bounds on approximation for certain fairness objectives.

03

Experimental validation on real-world datasets.

Abstract

Clustering is a fundamental unsupervised learning problem where a dataset is partitioned into clusters that consist of nearby points in a metric space. A recent variant, fair clustering, associates a color with each point representing its group membership and requires that each color has (approximately) equal representation in each cluster to satisfy group fairness. In this model, the cost of the clustering objective increases due to enforcing fairness in the algorithm. The relative increase in the cost, the ''price of fairness,'' can indeed be unbounded. Therefore, in this paper we propose to treat an upper bound on the clustering objective as a constraint on the clustering problem, and to maximize equality of representation subject to it. We consider two fairness objectives: the group utilitarian objective and the group egalitarian objective, as well as the group leximin objective…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Fair Clustering Under a Bounded Cost· slideslive

Taxonomy

TopicsGame Theory and Voting Systems · Facility Location and Emergency Management · Economic and Environmental Valuation