Fair Coresets and Streaming Algorithms for Fair k-Means Clustering
Melanie Schmidt, Chris Schwiegelshohn, Christian Sohler

TL;DR
This paper introduces scalable algorithms and coresets for fair k-means clustering that ensure balanced clusters with respect to sensitive attributes, improving efficiency and applicability in large datasets.
Contribution
It presents the first scalable coreset construction for fair clustering and extends Lloyd's and k-means++ algorithms to incorporate fairness constraints.
Findings
Coresets are composable and effective in reducing data size.
The proposed algorithms perform well in empirical tests.
Scalable solutions enable fair clustering on large datasets.
Abstract
We study fair clustering problems as proposed by Chierichetti et al. (NIPS 2017). Here, points have a sensitive attribute and all clusters in the solution are required to be balanced with respect to it (to counteract any form of data-inherent bias). Previous algorithms for fair clustering do not scale well. We show how to model and compute so-called coresets for fair clustering problems, which can be used to significantly reduce the input data size. We prove that the coresets are composable and show how to compute them in a streaming setting. Furthermore, we propose a variant of Lloyd's algorithm that computes fair clusterings and extend it to a fair k-means++ clustering algorithm. We implement these algorithms and provide empirical evidence that the combination of our approximation algorithms and the coreset construction yields a scalable algorithm for fair k-means clustering.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Facility Location and Emergency Management · Stochastic Gradient Optimization Techniques
