Scalable Fair Clustering
Arturs Backurs, Piotr Indyk, Krzysztof Onak, Baruch Schieber, Ali, Vakilian, Tal Wagner

TL;DR
This paper introduces a fast, practical algorithm for fair clustering that improves efficiency and offers better control over cluster balance, addressing limitations of previous methods.
Contribution
We develop a nearly linear time fairlet decomposition algorithm that enhances scalability and cluster balance control over prior super-quadratic approaches.
Findings
Algorithm runs in nearly linear time.
Provides finer control over cluster fairness.
Empirical results validate efficiency and effectiveness.
Abstract
We study the fair variant of the classic -median problem introduced by Chierichetti et al. [2017]. In the standard -median problem, given an input pointset , the goal is to find centers and assign each input point to one of the centers in such that the average distance of points to their cluster center is minimized. In the fair variant of -median, the points are colored, and the goal is to minimize the same average distance objective while ensuring that all clusters have an "approximately equal" number of points of each color. Chierichetti et al. proposed a two-phase algorithm for fair -clustering. In the first step, the pointset is partitioned into subsets called fairlets that satisfy the fairness requirement and approximately preserve the -median objective. In the second step, fairlets are merged into clusters by one of the existing -median…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFacility Location and Emergency Management · Point processes and geometric inequalities
