Check, Please: Verifiably Fair Clustering
Yu He, Jeremy Vollen, Edith Elkind

TL;DR
This paper investigates the computational complexity of verifying proportional fairness in clustering, introduces new concepts for practical auditing, and provides algorithms for efficient verification.
Contribution
It proves coNP-hardness of verifying mPJR, introduces DC-mPJR+ for scalable auditing, and connects these concepts to practical clustering fairness guarantees.
Findings
Verifying mPJR is coNP-hard.
Introduces DC-mPJR+ with an $O(mn ext{ log } n + mnk)$ verification algorithm.
DC-mPJR+ serves as a practical proxy for global fairness in clustering.
Abstract
Popular centroid-based clustering methods are typically optimized for global objectives and may fail to adequately represent large groups of datapoints. To address this concern, recent work puts forward clustering analogs of social choice proportionality concepts, such as Proportionally Representative Fairness (also known as mPJR). For proportionality guarantees to be useful in practice, they must be (a) achievable and (b) efficiently auditable, so that one can check whether standard approaches, such as -means, which are not guaranteed to provide proportional representation in general, nevertheless output proportional solutions on specific inputs. In this work, we study the computational complexity of verifying proportional representation in clustering. We first show that verifying mPJR is coNP-hard. Inspired by PJR+ -- a strengthening of PJR that is polynomial-time verifiable in the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
