Feature-based Individual Fairness in k-Clustering
Debajyoti Kar, Mert Kosan, Debmalya Mandal, Sourav Medya, Arlei Silva,, Palash Dey, Swagato Sanyal

TL;DR
This paper introduces a new notion of individual fairness in k-clustering based on features, presents a randomized approximation algorithm, and demonstrates its effectiveness through experiments showing improved fairness and cost efficiency.
Contribution
The paper defines a novel individual fairness concept in k-clustering, proves NP-hardness, and provides a randomized algorithm with proven approximation guarantees.
Findings
Algorithm achieves 12.5% higher fairness than baselines on average.
Algorithm reduces clustering cost by 34.5% compared to the best baseline.
Experimental validation on six datasets confirms effectiveness.
Abstract
Ensuring fairness in machine learning algorithms is a challenging and essential task. We consider the problem of clustering a set of points while satisfying fairness constraints. While there have been several attempts to capture group fairness in the -clustering problem, fairness at an individual level is relatively less explored. We introduce a new notion of individual fairness in -clustering based on features not necessarily used for clustering. We show that this problem is NP-hard and does not admit a constant factor approximation. Therefore, we design a randomized algorithm that guarantees approximation both in terms of minimizing the clustering distance objective and individual fairness under natural restrictions on the distance metric and fairness constraints. Finally, our experimental results against six competing baselines validate that our algorithm produces individually…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Privacy-Preserving Technologies in Data · Visual Attention and Saliency Detection
