Certifying the Fairness of KNN in the Presence of Dataset Bias
Yannan Li, Jingbo Wang, and Chao Wang

TL;DR
This paper introduces a novel certification method for assessing the fairness of KNN classifiers, even when training data contains systematic bias, by using abstract interpretation to efficiently approximate fairness metrics.
Contribution
It is the first to certify KNN fairness under multiple definitions using abstract interpretation, reducing computational costs while handling biased datasets.
Findings
Effective certification on six benchmark datasets
Accurate fairness assessments despite dataset bias
Applicable to multiple fairness definitions
Abstract
We propose a method for certifying the fairness of the classification result of a widely used supervised learning algorithm, the k-nearest neighbors (KNN), under the assumption that the training data may have historical bias caused by systematic mislabeling of samples from a protected minority group. To the best of our knowledge, this is the first certification method for KNN based on three variants of the fairness definition: individual fairness, -fairness, and label-flipping fairness. We first define the fairness certification problem for KNN and then propose sound approximations of the complex arithmetic computations used in the state-of-the-art KNN algorithm. This is meant to lift the computation results from the concrete domain to an abstract domain, to reduce the computational cost. We show effectiveness of this abstract interpretation based technique through…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI
