Unbiased Subdata Selection for Fair Classification: A Unified Framework and Scalable Algorithms
Qing Ye, Weijun Xie

TL;DR
This paper introduces a unified, scalable framework for fair classification that jointly optimizes accuracy and fairness, capable of modeling various fairness measures and applicable to deep models, with proven theoretical properties and practical algorithms.
Contribution
It develops a novel unified framework for fair classification, proves Fisher consistency, and introduces scalable algorithms including an iterative refining strategy for large-scale data.
Findings
Framework can incorporate multiple fairness measures.
Mixed-integer convex reformulations enable benchmarking.
Iterative refining strategy improves fairness and accuracy.
Abstract
As an important problem in modern data analytics, classification has witnessed varieties of applications from different domains. Different from conventional classification approaches, fair classification concerns the issues of unintentional biases against the sensitive features (e.g., gender, race). Due to high nonconvexity of fairness measures, existing methods are often unable to model exact fairness, which can cause inferior fair classification outcomes. This paper fills the gap by developing a novel unified framework to jointly optimize accuracy and fairness. The proposed framework is versatile and can incorporate different fairness measures studied in literature precisely as well as can be applicable to many classifiers including deep classification models. Specifically, in this paper, we first prove Fisher consistency of the proposed framework. We then show that many…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Privacy-Preserving Technologies in Data · Machine Learning and Data Classification
