Mitigating Algorithmic Bias with Limited Annotations
Guanchu Wang, Mengnan Du, Ninghao Liu, Na Zou, Xia Hu

TL;DR
This paper introduces APOD, an interactive framework that effectively reduces algorithmic bias with limited sensitive attribute annotations by guiding annotation efforts and bounding bias, outperforming baselines on benchmark datasets.
Contribution
The paper proposes APOD, a novel active learning framework that combines discrimination penalization with instance selection to mitigate bias with minimal annotations.
Findings
APOD outperforms baseline methods under limited annotation budgets.
APOD achieves comparable bias mitigation to fully annotated approaches.
Theoretical proof bounds the algorithmic bias effectively.
Abstract
Existing work on fairness modeling commonly assumes that sensitive attributes for all instances are fully available, which may not be true in many real-world applications due to the high cost of acquiring sensitive information. When sensitive attributes are not disclosed or available, it is needed to manually annotate a small part of the training data to mitigate bias. However, the skewed distribution across different sensitive groups preserves the skewness of the original dataset in the annotated subset, which leads to non-optimal bias mitigation. To tackle this challenge, we propose Active Penalization Of Discrimination (APOD), an interactive framework to guide the limited annotations towards maximally eliminating the effect of algorithmic bias. The proposed APOD integrates discrimination penalization with active instance selection to efficiently utilize the limited annotation budget,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Ethics and Social Impacts of AI
