Unbiasing on the Fly: Explanation-Guided Human Oversight of Machine Learning System Decisions
Hussaini Mamman, Shuib Basri, Abdullateef Balogun, Abubakar Abdullahi, Imam, Ganesh Kumar, Luiz Fernando Capretz

TL;DR
This paper introduces a real-time, explanation-guided human oversight framework for detecting and correcting discriminatory decisions in deployed machine learning systems, enhancing fairness and trust.
Contribution
It proposes a novel on-the-fly monitoring framework using counterfactual explanations and human-in-the-loop intervention to mitigate discrimination during ML system deployment.
Findings
Framework effectively flags discriminatory outcomes in real-time
Human reviewers can override ML decisions to ensure fairness
Promising approach for responsible ML deployment
Abstract
The widespread adoption of ML systems across critical domains like hiring, finance, and healthcare raises growing concerns about their potential for discriminatory decision-making based on protected attributes. While efforts to ensure fairness during development are crucial, they leave deployed ML systems vulnerable to potentially exhibiting discrimination during their operations. To address this gap, we propose a novel framework for on-the-fly tracking and correction of discrimination in deployed ML systems. Leveraging counterfactual explanations, the framework continuously monitors the predictions made by an ML system and flags discriminatory outcomes. When flagged, post-hoc explanations related to the original prediction and the counterfactual alternatives are presented to a human reviewer for real-time intervention. This human-in-the-loop approach empowers reviewers to accept or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
