Simultaneous Improvement of ML Model Fairness and Performance by   Identifying Bias in Data

Bhushan Chaudhari; Akash Agarwal; Tanmoy Bhowmik

arXiv:2210.13182·cs.LG·October 25, 2022·1 cites

Simultaneous Improvement of ML Model Fairness and Performance by Identifying Bias in Data

Bhushan Chaudhari, Akash Agarwal, Tanmoy Bhowmik

PDF

Open Access

TL;DR

This paper introduces a data preprocessing method that identifies and removes biased instances related to protected attributes, improving fairness without sacrificing model accuracy in machine learning tasks.

Contribution

The authors propose a novel bias detection and removal technique that enhances fairness and maintains accuracy, addressing a key challenge in ML model development.

Findings

01

Bias can be mitigated by removing specific instances without accuracy loss

02

The method improves fairness metrics on open-source datasets

03

End users gain control over bias mitigation process

Abstract

Machine learning models built on datasets containing discriminative instances attributed to various underlying factors result in biased and unfair outcomes. It's a well founded and intuitive fact that existing bias mitigation strategies often sacrifice accuracy in order to ensure fairness. But when AI engine's prediction is used for decision making which reflects on revenue or operational efficiency such as credit risk modelling, it would be desirable by the business if accuracy can be somehow reasonably preserved. This conflicting requirement of maintaining accuracy and fairness in AI motivates our research. In this paper, we propose a fresh approach for simultaneous improvement of fairness and accuracy of ML models within a realistic paradigm. The essence of our work is a data preprocessing technique that can detect instances ascribing a specific kind of bias that should be removed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques