Do the Machine Learning Models on a Crowd Sourced Platform Exhibit Bias?   An Empirical Study on Model Fairness

Sumon Biswas; Hridesh Rajan

arXiv:2005.12379·cs.LG·September 23, 2020

Do the Machine Learning Models on a Crowd Sourced Platform Exhibit Bias? An Empirical Study on Model Fairness

Sumon Biswas, Hridesh Rajan

PDF

2 Repos

TL;DR

This empirical study evaluates the fairness of 40 top machine learning models from Kaggle across five tasks, analyzing bias, mitigation techniques, and their impact on performance, revealing practical challenges and future research directions.

Contribution

The paper provides a comprehensive benchmark of real-world models' fairness, evaluates mitigation techniques, and highlights practical issues in applying fairness algorithms.

Findings

01

Some optimization techniques induce unfairness.

02

Fairness control mechanisms are often undocumented.

03

Post-processing mitigation is costly, pre-processing is preferred.

Abstract

Machine learning models are increasingly being used in important decision-making software such as approving bank loans, recommending criminal sentencing, hiring employees, and so on. It is important to ensure the fairness of these models so that no discrimination is made based on protected attribute (e.g., race, sex, age) while decision making. Algorithms have been developed to measure unfairness and mitigate them to a certain extent. In this paper, we have focused on the empirical evaluation of fairness and mitigations on real-world machine learning models. We have created a benchmark of 40 top-rated models from Kaggle used for 5 different tasks, and then using a comprehensive set of fairness metrics, evaluated their fairness. Then, we have applied 7 mitigation techniques on these models and analyzed the fairness, mitigation results, and impacts on performance. We have found that some…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.