Identifying Bias in AI using Simulation

Daniel McDuff; Roger Cheng; Ashish Kapoor

arXiv:1810.00471·cs.LG·October 2, 2018·21 cites

Identifying Bias in AI using Simulation

Daniel McDuff, Roger Cheng, Ashish Kapoor

PDF

Open Access

TL;DR

This paper introduces a simulation-based framework using Bayesian search to identify and diagnose demographic biases in machine learning classifiers, demonstrated on face detection APIs.

Contribution

It presents a novel approach leveraging high-fidelity simulations and Bayesian search to efficiently detect biases in ML models, improving bias diagnosis methods.

Findings

01

Effective identification of demographic biases in face detection APIs

02

Framework reduces time to diagnose biases compared to traditional methods

03

Demonstrates the utility of simulation in bias detection for ML models

Abstract

Machine learned models exhibit bias, often because the datasets used to train them are biased. This presents a serious problem for the deployment of such technology, as the resulting models might perform poorly on populations that are minorities within the training set and ultimately present higher risks to them. We propose to use high-fidelity computer simulations to interrogate and diagnose biases within ML classifiers. We present a framework that leverages Bayesian parameter search to efficiently characterize the high dimensional feature space and more quickly identify weakness in performance. We apply our approach to an example domain, face detection, and show that it can be used to help identify demographic biases in commercial face application programming interfaces (APIs).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning