Metrics for Dataset Demographic Bias: A Case Study on Facial Expression Recognition
Iris Dominguez-Catena, Daniel Paternain, Mikel Galar

TL;DR
This paper reviews and classifies metrics for measuring demographic bias in datasets, applies them to facial expression recognition datasets, and identifies a minimal set of effective metrics for bias assessment.
Contribution
It introduces a taxonomy for bias metrics, provides a practical guide for their selection, and demonstrates their application through a case study on FER datasets.
Findings
Many metrics are redundant in bias measurement.
A reduced subset of metrics can effectively measure demographic bias.
Insights help improve fairness in AI models.
Abstract
Demographic biases in source datasets have been shown as one of the causes of unfairness and discrimination in the predictions of Machine Learning models. One of the most prominent types of demographic bias are statistical imbalances in the representation of demographic groups in the datasets. In this paper, we study the measurement of these biases by reviewing the existing metrics, including those that can be borrowed from other disciplines. We develop a taxonomy for the classification of these metrics, providing a practical guide for the selection of appropriate metrics. To illustrate the utility of our framework, and to further understand the practical characteristics of the metrics, we conduct a case study of 20 datasets used in Facial Emotion Recognition (FER), analyzing the biases present in them. Our experimental results show that many metrics are redundant and that a reduced…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Domain Adaptation and Few-Shot Learning · Emotion and Mood Recognition
