What Should Be Balanced in a "Balanced" Face Recognition Dataset?

Haiyu Wu; Kevin W. Bowyer

arXiv:2304.09818·cs.CV·August 25, 2023·1 cites

What Should Be Balanced in a "Balanced" Face Recognition Dataset?

Haiyu Wu, Kevin W. Bowyer

PDF

Open Access 1 Repo

TL;DR

This paper examines the limitations of current 'balanced' face recognition datasets and introduces a bias-aware toolkit to create more comprehensive evaluation datasets considering multiple factors affecting accuracy.

Contribution

It highlights the inadequacy of balancing only identities and images and proposes a toolkit for creating more balanced datasets across various influential factors.

Findings

01

Balancing identities and images alone does not ensure fairness.

02

Current datasets often overlook factors like pose, brightness, and quality.

03

A new toolkit helps create more balanced and less biased evaluation datasets.

Abstract

The issue of demographic disparities in face recognition accuracy has attracted increasing attention in recent years. Various face image datasets have been proposed as 'fair' or 'balanced' to assess the accuracy of face recognition algorithms across demographics. These datasets typically balance the number of identities and images across demographics. It is important to note that the number of identities and images in an evaluation dataset are {\em not} driving factors for 1-to-1 face matching accuracy. Moreover, balancing the number of identities and images does not ensure balance in other factors known to impact accuracy, such as head pose, brightness, and image quality. We demonstrate these issues using several recently proposed datasets. To improve the ability to perform less biased evaluations, we propose a bias-aware toolkit that facilitates creation of cross-demographic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

haiyuwu/ba-test-dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Demographic Trends and Gender Preferences