FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age
Kimmo K\"arkk\"ainen, Jungseock Joo

TL;DR
FairFace introduces a balanced face dataset with diverse race, gender, and age groups to improve fairness and accuracy in face analysis models across different demographic groups.
Contribution
The paper presents a new, balanced face dataset with 108,501 images across seven race groups, addressing bias in existing datasets and enhancing model fairness.
Findings
Models trained on FairFace are more accurate on diverse datasets.
FairFace reduces racial and gender bias in face attribute recognition.
Improved generalization performance across demographic groups.
Abstract
Existing public face datasets are strongly biased toward Caucasian faces, and other races (e.g., Latino) are significantly underrepresented. This can lead to inconsistent model accuracy, limit the applicability of face analytic systems to non-White race groups, and adversely affect research findings based on such skewed data. To mitigate the race bias in these datasets, we construct a novel face image dataset, containing 108,501 images, with an emphasis of balanced race composition in the dataset. We define 7 race groups: White, Black, Indian, East Asian, Southeast Asian, Middle East, and Latino. Images were collected from the YFCC-100M Flickr dataset and labeled with race, gender, and age groups. Evaluations were performed on existing face attribute datasets as well as novel image datasets to measure generalization performance. We find that the model trained from our dataset is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗openai/clip-vit-large-patch14model· 24.8M dl· ♡ 198224.8M dl♡ 1982
- 🤗openai/clip-vit-base-patch32model· 20.3M dl· ♡ 89520.3M dl♡ 895
- 🤗openai/clip-vit-base-patch16model· 1.9M dl· ♡ 1541.9M dl♡ 154
- 🤗google/paligemma-3b-pt-224model· 86k dl· ♡ 42686k dl♡ 426
- 🤗google/paligemma-3b-mix-448model· 2.9k dl· ♡ 1162.9k dl♡ 116
- 🤗SaulLu/clip-vit-base-patch32model· 2 dl2 dl
- 🤗timm/vit_base_patch16_clip_224.openaimodel· 143k dl· ♡ 11143k dl♡ 11
- 🤗timm/vit_base_patch32_clip_224.openaimodel· 2.2k dl2.2k dl
- 🤗timm/vit_large_patch14_clip_224.openaimodel· 16k dl· ♡ 216k dl♡ 2
- 🤗mattmdjaga/clip-vit-base-patch32_handlermodel· 3 dl3 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Biometric Identification and Security · Facial Nerve Paralysis Treatment and Research
