FACET: Fairness in Computer Vision Evaluation Benchmark
Laura Gustafson, Chloe Rolland, Nikhila Ravi, Quentin Duval, Aaron, Adcock, Cheng-Yang Fu, Melissa Hall, Candace Ross

TL;DR
FACET is a comprehensive benchmark dataset designed to evaluate fairness in computer vision models across demographic attributes, revealing existing disparities and promoting development of more equitable vision systems.
Contribution
Introduces FACET, a large annotated dataset for assessing fairness in vision tasks, enabling detailed analysis of demographic disparities in model performance.
Findings
Models show performance disparities across demographic groups.
Intersectional attributes reveal compounded biases.
Benchmark facilitates development of fairer vision models.
Abstract
Computer vision models have known performance disparities across attributes such as gender and skin tone. This means during tasks such as classification and detection, model performance differs for certain classes based on the demographics of the people in the image. These disparities have been shown to exist, but until now there has not been a unified approach to measure these differences for common use-cases of computer vision models. We present a new benchmark named FACET (FAirness in Computer Vision EvaluaTion), a large, publicly available evaluation set of 32k images for some of the most common vision tasks - image classification, object detection and segmentation. For every image in FACET, we hired expert reviewers to manually annotate person-related attributes such as perceived skin tone and hair type, manually draw bounding boxes and label fine-grained person-related classes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
