Gender Slopes: Counterfactual Fairness for Computer Vision Models by Attribute Manipulation
Jungseock Joo, Kimmo K\"arkk\"ainen

TL;DR
This paper introduces a method using attribute manipulation to evaluate and diagnose biases related to gender and race in computer vision models, revealing skewed representations and potential sources of bias.
Contribution
It presents a novel approach employing image synthesis for counterfactual fairness assessment in black-box vision models, linking biases to societal stereotypes.
Findings
Commercial classifiers are influenced by gender and racial cues.
Skewed gender representations are found in online profession searches.
Biases may originate from societal stereotypes reflected in training data.
Abstract
Automated computer vision systems have been applied in many domains including security, law enforcement, and personal devices, but recent reports suggest that these systems may produce biased results, discriminating against people in certain demographic groups. Diagnosing and understanding the underlying true causes of model biases, however, are challenging tasks because modern computer vision systems rely on complex black-box models whose behaviors are hard to decode. We propose to use an encoder-decoder network developed for image attribute manipulation to synthesize facial images varying in the dimensions of gender and race while keeping other signals intact. We use these synthesized images to measure counterfactual fairness of commercial computer vision classifiers by examining the degree to which these classifiers are affected by gender and racial cues controlled in the images,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
