An Examination of Fairness of AI Models for Deepfake Detection
Loc Trinh, Yan Liu

TL;DR
This paper investigates racial and gender biases in deepfake detection models, revealing significant disparities in performance linked to dataset composition and the creation methods of deepfakes, which can lead to unfair discrimination.
Contribution
It provides a comprehensive analysis of bias in deepfake datasets and detection models, highlighting the impact of dataset composition and deepfake generation techniques on model fairness.
Findings
Large performance disparities across racial groups, up to 10.7% error difference.
Dataset biases, such as overrepresentation of Caucasian females, influence model fairness.
Deepfake creation methods contribute to spurious correlations affecting detection accuracy.
Abstract
Recent studies have demonstrated that deep learning models can discriminate based on protected classes like race and gender. In this work, we evaluate bias present in deepfake datasets and detection models across protected subgroups. Using facial datasets balanced by race and gender, we examine three popular deepfake detectors and find large disparities in predictive performances across races, with up to 10.7% difference in error rate between subgroups. A closer look reveals that the widely used FaceForensics++ dataset is overwhelmingly composed of Caucasian subjects, with the majority being female Caucasians. Our investigation of the racial distribution of deepfakes reveals that the methods used to create deepfakes as positive training signals tend to produce "irregular" faces - when a person's face is swapped onto another person of a different race or gender. This causes detectors to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Generative Adversarial Networks and Image Synthesis · Advanced Image Processing Techniques
