Fair and Interpretable Deepfake Detection in Videos
Akihito Yoshii, Ryosuke Sonoda, Ramya Srinivasan

TL;DR
This paper introduces a fairness-aware deepfake detection framework that combines temporal feature learning, demographic-aware data augmentation, and interpretability techniques to improve bias mitigation and reliability across diverse groups.
Contribution
It presents a novel framework integrating temporal modeling, demographic-aware augmentation, and concept extraction for fairer, more interpretable deepfake detection.
Findings
Achieves the best tradeoff between fairness and accuracy on multiple datasets.
Effectively mitigates bias across demographic groups.
Enhances interpretability for non-expert users.
Abstract
Existing deepfake detection methods often exhibit bias, lack transparency, and fail to capture temporal information, leading to biased decisions and unreliable results across different demographic groups. In this paper, we propose a fairness-aware deepfake detection framework that integrates temporal feature learning and demographic-aware data augmentation to enhance fairness and interpretability. Our method leverages sequence-based clustering for temporal modeling of deepfake videos and concept extraction to improve detection reliability while also facilitating interpretable decisions for non-expert users. Additionally, we introduce a demography-aware data augmentation method that balances underrepresented groups and applies frequency-domain transformations to preserve deepfake artifacts, thereby mitigating bias and improving generalization. Extensive experiments on FaceForensics++,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning
