Fair and Interpretable Deepfake Detection in Videos

Akihito Yoshii; Ryosuke Sonoda; Ramya Srinivasan

arXiv:2510.17264·cs.CV·October 21, 2025

Fair and Interpretable Deepfake Detection in Videos

Akihito Yoshii, Ryosuke Sonoda, Ramya Srinivasan

PDF

Open Access

TL;DR

This paper introduces a fairness-aware deepfake detection framework that combines temporal feature learning, demographic-aware data augmentation, and interpretability techniques to improve bias mitigation and reliability across diverse groups.

Contribution

It presents a novel framework integrating temporal modeling, demographic-aware augmentation, and concept extraction for fairer, more interpretable deepfake detection.

Findings

01

Achieves the best tradeoff between fairness and accuracy on multiple datasets.

02

Effectively mitigates bias across demographic groups.

03

Enhances interpretability for non-expert users.

Abstract

Existing deepfake detection methods often exhibit bias, lack transparency, and fail to capture temporal information, leading to biased decisions and unreliable results across different demographic groups. In this paper, we propose a fairness-aware deepfake detection framework that integrates temporal feature learning and demographic-aware data augmentation to enhance fairness and interpretability. Our method leverages sequence-based clustering for temporal modeling of deepfake videos and concept extraction to improve detection reliability while also facilitating interpretable decisions for non-expert users. Additionally, we introduce a demography-aware data augmentation method that balances underrepresented groups and applies frequency-domain transformations to preserve deepfake artifacts, thereby mitigating bias and improving generalization. Extensive experiments on FaceForensics++,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning