Towards Trustworthy Audio Deepfake Detection: A Systematic Framework for Diagnosing and Mitigating Gender Bias

Aishwarya Fursule; Shruti Kshirsagar; and Anderson R. Avila

arXiv:2605.09087·cs.SD·May 12, 2026

Towards Trustworthy Audio Deepfake Detection: A Systematic Framework for Diagnosing and Mitigating Gender Bias

Aishwarya Fursule, Shruti Kshirsagar, and Anderson R. Avila

PDF

TL;DR

This paper introduces a systematic framework for diagnosing and mitigating gender bias in audio deepfake detection systems, emphasizing the importance of identifying bias sources before applying targeted fixes.

Contribution

It presents the first diagnosis-first approach that identifies bias origins and evaluates novel mitigation strategies, including a new fairness regularisation method, on two models and a benchmark dataset.

Findings

01

Bias does not originate from training data imbalance but from acoustic and feature leakage.

02

Adjusting decision thresholds per gender significantly reduces unfairness without harming accuracy.

03

A new epoch-level fairness regularisation outperforms existing batch-level methods.

Abstract

Audio deepfake detection systems are increasingly deployed in high-stakes security applications, yet their fairness across demographic groups remains critically underexamined. Prior work measures gender disparity but does not investigate where it comes from or how to fix it systematically. We present the first diagnosis-first framework that identifies bias source before applying targeted mitigation, evaluated on two models, AASIST and Wav2Vec2+ResNet18, on ASVSpoof5. Our diagnosis shows that bias does not stem from imbalanced training data but from acoustic representation differences, gender leakage in learned features, and structural evaluation asymmetry. We test mitigation strategies across in-processing, post-processing and combined families, including novel methods introduced in this work. Adjusting the decision threshold separately per gender reduces unfairness by 54% to 75% at no…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.