Audio Deepfake Detection: A Survey
Jiangyan Yi, Chenglong Wang, Jianhua Tao, Xiaohui Zhang, Chu Yuan, Zhang, Yan Zhao

TL;DR
This survey comprehensively reviews audio deepfake detection methods, datasets, and challenges, highlighting the need for larger datasets, better generalization, and interpretability in future research.
Contribution
It provides a systematic overview of audio deepfake detection developments, including a unified comparison of features and classifiers across multiple datasets.
Findings
Effective detection algorithms exist but are not fully solved.
Current methods struggle with generalization to unknown attacks.
Lack of large-scale, real-world datasets hampers progress.
Abstract
Audio deepfake detection is an emerging active topic. A growing number of literatures have aimed to study deepfake detection algorithms and achieved effective performance, the problem of which is far from being solved. Although there are some review literatures, there has been no comprehensive survey that provides researchers with a systematic overview of these developments with a unified evaluation. Accordingly, in this survey paper, we first highlight the key differences across various types of deepfake audio, then outline and analyse competitions, datasets, features, classifications, and evaluation of state-of-the-art approaches. For each aspect, the basic techniques, advanced developments and major challenges are discussed. In addition, we perform a unified comparison of representative features and classifiers on ASVspoof 2021, ADD 2023 and In-the-Wild datasets for audio deepfake…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Music and Audio Processing · Anomaly Detection Techniques and Applications
