Trusted Fake Audio Detection Based on Dirichlet Distribution
Chi Ding, Junxiao Xue, Cong Wang, Hao Zhou

TL;DR
This paper introduces a fake audio detection method that models decision trustworthiness using the Dirichlet distribution, improving reliability and robustness in identifying fake speech across multiple datasets.
Contribution
The paper proposes a novel fake audio detection approach that incorporates Dirichlet distribution to model uncertainty and trustworthiness of model decisions.
Findings
High accuracy on ASVspoof datasets
Enhanced robustness against various attacks
Improved trustworthiness of detection decisions
Abstract
With the continuous development of deep learning-based speech conversion and speech synthesis technologies, the cybersecurity problem posed by fake audio has become increasingly serious. Previously proposed models for defending against fake audio have attained remarkable performance. However, they all fall short in modeling the trustworthiness of the decisions made by the models themselves. Based on this, we put forward a plausible fake audio detection approach based on the Dirichlet distribution with the aim of enhancing the reliability of fake audio detection. Specifically, we first generate evidence through a neural network. Uncertainty is then modeled using the Dirichlet distribution. By modeling the belief distribution with the parameters of the Dirichlet distribution, an estimate of uncertainty can be obtained for each decision. Finally, the predicted probabilities and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Advanced Steganography and Watermarking Techniques · Chaos-based Image/Signal Encryption
