Audio Deepfake Attribution: An Initial Dataset and Investigation
Xinrui Yan, Jiangyan Yi, Jianhua Tao, Jie Chen

TL;DR
This paper introduces the first dataset for audio deepfake attribution and proposes a novel method, CRML, to identify the source of deepfake audio, including unknown tools, in real-world scenarios.
Contribution
It presents a new dataset for audio deepfake attribution and a novel CRML method for open-set identification of audio generation tools.
Findings
CRML effectively addresses open-set risks in real-world scenarios.
The dataset enables comprehensive evaluation of attribution methods.
CRML improves discriminative representation learning for known and unknown classes.
Abstract
The rapid progress of deep speech synthesis models has posed significant threats to society such as malicious manipulation of content. This has led to an increase in studies aimed at detecting so-called deepfake audio. However, existing works focus on the binary detection of real audio and fake audio. In real-world scenarios such as model copyright protection and digital evidence forensics, binary classification alone is insufficient. It is essential to identify the source of deepfake audio. Therefore, audio deepfake attribution has emerged as a new challenge. To this end, we designed the first deepfake audio dataset for the attribution of audio generation tools, called Audio Deepfake Attribution (ADA), and conducted a comprehensive investigation on system fingerprints. To address the challenges of attribution of continuously emerging unknown audio generation tools in the real world, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Digital Media Forensic Detection · Speech Recognition and Synthesis
Methods*Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Residual Connection · Bottleneck Residual Block · Kaiming Initialization · Convolution · Average Pooling · Max Pooling · Residual Block
