Open-Set Source Tracing of Audio Deepfake Systems

Nicholas Klein; Hemlata Tak; Elie Khoury

arXiv:2507.06470 · eess.AS · July 10, 2025

Open Access

TL;DR

This paper addresses the challenge of open-set source tracing for audio deepfake systems, proposing a novel energy score adaptation and training methods that significantly improve detection performance against unseen deepfake sources.

Contribution

The paper introduces softmax energy (SME), a novel adaptation of the energy score for out-of-distribution (OOD) detection, and demonstrates its effectiveness in open-set source tracing of audio deepfake systems.

Findings

01 Replacing the typical temperature-scaled energy score with SME gives a relative average improvement of 31% in FPR95

02 SME-guided training reduces FPR95 to 8.3%

03 Augmentation techniques enhance open-set detection robustness

Abstract

Existing research on source tracing of audio deepfake systems has focused primarily on the closed-set scenario, while studies that evaluate open-set performance are limited to a small number of unseen systems. Due to the large number of emerging audio deepfake systems, robust open-set source tracing is critical. We leverage the protocol of the Interspeech 2025 special session on source tracing to evaluate methods for improving open-set source tracing performance. We introduce a novel adaptation to the energy score for out-of-distribution (OOD) detection, softmax energy (SME). We find that replacing the typical temperature-scaled energy score with SME provides a relative average improvement of 31% in the standard FPR95 (false positive rate at true positive rate of 95%) measure. We further explore SME-guided training as well as copy synthesis, codec, and reverberation augmentations,…
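The abstract refers to two standard quantities: the temperature-scaled energy score, commonly written as E(x; T) = -T · log Σ_i exp(f_i(x) / T) over the classifier logits f_i(x), and FPR95. The sketch below illustrates only that baseline score and the metric. It does not implement SME, whose definition is not given in this summary. The function names, the use of negative energy as the in-distribution score, and the percentile-based threshold are illustrative assumptions, not the authors' code.

```python
import numpy as np


def energy_score(logits, temperature=1.0):
    """Temperature-scaled energy: E(x; T) = -T * log(sum_i exp(f_i(x) / T)).

    Lower energy suggests the input looks more like a known (seen) class.
    """
    z = np.asarray(logits, dtype=float) / temperature
    # Subtract the per-row maximum for a numerically stable log-sum-exp.
    m = z.max(axis=-1, keepdims=True)
    lse = m.squeeze(-1) + np.log(np.exp(z - m).sum(axis=-1))
    return -temperature * lse


def fpr_at_95_tpr(id_scores, ood_scores):
    """False positive rate at a 95% true positive rate.

    Assumes higher scores mean "more in-distribution" and treats
    in-distribution samples as the positive class.
    """
    id_scores = np.asarray(id_scores, dtype=float)
    ood_scores = np.asarray(ood_scores, dtype=float)
    # Threshold chosen so that about 95% of in-distribution scores lie above it.
    threshold = np.percentile(id_scores, 5)
    # Fraction of OOD samples wrongly accepted as in-distribution.
    return float(np.mean(ood_scores >= threshold))


# Hypothetical usage with synthetic logits (not data from the paper).
rng = np.random.default_rng(0)
seen_logits = rng.normal(size=(1000, 8)) + 4.0 * np.eye(8)[rng.integers(0, 8, 1000)]
unseen_logits = rng.normal(size=(1000, 8))
fpr95 = fpr_at_95_tpr(-energy_score(seen_logits), -energy_score(unseen_logits))
print(f"FPR95 on synthetic data: {fpr95:.3f}")
```

Negating the energy turns it into a score where larger values indicate a seen source system, which matches the convention `fpr_at_95_tpr` assumes. A lower FPR95 means fewer unseen systems are mistaken for known ones at the same 95% acceptance rate for known systems.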

Taxonomy

Topics: Music and Audio Processing · Digital Media Forensic Detection · Speech and Audio Processing