Advancing Continual Learning for Robust Deepfake Audio Classification

Feiyi Dong; Qingchen Tang; Yichen Bai; Zihan Wang

arXiv:2407.10108·eess.AS·July 16, 2024

Advancing Continual Learning for Robust Deepfake Audio Classification

Feiyi Dong, Qingchen Tang, Yichen Bai, Zihan Wang

PDF

Open Access

TL;DR

This paper presents CADE, a continual learning approach for deepfake audio detection that uses memory, distillation, and embedding similarity losses to improve robustness against unseen spoofing attacks.

Contribution

Introducing CADE, a novel continual learning method with memory and multiple loss functions to enhance deepfake audio classification without extensive retraining.

Findings

01

CADE outperforms baseline methods on ASVspoof2019 dataset.

02

Memory-efficient approach preserves old knowledge effectively.

03

Embedding similarity loss improves positive sample alignment.

Abstract

The emergence of new spoofing attacks poses an increasing challenge to audio security. Current detection methods often falter when faced with unseen spoofing attacks. Traditional strategies, such as retraining with new data, are not always feasible due to extensive storage. This paper introduces a novel continual learning method Continual Audio Defense Enhancer (CADE). First, by utilizing a fixed memory size to store randomly selected samples from previous datasets, our approach conserves resources and adheres to privacy constraints. Additionally, we also apply two distillation losses in CADE. By distillation in classifiers, CADE ensures that the student model closely resembles that of the teacher model. This resemblance helps the model retain old information while facing unseen data. We further refine our model's performance with a novel embedding similarity loss that extends across…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWater Systems and Optimization · Music and Audio Processing · Speech and Audio Processing