Advancing Continual Learning for Robust Deepfake Audio Classification
Feiyi Dong, Qingchen Tang, Yichen Bai, Zihan Wang

TL;DR
This paper presents CADE, a continual learning approach for deepfake audio detection that uses memory, distillation, and embedding similarity losses to improve robustness against unseen spoofing attacks.
Contribution
Introducing CADE, a novel continual learning method with memory and multiple loss functions to enhance deepfake audio classification without extensive retraining.
Findings
CADE outperforms baseline methods on ASVspoof2019 dataset.
Memory-efficient approach preserves old knowledge effectively.
Embedding similarity loss improves positive sample alignment.
Abstract
The emergence of new spoofing attacks poses an increasing challenge to audio security. Current detection methods often falter when faced with unseen spoofing attacks. Traditional strategies, such as retraining with new data, are not always feasible due to extensive storage. This paper introduces a novel continual learning method Continual Audio Defense Enhancer (CADE). First, by utilizing a fixed memory size to store randomly selected samples from previous datasets, our approach conserves resources and adheres to privacy constraints. Additionally, we also apply two distillation losses in CADE. By distillation in classifiers, CADE ensures that the student model closely resembles that of the teacher model. This resemblance helps the model retain old information while facing unseen data. We further refine our model's performance with a novel embedding similarity loss that extends across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater Systems and Optimization · Music and Audio Processing · Speech and Audio Processing
