One-Class Learning with Adaptive Centroid Shift for Audio Deepfake   Detection

Hyun Myung Kim; Kangwook Jang; Hoirin Kim

arXiv:2406.16716·eess.AS·June 25, 2024

One-Class Learning with Adaptive Centroid Shift for Audio Deepfake Detection

Hyun Myung Kim, Kangwook Jang, Hoirin Kim

PDF

Open Access 1 Repo

TL;DR

This paper introduces an adaptive centroid shift method for one-class learning that improves deepfake audio detection by creating robust, well-separated embeddings of bonafide speech, outperforming existing systems.

Contribution

The paper proposes a novel adaptive centroid shift technique that enhances one-class learning for audio deepfake detection, focusing solely on bonafide samples for improved robustness.

Findings

01

Achieves 2.19% EER on ASVspoof 2021 dataset

02

Effectively maps bonafide embeddings into a single cluster

03

Successfully disentangles bonafide and spoof classes

Abstract

As speech synthesis systems continue to make remarkable advances in recent years, the importance of robust deepfake detection systems that perform well in unseen systems has grown. In this paper, we propose a novel adaptive centroid shift (ACS) method that updates the centroid representation by continually shifting as the weighted average of bonafide representations. Our approach uses only bonafide samples to define their centroid, which can yield a specialized centroid for one-class learning. Integrating our ACS with one-class learning gathers bonafide representations into a single cluster, forming well-separated embeddings robust to unseen spoofing attacks. Our proposed method achieves an equal error rate (EER) of 2.19% on the ASVspoof 2021 deepfake dataset, outperforming all existing systems. Furthermore, the t-SNE visualization illustrates that our method effectively maps the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

avishai111/One-Class-Learning-with-Adaptive-Centroid-Shift-for-Audio-Deepfake-Detection
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Digital Media Forensic Detection · Speech and Audio Processing