WaveFake: A Data Set to Facilitate Audio Deepfake Detection

Joel Frank; Lea Sch\"onherr

arXiv:2111.02813·cs.LG·November 5, 2021·45 cites

WaveFake: A Data Set to Facilitate Audio Deepfake Detection

Joel Frank, Lea Sch\"onherr

PDF

Open Access 2 Repos 1 Models

TL;DR

This paper introduces WaveFake, a new audio dataset from multiple generative models and languages, along with baseline detection models, to advance research in audio deepfake detection.

Contribution

It provides a comprehensive audio dataset, signal processing techniques, and baseline models to address the gap in audio deepfake detection research.

Findings

01

Dataset includes nine sample sets from five architectures in two languages

02

Baseline models demonstrate initial detection capabilities

03

Provides foundational tools for future audio deepfake research

Abstract

Deep generative modeling has the potential to cause significant harm to society. Recognizing this threat, a magnitude of research into detecting so-called "Deepfakes" has emerged. This research most often focuses on the image domain, while studies exploring generated audio signals have, so-far, been neglected. In this paper we make three key contributions to narrow this gap. First, we provide researchers with an introduction to common signal processing techniques used for analyzing audio signals. Second, we present a novel data set, for which we collected nine sample sets from five different network architectures, spanning two languages. Finally, we supply practitioners with two baseline models, adopted from the signal processing community, to facilitate further research in this area.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

🤗
ash56/ssl-aasist
model· 31 dl· ♡ 3
31 dl♡ 3

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Digital Media Forensic Detection