SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

You Zhang; Yongyi Zang; Jiatong Shi; Ryuichi Yamamoto; Tomoki Toda; Zhiyao Duan

arXiv:2408.16132·eess.AS·January 27, 2026

SVDD 2024: The Inaugural Singing Voice Deepfake Detection Challenge

You Zhang, Yongyi Zang, Jiatong Shi, Ryuichi Yamamoto, Tomoki Toda, Zhiyao Duan

PDF

Open Access 1 Repo

TL;DR

The paper introduces the inaugural Singing Voice Deepfake Detection Challenge, aiming to improve AI-generated singing voice detection through controlled and real-world scenarios, with significant participation and promising results.

Contribution

It establishes a new benchmark for singing voice deepfake detection with two distinct tracks and provides baseline results and insights for future research.

Findings

01

Top team achieved 1.65% EER in controlled track

02

47 teams participated in the challenge

03

Baseline models were benchmarked on wild data

Abstract

With the advancements in singing voice generation and the growing presence of AI singers on media platforms, the inaugural Singing Voice Deepfake Detection (SVDD) Challenge aims to advance research in identifying AI-generated singing voices from authentic singers. This challenge features two tracks: a controlled setting track (CtrSVDD) and an in-the-wild scenario track (WildSVDD). The CtrSVDD track utilizes publicly available singing vocal data to generate deepfakes using state-of-the-art singing voice synthesis and conversion systems. Meanwhile, the WildSVDD track expands upon the existing SingFake dataset, which includes data sourced from popular user-generated content websites. For the CtrSVDD track, we received submissions from 47 teams, with 37 surpassing our baselines and the top team achieving a 1.65% equal error rate. For the WildSVDD track, we benchmarked the baselines. This…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

svddchallenge/ctrsvdd2024_baseline
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Speech Recognition and Synthesis