SingFake: Singing Voice Deepfake Detection

Yongyi Zang; You Zhang; Mojtaba Heydari; Zhiyao Duan

arXiv:2309.07525·cs.SD·February 5, 2026·1 cites

SingFake: Singing Voice Deepfake Detection

Yongyi Zang, You Zhang, Mojtaba Heydari, Zhiyao Duan

PDF

Open Access 1 Repo

TL;DR

This paper introduces SingFake, a new dataset for detecting deepfake singing voices, highlights the challenges of this task compared to speech, and evaluates existing systems showing the need for specialized detection methods.

Contribution

We present the first in-the-wild singing voice deepfake dataset, SingFake, and evaluate speech countermeasure systems, revealing their limitations and the need for dedicated singing voice deepfake detection research.

Findings

01

Speech countermeasure systems perform poorly on singing deepfakes.

02

Training on SingFake improves detection performance.

03

Challenges remain with unseen singers, codecs, languages, and musical styles.

Abstract

The rise of singing voice synthesis presents critical challenges to artists and industry stakeholders over unauthorized voice usage. Unlike synthesized speech, synthesized singing voices are typically released in songs containing strong background music that may hide synthesis artifacts. Additionally, singing voices present different acoustic and linguistic characteristics from speech utterances. These unique properties make singing voice deepfake detection a relevant but significantly different problem from synthetic speech detection. In this work, we propose the singing voice deepfake detection task. We first present SingFake, the first curated in-the-wild dataset consisting of 28.93 hours of bonafide and 29.40 hours of deepfake song clips in five languages from 40 singers. We provide a train/validation/test split where the test sets include various scenarios. We then use SingFake to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yongyizang/SingFake
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech Recognition and Synthesis · Speech and Audio Processing