From Audio Deepfake Detection to AI-Generated Music Detection -- A   Pathway and Overview

Yupei Li; Manuel Milling; Lucia Specia; Bj\"orn W. Schuller

arXiv:2412.00571·cs.SD·December 11, 2024

From Audio Deepfake Detection to AI-Generated Music Detection -- A Pathway and Overview

Yupei Li, Manuel Milling, Lucia Specia, Bj\"orn W. Schuller

PDF

Open Access

TL;DR

This paper reviews AI-generated music detection methods, explores how audio deepfake detection techniques can be adapted for music, and discusses future research directions to address challenges in the field.

Contribution

It provides a comprehensive overview of AIGM detection, proposes leveraging foundation models from deepfake audio detection, and discusses future research pathways.

Findings

01

Overview of existing AIGM detection methods

02

Proposal to adapt deepfake audio detection techniques for music

03

Discussion of future challenges and research directions

Abstract

As Artificial Intelligence (AI) technologies continue to evolve, their use in generating realistic, contextually appropriate content has expanded into various domains. Music, an art form and medium for entertainment, deeply rooted into human culture, is seeing an increased involvement of AI into its production. However, despite the effective application of AI music generation (AIGM) tools, the unregulated use of them raises concerns about potential negative impacts on the music industry, copyright and artistic integrity, underscoring the importance of effective AIGM detection. This paper provides an overview of existing AIGM detection methods. To lay a foundation to the general workings and challenges of AIGM detection, we first review general principles of AIGM, including recent advancements in deepfake audios, as well as multimodal detection techniques. We further propose a potential…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing