A Fourier Explanation of AI-music Artifacts
Darius Afchar, Gabriel Meseguer-Brocal, Kamil Akesbi, Romain Hennequin

TL;DR
This paper reveals that generative AI music models produce frequency artifacts due to their architecture, and proposes a simple detection method that achieves over 99% accuracy in identifying AI-generated music.
Contribution
It provides a mathematical analysis of frequency artifacts in AI music models and introduces an effective, interpretable detection criterion based on spectral peaks.
Findings
Frequency artifacts are inherent to deconvolution modules in generative models.
The proposed detection method surpasses 99% accuracy in various scenarios.
Artifacts are linked to model architecture, not training data or weights.
Abstract
The rapid rise of generative AI has transformed music creation, with millions of users engaging in AI-generated music. Despite its popularity, concerns regarding copyright infringement, job displacement, and ethical implications have led to growing scrutiny and legal challenges. In parallel, AI-detection services have emerged, yet these systems remain largely opaque and privately controlled, mirroring the very issues they aim to address. This paper explores the fundamental properties of synthetic content and how it can be detected. Specifically, we analyze deconvolution modules commonly used in generative models and mathematically prove that their outputs exhibit systematic frequency artifacts -- manifesting as small yet distinctive spectral peaks. This phenomenon, related to the well-known checkerboard artifact, is shown to be inherent to a chosen model architecture rather than a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies
