# M6: multi-generator, multi-domain, multi-lingual and cultural, multi-genres, multi-instrument machine-generated music detection databases

**Authors:** Yupei Li, Hanqian Li, Lucia Specia, Bjorn Schuller

PMC · DOI: 10.1038/s41598-026-36044-w · 2026-02-13

## TL;DR

This paper introduces M6, a diverse dataset for detecting machine-generated music across various genres, languages, and instruments.

## Contribution

The novel contribution is the creation of a comprehensive and multi-faceted benchmark dataset for machine-generated music detection.

## Key findings

- M6 includes multiple generators, domains, languages, cultural contexts, genres, and instruments.
- Baseline performance scores show the complexity of detecting machine-generated music.
- The dataset is publicly available to support future research and collaboration.

## Abstract

Machine-generated music (MGM) has become a powerful tool with applications in music therapy, personalised editing, and creative inspiration. However, its unregulated use threatens the entertainment, education, and arts sectors by diminishing the value of high-quality human compositions. Effective detection of machine-generated music (MGMD) is essential, yet progress is hindered by the lack of comprehensive datasets. To address this gap, we introduce M6, a large-scope benchmark dataset designed for MGMD research. M6 is distinguished by its diversity, encompassing multiple generators, domains, languages, cultural contexts, genres, and instruments, all provided in WAV format. We detail the data collection methodology and analysis, alongside baseline performance scores from foundational binary classification models, highlighting the complexity of MGMD and the need for further advancements. M6 serves as a robust resource to support future research in developing effective detection methods. The dataset is available at https://huggingface.co/datasets/yl7622/M6 to promote collaboration and innovation.

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13000184/full.md

---
Source: https://tomesphere.com/paper/PMC13000184