MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack   Music Transformer and MusicBERT

Jinlong Zhu; Keigo Sakurai; Ren Togo; Takahiro Ogawa; Miki Haseyama

arXiv:2409.00919·cs.SD·September 4, 2024

MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT

Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama

PDF

Open Access

TL;DR

This paper introduces MMT-BERT, a novel framework for symbolic multitrack music generation that leverages a new music representation, a chord analysis model, and a fine-tuned MusicBERT discriminator to improve musical coherence and quality.

Contribution

It presents a new symbolic music representation with MusicLang, adapts MusicBERT for generation, and fine-tunes it as a discriminator, addressing data and architecture challenges in symbolic music synthesis.

Findings

01

Outperforms state-of-the-art methods in music quality.

02

Enhances musical consonance and human-likeness.

03

Demonstrates robustness across diverse musical styles.

Abstract

We propose a novel symbolic music representation and Generative Adversarial Network (GAN) framework specially designed for symbolic multitrack music generation. The main theme of symbolic music generation primarily encompasses the preprocessing of music data and the implementation of a deep learning framework. Current techniques dedicated to symbolic music generation generally encounter two significant challenges: training data's lack of information about chords and scales and the requirement of specially designed model architecture adapted to the unique format of symbolic music representation. In this paper, we solve the above problems by introducing new symbolic music representation with MusicLang chord analysis model. We propose our MMT-BERT architecture adapting to the representation. To build a robust multitrack music generator, we fine-tune a pre-trained MusicBERT model to serve…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Human Motion and Animation