MMT-BERT: Chord-aware Symbolic Music Generation Based on Multitrack Music Transformer and MusicBERT
Jinlong Zhu, Keigo Sakurai, Ren Togo, Takahiro Ogawa, Miki Haseyama

TL;DR
This paper introduces MMT-BERT, a novel framework for symbolic multitrack music generation that leverages a new music representation, a chord analysis model, and a fine-tuned MusicBERT discriminator to improve musical coherence and quality.
Contribution
It presents a new symbolic music representation with MusicLang, adapts MusicBERT for generation, and fine-tunes it as a discriminator, addressing data and architecture challenges in symbolic music synthesis.
Findings
Outperforms state-of-the-art methods in music quality.
Enhances musical consonance and human-likeness.
Demonstrates robustness across diverse musical styles.
Abstract
We propose a novel symbolic music representation and Generative Adversarial Network (GAN) framework specially designed for symbolic multitrack music generation. The main theme of symbolic music generation primarily encompasses the preprocessing of music data and the implementation of a deep learning framework. Current techniques dedicated to symbolic music generation generally encounter two significant challenges: training data's lack of information about chords and scales and the requirement of specially designed model architecture adapted to the unique format of symbolic music representation. In this paper, we solve the above problems by introducing new symbolic music representation with MusicLang chord analysis model. We propose our MMT-BERT architecture adapting to the representation. To build a robust multitrack music generator, we fine-tune a pre-trained MusicBERT model to serve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Human Motion and Animation
