MelodyGLM: Multi-task Pre-training for Symbolic Melody Generation
Xinda Wu, Zhijie Huang, Kejun Zhang, Jiaxing Yu, Xu Tan, Tieyao Zhang,, Zihao Wang, Lingyun Sun

TL;DR
MelodyGLM is a multi-task pre-training framework that effectively captures multi-scale, multi-dimensional structures in symbolic melodies, leveraging a large dataset and novel infilling tasks to improve melody generation quality.
Contribution
The paper introduces MelodyGLM, a novel multi-task pre-training approach with new infilling strategies and a large-scale melody dataset, enhancing symbolic melody generation.
Findings
Outperforms previous pre-training methods in melody tasks.
Achieves near-human quality in melody inpainting.
Improves consistency, rhythmicity, and structure in generated melodies.
Abstract
Pre-trained language models have achieved impressive results in various music understanding and generation tasks. However, existing pre-training methods for symbolic melody generation struggle to capture multi-scale, multi-dimensional structural information in note sequences, due to the domain knowledge discrepancy between text and music. Moreover, the lack of available large-scale symbolic melody datasets limits the pre-training improvement. In this paper, we propose MelodyGLM, a multi-task pre-training framework for generating melodies with long-term structure. We design the melodic n-gram and long span sampling strategies to create local and global blank infilling tasks for modeling the local and global structures in melodies. Specifically, we incorporate pitch n-grams, rhythm n-grams, and their combined n-grams into the melodic n-gram blank infilling tasks for modeling the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
MethodsInpainting
