TunesFormer: Forming Irish Tunes with Control Codes by Bar Patching
Shangda Wu, Xiaobing Li, Feng Yu, Maosong Sun

TL;DR
TunesFormer is a Transformer-based model that efficiently generates Irish melodies conforming to user-defined musical forms using bar patching and control codes, offering faster performance and controllability.
Contribution
Introduces a novel dual-decoder Transformer model with bar patching and control codes for form-specific melody generation, trained on a large Irish tune dataset.
Findings
TunesFormer is 3.22 times faster than GPT-2.
It maintains comparable controllability and quality.
The model effectively generates melodies adhering to specified musical forms.
Abstract
This paper introduces TunesFormer, an efficient Transformer-based dual-decoder model specifically designed for the generation of melodies that adhere to user-defined musical forms. Trained on 214,122 Irish tunes, TunesFormer utilizes techniques including bar patching and control codes. Bar patching reduces sequence length and generation time, while control codes guide TunesFormer in producing melodies that conform to desired musical forms. Our evaluation demonstrates TunesFormer's superior efficiency, being 3.22 times faster than GPT-2 and 1.79 times faster than a model with linear complexity of equal scale while offering comparable performance in controllability and other metrics. TunesFormer provides a novel tool for musicians, composers, and music enthusiasts alike to explore the vast landscape of Irish music. Our model and code are available at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Diverse Musicological Studies
