MuseBarControl: Enhancing Fine-Grained Control in Symbolic Music Generation through Pre-Training and Counterfactual Loss
Yangyang Shu, Haiming Xu, Ziqin Zhou, Anton van den Hengel, Lingqiao, Liu

TL;DR
This paper introduces MuseBarControl, a method that improves fine-grained bar-level control in symbolic music generation by using pre-training tasks and a counterfactual loss, resulting in more precise and quality-preserving outputs.
Contribution
It proposes a novel pre-training task and counterfactual loss to enable better bar-level control in symbolic music generation, overcoming limitations of traditional fine-tuning methods.
Findings
13.06% improvement in control accuracy
Enhanced control without sacrificing musical quality
Effective alignment of control signals with musical tokens
Abstract
Automatically generating symbolic music-music scores tailored to specific human needs-can be highly beneficial for musicians and enthusiasts. Recent studies have shown promising results using extensive datasets and advanced transformer architectures. However, these state-of-the-art models generally offer only basic control over aspects like tempo and style for the entire composition, lacking the ability to manage finer details, such as control at the level of individual bars. While fine-tuning a pre-trained symbolic music generation model might seem like a straightforward method for achieving this finer control, our research indicates challenges in this approach. The model often fails to respond adequately to new, fine-grained bar-level control signals. To address this, we propose two innovative solutions. First, we introduce a pre-training task designed to link control signals directly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Neuroscience and Music Perception · Music and Audio Processing
