Content-based Controls For Music Large Language Modeling
Liwei Lin, Gus Xia, Junyan Jiang, and Yixiao Zhang

TL;DR
This paper introduces Coco-Mulla, a content-based control method for music large language models that enables direct manipulation of musical features like pitch and rhythm, using a parameter-efficient fine-tuning approach.
Contribution
It presents a novel, resource-efficient fine-tuning technique for Transformer-based music models that enhances content-based control capabilities with minimal data and parameters.
Findings
High-quality music generation with less than 4% of original parameters.
Effective control over chords and rhythms in generated music.
Flexible music variation and arrangement through combined controls.
Abstract
Recent years have witnessed a rapid growth of large-scale language models in the domain of music audio. Such models enable end-to-end generation of higher-quality music, and some allow conditioned generation using text descriptions. However, the control power of text controls on music is intrinsically limited, as they can only describe music indirectly through meta-data (such as singers and instruments) or high-level representations (such as genre and emotion). We aim to further equip the models with direct and content-based controls on innate music languages such as pitch, chords and drum track. To this end, we contribute Coco-Mulla, a content-based control method for music large language modeling. It uses a parameter-efficient fine-tuning (PEFT) method tailored for Transformer-based audio models. Experiments show that our approach achieved high-quality music generation with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech Recognition and Synthesis
