Exploring Adapter Design Tradeoffs for Low Resource Music Generation
Atharva Mehta, Shivam Chauhan, Monojit Choudhury

TL;DR
This paper investigates how different adapter configurations affect low-resource music generation, revealing trade-offs between local detail capture and long-range dependency preservation, with insights into computational efficiency and model performance.
Contribution
It systematically studies adapter design choices for music generation models, providing guidelines for optimal configurations balancing quality, diversity, and resource use.
Findings
Convolution-based adapters excel in capturing local musical details.
Transformer-based adapters better preserve long-range dependencies.
Mid-sized adapters (40M parameters) offer optimal balance between expressivity and quality.
Abstract
Fine-tuning large-scale music generation models, such as MusicGen and Mustango, is a computationally expensive process, often requiring updates to billions of parameters and, therefore, significant hardware resources. Parameter-Efficient Fine-Tuning (PEFT) techniques, particularly adapter-based methods, have emerged as a promising alternative, enabling adaptation with minimal trainable parameters while preserving model performance. However, the design choices for adapters, including their architecture, placement, and size, are numerous, and it is unclear which of these combinations would produce optimal adapters and why, for a given case of low-resource music genre. In this paper, we attempt to answer this question by studying various adapter configurations for two AI music models, MusicGen and Mustango, on two genres: Hindustani Classical and Turkish Makam music. Our findings reveal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Architecture and Computational Design
MethodsAdapter
