Parameter-Efficient Transfer Learning for Music Foundation Models
Yiwei Ding, Alexander Lerch

TL;DR
This paper explores parameter-efficient transfer learning methods for music foundation models, demonstrating they outperform probing and fine-tuning in music auto-tagging and match fine-tuning in key detection and tempo estimation with less training cost.
Contribution
It introduces three PETL methods—adapter, prompt, and reparameterization-based—that require fewer parameters and computational resources for adapting music foundation models.
Findings
PETL methods outperform probing and fine-tuning in music auto-tagging.
PETL methods achieve similar results to fine-tuning in key detection and tempo estimation with less training.
Current foundation models' usefulness for key and tempo tasks is questioned due to small models performing similarly.
Abstract
More music foundation models are recently being released, promising a general, mostly task independent encoding of musical information. Common ways of adapting music foundation models to downstream tasks are probing and fine-tuning. These common transfer learning approaches, however, face challenges. Probing might lead to suboptimal performance because the pre-trained weights are frozen, while fine-tuning is computationally expensive and is prone to overfitting. Our work investigates the use of parameter-efficient transfer learning (PETL) for music foundation models which integrates the advantage of probing and fine-tuning. We introduce three types of PETL methods: adapter-based methods, prompt-based methods, and reparameterization-based methods. These methods train only a small number of parameters, and therefore do not require significant computational resources. Results show that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing
