Parameter-Efficient Transfer Learning for Music Foundation Models

Yiwei Ding; Alexander Lerch

arXiv:2411.19371·cs.SD·December 2, 2024

Parameter-Efficient Transfer Learning for Music Foundation Models

Yiwei Ding, Alexander Lerch

PDF

Open Access 1 Repo

TL;DR

This paper explores parameter-efficient transfer learning methods for music foundation models, demonstrating they outperform probing and fine-tuning in music auto-tagging and match fine-tuning in key detection and tempo estimation with less training cost.

Contribution

It introduces three PETL methods—adapter, prompt, and reparameterization-based—that require fewer parameters and computational resources for adapting music foundation models.

Findings

01

PETL methods outperform probing and fine-tuning in music auto-tagging.

02

PETL methods achieve similar results to fine-tuning in key detection and tempo estimation with less training.

03

Current foundation models' usefulness for key and tempo tasks is questioned due to small models performing similarly.

Abstract

More music foundation models are recently being released, promising a general, mostly task independent encoding of musical information. Common ways of adapting music foundation models to downstream tasks are probing and fine-tuning. These common transfer learning approaches, however, face challenges. Probing might lead to suboptimal performance because the pre-trained weights are frozen, while fine-tuning is computationally expensive and is prone to overfitting. Our work investigates the use of parameter-efficient transfer learning (PETL) for music foundation models which integrates the advantage of probing and fine-tuning. We introduce three types of PETL methods: adapter-based methods, prompt-based methods, and reparameterization-based methods. These methods train only a small number of parameters, and therefore do not require significant computational resources. Results show that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

suncerock/peft-music
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Speech and Audio Processing