Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models

Hyegang Son; Yonglak Son; Changhoon Kim; Young Geun Kim

arXiv:2412.03587·cs.CL·May 16, 2025

Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models

Hyegang Son, Yonglak Son, Changhoon Kim, Young Geun Kim

PDF

Open Access 1 Video

TL;DR

This paper introduces SAFE, a method for selectively freezing adapters in large language models during fine-tuning, significantly reducing resource consumption while maintaining or improving task performance.

Contribution

SAFE is a novel selective freezing technique that identifies and freezes less important adapters early, optimizing resource use without sacrificing accuracy.

Findings

01

SAFE reduces memory, computation, and training time by over 40%.

02

SAFE achieves comparable or better performance than baseline methods.

03

Selective freezing induces regularization, improving generalization.

Abstract

Transformer-based large-scale pre-trained models achieve great success. Fine-tuning is the standard practice for leveraging these models in downstream tasks. Among the fine-tuning methods, adapter-tuning provides a parameter-efficient fine-tuning by introducing lightweight trainable modules while keeping most pre-trained parameters frozen. However, existing adapter-tuning methods still impose substantial resource usage. Through our investigation, we show that each adapter unequally contributes to both task performance and resource usage. Motivated by this insight, we propose Selective Adapter FrEezing (SAFE), which gradually freezes less important adapters early to reduce unnecessary resource usage while maintaining performance. In our experiments, SAFE reduces memory usage, computation amount, and training time by 42.85\%, 34.59\%, and 11.82\%, respectively, while achieving comparable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Not All Adapters Matter: Selective Adapter Freezing for Memory-Efficient Fine-Tuning of Language Models· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsAdapter