UniPELT: A Unified Framework for Parameter-Efficient Language Model Tuning
Yuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei, Han, Wen-tau Yih, Madian Khabsa

TL;DR
UniPELT is a unified framework that adaptively combines multiple parameter-efficient language model tuning methods, improving performance on NLP tasks by learning to select the best submodules for each task.
Contribution
Introduces UniPELT, a framework that integrates various PELT methods with a gating mechanism to automatically select optimal tuning strategies for different tasks.
Findings
Achieves 1-4% improvements on GLUE benchmark over individual PELT methods.
Outperforms fine-tuning across various setups.
Surpasses the best individual submodule performance, showing the effectiveness of combining multiple methods.
Abstract
Recent parameter-efficient language model tuning (PELT) methods manage to match the performance of fine-tuning with much fewer trainable parameters and perform especially well when training data is limited. However, different PELT methods may perform rather differently on the same task, making it nontrivial to select the most appropriate method for a specific task, especially considering the fast-growing number of new PELT methods and tasks. In light of model diversity and the difficulty of model selection, we propose a unified framework, UniPELT, which incorporates different PELT methods as submodules and learns to activate the ones that best suit the current data or task setup via gating mechanism. On the GLUE benchmark, UniPELT consistently achieves 1~4% gains compared to the best individual PELT method that it incorporates and even outperforms fine-tuning under different setups.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
