Exploring the Impact of Model Scaling on Parameter-Efficient Tuning
Yusheng Su, Chi-Min Chan, Jiali Cheng, Yujia Qin, Yankai Lin,, Shengding Hu, Zonghan Yang, Ning Ding, Xingzhi Sun, Guotong Xie, Zhiyuan Liu,, Maosong Sun

TL;DR
This paper investigates how increasing model size reduces the performance gap among parameter-efficient tuning methods, introducing a flexible APET method and analyzing its effects across various NLP tasks.
Contribution
It introduces the Arbitrary PET (APET) method, demonstrating that larger models diminish the importance of tunable parameter positions and enable comparable performance with fewer parameters.
Findings
Model scaling reduces the impact of tunable parameter positions.
Larger models allow tuning methods to match full fine-tuning performance with fewer parameters.
Tuning methods tend to optimize a similar number of parameters across tasks.
Abstract
Parameter-efficient tuning (PET) methods can effectively drive extremely large pre-trained language models (PLMs) by training only minimal parameters. Different PET methods utilize different manually designed tunable modules. In small PLMs, there are usually noticeable performance differences among PET methods. Nevertheless, as the model scale increases, the performance differences become marginal. Hence, we hypothesize that model scaling mitigates the impact of design differences on PET methods. To investigate this hypothesis, we introduce a more flexible PET method called Arbitrary PET (APET) method. The APET method is compatible with a tunable module, which consists of any number of parameters distributed in arbitrary positions. Then, we utilize it and conduct experiments on 11 NLP tasks across 3 representative PLMs. Our investigations reveal that model scaling (1) mitigates the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Software Engineering Research
