PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs

Tongzhou Yu; Zhuhao Zhang; Guanghui Zhu; Shen Jiang; Meikang Qiu; Yihua Huang

arXiv:2506.07587·cs.LG·June 10, 2025

PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs

Tongzhou Yu, Zhuhao Zhang, Guanghui Zhu, Shen Jiang, Meikang Qiu, Yihua Huang

PDF

Open Access

TL;DR

PrunePEFT introduces an iterative hybrid pruning method that optimizes parameter-efficient fine-tuning configurations for large language models, reducing search overhead and enhancing performance.

Contribution

It formulates PEFT strategy search as a pruning problem, enabling efficient, iterative removal of redundant modules to improve fine-tuning outcomes.

Findings

01

Reduces computational cost of PEFT configuration search

02

Improves fine-tuning performance with optimized module selection

03

Scales effectively to large pre-trained models

Abstract

Parameter Efficient Fine-Tuning (PEFT) methods have emerged as effective and promising approaches for fine-tuning pre-trained language models. Compared with Full parameter Fine-Tuning (FFT), PEFT achieved comparable task performance with a substantial reduction of trainable parameters, which largely saved the training and storage costs. However, using the PEFT method requires considering a vast design space, such as the type of PEFT modules and their insertion layers. Inadequate configurations can lead to sub-optimal results. Conventional solutions such as architectural search techniques, while effective, tend to introduce substantial additional overhead. In this paper, we propose a novel approach, PrunePEFT, which formulates the PEFT strategy search as a pruning problem and introduces a hybrid pruning strategy that capitalizes on the sensitivity of pruning methods to different PEFT…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsParallel Computing and Optimization Techniques · Natural Language Processing Techniques · Machine Learning in Materials Science