A Bayesian Hybrid Parameter-Efficient Fine-Tuning Method for Large Language Models
Yidong Chai (1, 2), Yang Liu (1, 2), Yonghang Zhou (1, 2), Jiaheng Xie (3), Daniel Dajun Zeng (4) ((1) School of Management, Hefei University of Technology, Hefei, China, (2) Key Laboratory of Process Optimization, Intelligent Decision-making, Ministry of Education, Hefei, China

TL;DR
This paper introduces BH-PEFT, a Bayesian hybrid fine-tuning method for large language models that enables uncertainty quantification and dynamic adaptation, improving performance on business-related tasks.
Contribution
The paper proposes a novel Bayesian hybrid PEFT method that integrates multiple techniques and models uncertainty, enhancing adaptability and decision reliability in real-world applications.
Findings
Outperforms existing PEFT methods on business tasks
Enables uncertainty quantification for more reliable decisions
Improves adaptability in dynamic data scenarios
Abstract
Large Language Models (LLMs) have demonstrated transformative potential in reshaping the world. As these models are pretrained on general corpora, they often require domain-specific fine-tuning to optimize performance in specialized business applications. Due to their massive scale, parameter-efficient fine-tuning (PEFT) methods are widely used to reduce training costs. Among them, hybrid PEFT methods that combine multiple PEFT techniques have achieved the best performance. However, existing hybrid PEFT methods face two main challenges when fine-tuning LLMs for specialized applications: (1) relying on point estimates, lacking the ability to quantify uncertainty for reliable decision-making, and (2) struggling to dynamically adapt to emerging data, lacking the ability to suit real-world situations. We propose Bayesian Hybrid Parameter-Efficient Fine-Tuning (BH-PEFT), a novel method that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
