BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models

Yibin Wang; Haizhou Shi; Ligong Han; Dimitris Metaxas; Hao Wang

arXiv:2406.11675·cs.LG·January 28, 2025·1 cites

TL;DR

BLoB introduces a Bayesian low-rank adaptation method that jointly fine-tunes mean and covariance of LLM parameters during training, improving uncertainty estimation and generalization for domain-specific tasks.

Contribution

The paper proposes a Bayesian adaptation algorithm that updates both the mean and the covariance of LLM parameters throughout fine-tuning, whereas previous Bayesian methods are applied only after training.

Findings

01. Enhanced uncertainty estimation on in-distribution data.

02. Improved generalization to out-of-distribution data.

03. Effective joint adjustment of mean and covariance during training.

Abstract

Large Language Models (LLMs) often suffer from overconfidence during inference, particularly when adapted to downstream domain-specific tasks with limited data. Previous work addresses this issue by employing approximate Bayesian estimation after the LLMs are trained, enabling them to quantify uncertainty. However, such post-training approaches' performance is severely limited by the parameters learned during training. In this paper, we go beyond post-training Bayesianization and propose Bayesian Low-Rank Adaptation by Backpropagation (BLoB), an algorithm that continuously and jointly adjusts both the mean and covariance of LLM parameters throughout the whole fine-tuning process. Our empirical results verify the effectiveness of BLoB in terms of generalization and uncertainty estimation, when evaluated on both in-distribution and out-of-distribution data.
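
To make the idea of adjusting a mean and a covariance concrete, here is a minimal sketch of one stochastic forward pass through a low-rank adapter. It is not the authors' implementation. It assumes a diagonal Gaussian over a single low-rank factor, a softplus parameterization of the standard deviation, and a zero-mean Gaussian prior; all function and variable names (`sample_factor`, `kl_to_prior`, `adapted_forward`, `a_mean`, `a_rho`) are hypothetical. In a training loop, an autodiff framework would backpropagate a loss of the form "negative log-likelihood plus weighted KL" into both `a_mean` and `a_rho`; that step is omitted here.

```python
import math
import random


def softplus(r):
    """Map an unconstrained value to a positive standard deviation."""
    return math.log1p(math.exp(r))


def sample_factor(mean, rho, rng):
    """Reparameterized draw: a = mean + softplus(rho) * eps, eps ~ N(0, 1).

    Assumption: independent (diagonal) Gaussian per entry.
    """
    return [[m + softplus(r) * rng.gauss(0.0, 1.0)
             for m, r in zip(mean_row, rho_row)]
            for mean_row, rho_row in zip(mean, rho)]


def kl_to_prior(mean, rho, prior_std=1.0):
    """Closed-form KL(N(mean, sigma^2) || N(0, prior_std^2)), summed over entries."""
    total = 0.0
    for mean_row, rho_row in zip(mean, rho):
        for m, r in zip(mean_row, rho_row):
            sigma = softplus(r)
            total += (math.log(prior_std / sigma)
                      + (sigma ** 2 + m ** 2) / (2.0 * prior_std ** 2)
                      - 0.5)
    return total


def matvec(matrix, vec):
    """Plain matrix-vector product on nested lists."""
    return [sum(w * v for w, v in zip(row, vec)) for row in matrix]


def adapted_forward(w0, b, a_mean, a_rho, x, rng):
    """Compute y = W0 x + B (A x) with A sampled from its variational Gaussian.

    w0 is the frozen pretrained weight; b is a deterministic low-rank factor
    (an assumption of this sketch); A carries the mean and the variance.
    """
    a = sample_factor(a_mean, a_rho, rng)
    base = matvec(w0, x)
    delta = matvec(b, matvec(a, x))
    return [u + v for u, v in zip(base, delta)]


if __name__ == "__main__":
    rng = random.Random(0)
    d_out, d_in, rank = 3, 4, 2
    w0 = [[rng.gauss(0.0, 1.0) for _ in range(d_in)] for _ in range(d_out)]
    b = [[rng.gauss(0.0, 0.1) for _ in range(rank)] for _ in range(d_out)]
    a_mean = [[0.0] * d_in for _ in range(rank)]
    a_rho = [[-3.0] * d_in for _ in range(rank)]  # small initial std
    x = [1.0, 2.0, 3.0, 4.0]
    # Repeated stochastic passes give a spread of outputs for the same input.
    for _ in range(3):
        print(adapted_forward(w0, b, a_mean, a_rho, x, rng))
    print("KL term:", kl_to_prior(a_mean, a_rho))
```

Averaging predictions over several sampled passes is one common way to turn such a layer into an uncertainty estimate; the number of samples and the weight on the KL term are tuning choices that this sketch does not prescribe.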

Code & Models

Repositories

wang-ml-lab/bayesian-peft
PyTorch · Official

Videos

BLoB: Bayesian Low-Rank Adaptation by Backpropagation for Large Language Models · SlidesLive

Taxonomy

Topics: Speech Recognition and Synthesis · Topic Modeling