Training-Free Bayesianization for Low-Rank Adapters of Large Language Models

Haizhou Shi; Yibin Wang; Ligong Han; Huan Zhang; Hao Wang

arXiv:2412.05723·stat.ML·September 29, 2025

Training-Free Bayesianization for Low-Rank Adapters of Large Language Models

Haizhou Shi, Yibin Wang, Ligong Han, Huan Zhang, Hao Wang

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces Training-Free Bayesianization (TFB), a novel framework that converts trained low-rank adapters of large language models into Bayesian models without additional training, improving uncertainty estimation and generalization.

Contribution

TFB provides a theoretically grounded, training-free method to Bayesianize low-rank adapters, simplifying uncertainty quantification in large language models.

Findings

01

TFB achieves superior uncertainty estimation compared to existing methods.

02

TFB improves model generalization without additional training.

03

Theoretical analysis links TFB to KL-regularized variational inference.

Abstract

Estimating the uncertainty of responses from Large Language Models (LLMs) remains a critical challenge. While recent Bayesian methods have demonstrated effectiveness in quantifying uncertainty through low-rank weight updates, they typically require complex fine-tuning or post-training procedures. In this paper, we propose Training-Free Bayesianization (TFB), a simple yet theoretically grounded framework that efficiently transforms trained low-rank adapters into Bayesian ones without additional training. TFB systematically searches for the maximally acceptable level of variance in the weight posterior, constrained within a family of low-rank isotropic Gaussian distributions. Our theoretical analysis shows that under mild conditions, this search process is equivalent to KL-regularized variational optimization, a generalized form of variational inference. Through comprehensive experiments,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wang-ml-lab/bayesian-peft
pytorchOfficial

Models

🤗
FlyLee/bayesian-peft
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques

MethodsVariational Inference