Training-Free Bayesianization for Low-Rank Adapters of Large Language Models
Haizhou Shi, Yibin Wang, Ligong Han, Huan Zhang, Hao Wang

TL;DR
This paper introduces Training-Free Bayesianization (TFB), a novel framework that converts trained low-rank adapters of large language models into Bayesian models without additional training, improving uncertainty estimation and generalization.
Contribution
TFB provides a theoretically grounded, training-free method to Bayesianize low-rank adapters, simplifying uncertainty quantification in large language models.
Findings
TFB achieves superior uncertainty estimation compared to existing methods.
TFB improves model generalization without additional training.
Theoretical analysis links TFB to KL-regularized variational inference.
Abstract
Estimating the uncertainty of responses from Large Language Models (LLMs) remains a critical challenge. While recent Bayesian methods have demonstrated effectiveness in quantifying uncertainty through low-rank weight updates, they typically require complex fine-tuning or post-training procedures. In this paper, we propose Training-Free Bayesianization (TFB), a simple yet theoretically grounded framework that efficiently transforms trained low-rank adapters into Bayesian ones without additional training. TFB systematically searches for the maximally acceptable level of variance in the weight posterior, constrained within a family of low-rank isotropic Gaussian distributions. Our theoretical analysis shows that under mild conditions, this search process is equivalent to KL-regularized variational optimization, a generalized form of variational inference. Through comprehensive experiments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Speech Recognition and Synthesis · Natural Language Processing Techniques
MethodsVariational Inference
