Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters
Haotian Xiang, Bingcong Li, and Qin Lu

TL;DR
This paper introduces PoLAR-VBLL, a scalable Bayesian fine-tuning method for LLMs that improves uncertainty quantification using orthogonalized low-rank adapters and variational inference.
Contribution
It proposes a novel orthogonalized low-rank adapter (PoLAR) combined with Bayesian last layer inference for scalable, expressive uncertainty-aware fine-tuning of LLMs.
Findings
PoLAR-VBLL improves uncertainty calibration on in-distribution data.
The method enhances out-of-distribution generalization.
It achieves better calibration and generalization compared to existing approaches.
Abstract
When deploying large language models (LLMs) to safety-critical applications, uncertainty quantification (UQ) is of utmost importance to self-assess the reliability of the LLM-based decisions. However, such decisions typically suffer from overconfidence, particularly after parameter-efficient fine-tuning (PEFT) for downstream domain-specific tasks with limited data. Existing methods to alleviate this issue either rely on Laplace approximation based post-hoc framework, which may yield suboptimal calibration depending on the training trajectory, or variational Bayesian training that requires multiple complete forward passes through the entire LLM backbone at inference time for Monte Carlo estimation, posing scalability challenges for deployment. To address these limitations, we build on the Bayesian last layer (BLL) model, where the LLM-based deterministic feature extractor is followed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
