Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Haotian Xiang; Bingcong Li; and Qin Lu

arXiv:2604.03388·cs.LG·April 7, 2026

Scalable Variational Bayesian Fine-Tuning of LLMs via Orthogonalized Low-Rank Adapters

Haotian Xiang, Bingcong Li, and Qin Lu

PDF

TL;DR

This paper introduces PoLAR-VBLL, a scalable Bayesian fine-tuning method for LLMs that improves uncertainty quantification using orthogonalized low-rank adapters and variational inference.

Contribution

It proposes a novel orthogonalized low-rank adapter (PoLAR) combined with Bayesian last layer inference for scalable, expressive uncertainty-aware fine-tuning of LLMs.

Findings

01

PoLAR-VBLL improves uncertainty calibration on in-distribution data.

02

The method enhances out-of-distribution generalization.

03

It achieves better calibration and generalization compared to existing approaches.

Abstract

When deploying large language models (LLMs) to safety-critical applications, uncertainty quantification (UQ) is of utmost importance to self-assess the reliability of the LLM-based decisions. However, such decisions typically suffer from overconfidence, particularly after parameter-efficient fine-tuning (PEFT) for downstream domain-specific tasks with limited data. Existing methods to alleviate this issue either rely on Laplace approximation based post-hoc framework, which may yield suboptimal calibration depending on the training trajectory, or variational Bayesian training that requires multiple complete forward passes through the entire LLM backbone at inference time for Monte Carlo estimation, posing scalability challenges for deployment. To address these limitations, we build on the Bayesian last layer (BLL) model, where the LLM-based deterministic feature extractor is followed by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.