Scalable Bayesian Low-Rank Adaptation of Large Language Models via Stochastic Variational Subspace Inference
Colin Samplawski, Adam D. Cobb, Manoj Acharya, Ramneet Kaur, Susmit Jha

TL;DR
This paper introduces ScalaBL, a scalable Bayesian low-rank adaptation method for large language models that uses stochastic variational inference in a low-dimensional subspace, enabling uncertainty quantification in very large models with minimal additional parameters.
Contribution
It proposes a novel scalable Bayesian inference approach for LLMs using low-rank subspace parameterization, allowing efficient uncertainty estimation in models with billions of parameters.
Findings
Achieves competitive performance with state-of-the-art methods.
Requires only about 1000 additional parameters.
Scales to the largest Bayesian LLMs to date.
Abstract
Despite their widespread use, large language models (LLMs) are known to hallucinate incorrect information and be poorly calibrated. This makes the uncertainty quantification of these models of critical importance, especially in high-stakes domains, such as autonomy and healthcare. Prior work has made Bayesian deep learning-based approaches to this problem more tractable by performing inference over the low-rank adaptation (LoRA) parameters of a fine-tuned model. While effective, these approaches struggle to scale to larger LLMs due to requiring further additional parameters compared to LoRA. In this work we present ble ayesian ow-Rank Adaptation via Stochastic Variational Subspace Inference (ScalaBL). We perform Bayesian inference in an -dimensional subspace, for LoRA rank . By repurposing the LoRA parameters as projection matrices, we are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Domain Adaptation and Few-Shot Learning · Speech and Audio Processing
MethodsBalanced Selection
