Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA

Patryk Marszałek; Klaudia Bałazy; Jacek Tabor; Tomasz Kuśmierczyk

arXiv:2502.12122 · cs.LG · September 3, 2025

TL;DR

This paper introduces a parameter-efficient Bayesian LoRA method that performs inference in a low-dimensional subspace, preserving LoRA's efficiency while improving the calibration of large language models.

Contribution

It proposes a novel subspace inference approach for Bayesian LoRA, enabling effective uncertainty quantification with minimal additional parameters.

Findings

01. Uncertainty can be modeled effectively in low-dimensional spaces.

02. Weight covariances exhibit low ranks.

03. The method improves calibration and generalization.
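To make the first two findings concrete, here is a minimal NumPy sketch of a Gaussian over weights whose covariance has low rank. It is an illustration under stated assumptions, not the paper's method: the sizes `d` and `k`, the factor `L`, and the helper `sample_weights` are all hypothetical. It shows that sampling needs only `k` random numbers per draw when the covariance is `L @ L.T`.

```python
import numpy as np

rng = np.random.default_rng(0)

d, k = 100, 3                  # hypothetical: d weights, covariance rank at most k
mu = rng.normal(size=d)        # hypothetical posterior mean
L = rng.normal(size=(d, k))    # hypothetical low-rank covariance factor

def sample_weights(n):
    """Draw n weight vectors from N(mu, L @ L.T) using only k-dimensional noise."""
    z = rng.normal(size=(n, k))    # standard normal noise in the k-dim subspace
    return mu + z @ L.T            # each row is mu + L @ z_i

samples = sample_weights(5)

# The implied d x d covariance never needs to be stored for sampling;
# it is built here only to check its rank.
cov = L @ L.T
cov_rank = np.linalg.matrix_rank(cov)
```

Every sample differs from `mu` only along the `k` columns of `L`, so the uncertainty costs `d * k` parameters rather than the `d * d` of a full covariance.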

Abstract

Low-Rank Adaptation (LoRA) enables parameter-efficient fine-tuning of large language models by decomposing weight updates into low-rank matrices, significantly reducing storage and computational overhead. While effective, standard LoRA lacks mechanisms for uncertainty quantification, leading to overconfident and poorly calibrated models. Bayesian variants of LoRA address this limitation, but at the cost of a significantly increased number of trainable parameters, partially offsetting the original efficiency gains. Additionally, these models are harder to train and may suffer from unstable convergence. In this work, we propose a novel parameter-efficient Bayesian LoRA via subspace inference, demonstrating that effective uncertainty quantification can be achieved in very low-dimensional parameter spaces. The proposed method achieves strong performance with improved calibration and…
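The low-rank decomposition described in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration of standard LoRA only, assuming hypothetical layer sizes, rank, and a helper named `lora_forward`; it does not reproduce the Bayesian variant proposed in the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

d_out, d_in, r = 64, 48, 4     # hypothetical layer sizes and LoRA rank

W0 = rng.normal(size=(d_out, d_in))           # frozen pretrained weight
B = np.zeros((d_out, r))                      # trainable; zero init so the update starts at zero
A = rng.normal(scale=0.01, size=(r, d_in))    # trainable

def lora_forward(x, W0, B, A, scale=1.0):
    """Compute x @ (W0 + scale * B @ A).T without forming the full update matrix."""
    return x @ W0.T + scale * (x @ A.T) @ B.T

x = rng.normal(size=(2, d_in))
y = lora_forward(x, W0, B, A)

full_params = d_out * d_in        # parameters in a dense weight update
lora_params = r * (d_in + d_out)  # parameters in the low-rank factors B and A
```

With these sizes the dense update would need 3072 parameters while the two factors need 448. A Bayesian treatment that keeps, say, a mean and a variance per trainable parameter would roughly double that count, which is the kind of overhead the abstract refers to.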

Code & Models

Repositories

gmum/b-lora-xs (PyTorch, official)

Taxonomy

Topics: Fault Detection and Control Systems