Uncertainty quantification in fine-tuned LLMs using LoRA ensembles

Oleksandr Balabanov; Hampus Linander

arXiv:2402.12264·cs.LG·May 22, 2025·1 cites

Uncertainty quantification in fine-tuned LLMs using LoRA ensembles

Oleksandr Balabanov, Hampus Linander

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method for quantifying uncertainty in fine-tuned large language models using low-rank adaptation ensembles, providing insights into model knowledge retention and trustworthiness.

Contribution

It presents a novel approach combining posterior approximations with low-rank adaptation ensembles for uncertainty quantification in fine-tuned LLMs.

Findings

01

Enables principled uncertainty estimation in fine-tuned LLMs.

02

Reveals retention of prior knowledge during overfitting.

03

Analyzes model behavior across multiple-choice datasets.

Abstract

Fine-tuning large language models can improve task specific performance, although a general understanding of what the fine-tuned model has learned, forgotten and how to trust its predictions is still missing. We derive principled uncertainty quantification for fine-tuned LLMs with posterior approximations using computationally efficient low-rank adaptation ensembles. We analyze three common multiple-choice datasets using low-rank adaptation ensembles based on Mistral-7b, and draw quantitative and qualitative conclusions on their perceived complexity and balance between retained prior knowledge and domain specific adaptation during and after fine-tuning. We identify unexpected retention of acquired knowledge during fine-tuning in the overfitting regime.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

oleksandr-balabanov/equivariant-posteriors
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFault Detection and Control Systems · Fuzzy Logic and Control Systems · Anomaly Detection Techniques and Applications