Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs

Ruijia Niu; Dongxia Wu; Rose Yu; Yi-An Ma

arXiv:2410.06431·cs.LG·May 15, 2026

Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs

Ruijia Niu, Dongxia Wu, Rose Yu, Yi-An Ma

PDF

TL;DR

The paper introduces UQ4CT, a novel method for calibrating uncertainty in fine-tuned LLMs by focusing on functional space, leading to better confidence estimates and robustness under distribution shifts.

Contribution

It proposes a functional-level uncertainty calibration approach using a mixture-of-experts framework, improving calibration and generalization of fine-tuned LLMs.

Findings

01

UQ4CT reduces Expected Calibration Error by over 25% on multiple benchmarks.

02

UQ4CT maintains high accuracy while improving calibration under distribution shifts.

03

The method outperforms existing post hoc uncertainty estimation techniques.

Abstract

Accurate uncertainty quantification in large language models (LLMs) is essential for reliable confidence estimation, yet fine-tuned LLMs often become overconfident under limited adaptation data. Existing uncertainty methods for PEFT-based LLMs are largely post hoc, estimating uncertainty after fine-tuning rather than improving how adapters specialize to task-specific input-output relationships. We propose Functional-Level Uncertainty Quantification for Calibrated Fine-Tuning (UQ4CT), which calibrates uncertainty over the functional space induced by prompt-dependent mixtures of LoRA experts. UQ4CT implements this perspective through a mixture-of-experts fine-tuning framework, where a calibration loss aligns functional-level confidence with predictive correctness during training. Across four multiple-choice benchmarks and two open-ended generative QA tasks, UQ4CT reduces Expected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.