Learning Conformal Abstention Policies for Adaptive Risk Management in Large Language and Vision-Language Models

Sina Tayebati; Divake Kumar; Nastaran Darabi; Dinithi Jayasuriya; Ranganath Krishnan; Amit Ranjan Trivedi

arXiv:2502.06884·cs.LG·February 12, 2025

TL;DR

This paper introduces a reinforcement learning-based method to adaptively set conformal prediction thresholds, improving uncertainty quantification and decision reliability in large language and vision-language models for safety-critical tasks.

Contribution

It presents a novel learnable conformal abstention approach that dynamically optimizes thresholds using reinforcement learning, surpassing static methods in accuracy and reliability.

Findings

01. Improves accuracy by up to 3.2% over baseline methods.

02. Increases AUROC for hallucination detection by 22.19%.

03. Reduces calibration error by 70-85%.

Abstract

Large Language and Vision-Language Models (LLMs/VLMs) are increasingly used in safety-critical applications, yet their opaque decision-making complicates risk assessment and reliability. Uncertainty quantification (UQ) helps assess prediction confidence and enables abstention when uncertainty is high. Conformal prediction (CP), a leading UQ method, provides statistical guarantees but relies on static thresholds, which fail to adapt to task complexity and evolving data distributions, leading to suboptimal trade-offs in accuracy, coverage, and informativeness. To address this, we propose learnable conformal abstention, integrating reinforcement learning (RL) with CP to optimize abstention thresholds dynamically. By treating CP thresholds as adaptive actions, our approach balances multiple objectives, minimizing prediction set size while maintaining reliable coverage. Extensive evaluations…
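
To make the static baseline in the abstract concrete, the following is a minimal sketch of split conformal prediction with a fixed threshold and a simple abstention rule. It is not the authors' learnable method. The function names, the nonconformity score (one minus the probability of the true label), and the rule "abstain unless the prediction set holds exactly one label" are all assumptions chosen for illustration.

```python
import math


def nonconformity(probs, label):
    """Assumed score: 1 - probability assigned to the given label."""
    return 1.0 - probs[label]


def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    # Rank ceil((n + 1) * (1 - alpha)) in the sorted calibration scores.
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        # Too few calibration points for this alpha: every label is kept.
        return float("inf")
    return sorted(scores)[k - 1]


def prediction_set(probs, qhat):
    """Keep every label whose score is within the static threshold qhat."""
    return [label for label in range(len(probs))
            if nonconformity(probs, label) <= qhat]


def predict_or_abstain(probs, qhat):
    """Illustrative rule: answer only when the set is a single label."""
    labels = prediction_set(probs, qhat)
    return labels[0] if len(labels) == 1 else None  # None means abstain


# Hypothetical calibration scores (would come from held-out labelled data).
calibration_scores = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9]
qhat = conformal_quantile(calibration_scores, alpha=0.25)

confident = predict_or_abstain([0.90, 0.06, 0.04], qhat)
ambiguous = predict_or_abstain([0.50, 0.45, 0.05], qhat)
print(qhat, confident, ambiguous)
```

Here `qhat` is computed once from calibration data and then reused for every input. That fixed threshold is the quantity the paper proposes to choose adaptively, by treating it as an action selected by a reinforcement learning policy.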

Code & Models

Repositories

sinatayebati/vlm-uncertainty (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methods: Sparse Evolutionary Training