Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models

Aleksandr Rubashevskii; Dzianis Piatrashyn; Preslav Nakov; Maxim Panov

arXiv:2604.13991·cs.CL·April 16, 2026

Adaptive Conformal Prediction for Improving Factuality of Generations by Large Language Models

Aleksandr Rubashevskii, Dzianis Piatrashyn, Preslav Nakov, Maxim Panov

PDF

TL;DR

This paper introduces an adaptive conformal prediction method for large language models that improves factuality and conditional coverage by enabling prompt-dependent calibration and selective prediction.

Contribution

It extends conformal score transformation techniques to LLMs, providing input-aware uncertainty estimates and enhancing factuality filtering capabilities.

Findings

01

Significantly outperforms existing baselines in conditional coverage.

02

Supports prompt-dependent calibration for better factuality assessment.

03

Enables selective prediction to filter unreliable outputs.

Abstract

Large language models (LLMs) are prone to generating factually incorrect outputs. Recent work has applied conformal prediction to provide uncertainty estimates and statistical guarantees for the factuality of LLM generations. However, existing approaches are typically not prompt-adaptive, limiting their ability to capture input-dependent variability. As a result, they may filter out too few items (leading to over-coverage) or too many (under-coverage) for a given task or prompt. We propose an adaptive conformal prediction approach that extends conformal score transformation methods to LLMs, with applications to long-form generation and multiple-choice question answering. This enables prompt-dependent calibration, retaining marginal coverage guarantees while improving conditional coverage. In addition, the approach naturally supports selective prediction, allowing unreliable claims or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.