Conformal Information Pursuit for Interactively Guiding Large Language Models
Kwan Ho Ryan Chan, Yuyan Ge, Edgar Dobriban, Hamed Hassani, Ren\'e Vidal

TL;DR
This paper introduces Conformal Information Pursuit (C-IP), a novel uncertainty estimation method for interactive question-answering with large language models, improving query efficiency and interpretability over existing approaches.
Contribution
It proposes C-IP, a distribution-free, robust uncertainty estimation technique based on conformal prediction sets, enhancing sequential querying strategies for LLMs in interactive tasks.
Findings
C-IP outperforms previous methods in 20 Questions task with shorter query chains.
C-IP achieves competitive accuracy in medical diagnosis setting with greater interpretability.
Conformal prediction sets provide a robust, distribution-free measure of uncertainty.
Abstract
A significant use case of instruction-finetuned Large Language Models (LLMs) is to solve question-answering tasks interactively. In this setting, an LLM agent is tasked with making a prediction by sequentially querying relevant information from the user, as opposed to a single-turn conversation. This paper explores sequential querying strategies that aim to minimize the expected number of queries. One such strategy is Information Pursuit (IP), a greedy algorithm that at each iteration selects the query that maximizes information gain or equivalently minimizes uncertainty. However, obtaining accurate estimates of mutual information or conditional entropy for LLMs is very difficult in practice due to over- or under-confident LLM proba- bilities, which leads to suboptimal query selection and predictive performance. To better estimate the uncertainty at each iteration, we propose Conformal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
