The Curious Language Model: Strategic Test-Time Information Acquisition
Michael Cooper, Rohan Wadhawan, John Michael Giorgi, Chenhao Tan, Davis Liang

TL;DR
This paper introduces CuriosiTree, a heuristic test-time policy for large language models that strategically acquires information cost-effectively, improving decision accuracy in clinical diagnosis simulations.
Contribution
The paper presents CuriosiTree, a novel greedy tree search-based method for zero-shot, cost-aware information acquisition in large language models.
Findings
Outperforms baseline strategies in clinical diagnosis accuracy
Enables cost-effective integration of diverse information sources
Improves decision confidence with strategic action selection
Abstract
Decision-makers often possess insufficient information to render a confident decision. In these cases, the decision-maker can often undertake actions to acquire the necessary information about the problem at hand, e.g., by consulting knowledgeable authorities or by conducting experiments. Importantly, different levers of information acquisition come with different costs, posing the challenge of selecting the actions that are both informative and cost-effective. In this work, we propose CuriosiTree, a heuristic-based, test-time policy for zero-shot information acquisition in large language models (LLMs). CuriosiTree employs a greedy tree search to estimate the expected information gain of each action and strategically chooses actions based on a balance of anticipated information gain and associated cost. Empirical validation in a clinical diagnosis simulation shows that CuriosiTree…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI) · Ferroelectric and Negative Capacitance Devices
