Optimizing Chain-of-Thought Confidence via Topological and Dirichlet Risk Analysis
Abhishek More, Anthony Zhang, Nicole Bonilla, Ashvik Vivekan, Kevin Zhu, Parham Sharafoleslami, Maheep Chaudhary

TL;DR
This paper introduces EDTR, a novel method combining topological analysis and Dirichlet uncertainty to improve confidence calibration in large language models during complex reasoning tasks.
Contribution
The paper presents EDTR, a new decoding strategy that uses geometric and topological features to better estimate LLM confidence across multiple reasoning paths.
Findings
EDTR achieves 41% better calibration than existing methods.
EDTR attains perfect accuracy on AIME and high calibration on GSM8K.
EDTR significantly reduces overconfidence in LLM reasoning.
Abstract
Chain-of-thought (CoT) prompting enables Large Language Models to solve complex problems, but deploying these models safely requires reliable confidence estimates, a capability where existing methods suffer from poor calibration and severe overconfidence on incorrect predictions. We propose Enhanced Dirichlet and Topology Risk (EDTR), a novel decoding strategy that combines topological analysis with Dirichlet-based uncertainty quantification to measure LLM confidence across multiple reasoning paths. EDTR treats each CoT as a vector in high-dimensional space and extracts eight topological risk features capturing the geometric structure of reasoning distributions: tighter, more coherent clusters indicate higher confidence while dispersed, inconsistent paths signal uncertainty. We evaluate EDTR against three state-of-the-art calibration methods across four diverse reasoning benchmarks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
