Optimizing Chain-of-Thought Confidence via Topological and Dirichlet Risk Analysis

Abhishek More; Anthony Zhang; Nicole Bonilla; Ashvik Vivekan; Kevin Zhu; Parham Sharafoleslami; Maheep Chaudhary

arXiv:2511.06437·cs.AI·November 11, 2025

Optimizing Chain-of-Thought Confidence via Topological and Dirichlet Risk Analysis

Abhishek More, Anthony Zhang, Nicole Bonilla, Ashvik Vivekan, Kevin Zhu, Parham Sharafoleslami, Maheep Chaudhary

PDF

Open Access 2 Videos

TL;DR

This paper introduces EDTR, a novel method combining topological analysis and Dirichlet uncertainty to improve confidence calibration in large language models during complex reasoning tasks.

Contribution

The paper presents EDTR, a new decoding strategy that uses geometric and topological features to better estimate LLM confidence across multiple reasoning paths.

Findings

01

EDTR achieves 41% better calibration than existing methods.

02

EDTR attains perfect accuracy on AIME and high calibration on GSM8K.

03

EDTR significantly reduces overconfidence in LLM reasoning.

Abstract

Chain-of-thought (CoT) prompting enables Large Language Models to solve complex problems, but deploying these models safely requires reliable confidence estimates, a capability where existing methods suffer from poor calibration and severe overconfidence on incorrect predictions. We propose Enhanced Dirichlet and Topology Risk (EDTR), a novel decoding strategy that combines topological analysis with Dirichlet-based uncertainty quantification to measure LLM confidence across multiple reasoning paths. EDTR treats each CoT as a vector in high-dimensional space and extracts eight topological risk features capturing the geometric structure of reasoning distributions: tighter, more coherent clusters indicate higher confidence while dispersed, inconsistent paths signal uncertainty. We evaluate EDTR against three state-of-the-art calibration methods across four diverse reasoning benchmarks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Optimizing Chain-of-Thought Confidence via Topological and Dirichlet Risk Analysis· underline

Taxonomy

TopicsAdvanced Graph Neural Networks · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)