Identifying Influential N-grams in Confidence Calibration via Regression Analysis

Shintaro Ozaki; Wataru Hashimoto; Hidetaka Kamigaito; Katsuhiko Hayashi; Taro Watanabe

arXiv:2604.05757·cs.CL·April 8, 2026

Identifying Influential N-grams in Confidence Calibration via Regression Analysis

Shintaro Ozaki, Wataru Hashimoto, Hidetaka Kamigaito, Katsuhiko Hayashi, Taro Watanabe

PDF

TL;DR

This paper investigates linguistic expressions influencing confidence in large language models, revealing overconfidence linked to specific phrases and proposing suppression to improve calibration without performance loss.

Contribution

It introduces a regression-based method to identify linguistic cues related to confidence, highlighting their impact and enabling calibration improvements.

Findings

01

LLMs remain overconfident during reasoning tasks.

02

Certain linguistic expressions correlate with overconfidence.

03

Suppressing these expressions improves confidence calibration.

Abstract

While large language models (LLMs) improve performance by explicit reasoning, their responses are often overconfident, even though they include linguistic expressions demonstrating uncertainty. In this work, we identify what linguistic expressions are related to confidence by applying the regression method. Specifically, we predict confidence of those linguistic expressions in the reasoning parts of LLMs as the dependent variables and analyze the relationship between a specific $n$ -gram and confidence. Across multiple models and QA benchmarks, we show that LLMs remain overconfident when reasoning is involved and attribute this behavior to specific linguistic information. Interestingly, several of the extracted expressions coincide with cue phrases intentionally inserted on test-time scaling to improve reasoning performance. Through our test on causality and verification that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.