Calibrating Reasoning in Language Models with Internal Consistency

Zhihui Xie; Jizhou Guo; Tong Yu; Shuai Li

arXiv:2405.18711·cs.AI·December 6, 2024·3 cites

Calibrating Reasoning in Language Models with Internal Consistency

Zhihui Xie, Jizhou Guo, Tong Yu, Shuai Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces internal consistency as a new way to evaluate and calibrate reasoning in large language models by analyzing their internal representations, leading to improved reasoning accuracy.

Contribution

It proposes a novel internal consistency measure based on intermediate layer predictions to better assess and calibrate LLM reasoning processes.

Findings

01

Internal consistency effectively distinguishes correct from incorrect reasoning.

02

Calibrating reasoning with internal consistency boosts model performance.

03

Analysis reveals patterns in attention modules related to internal inconsistency.

Abstract

Large language models (LLMs) have demonstrated impressive capabilities in various reasoning tasks, aided by techniques like chain-of-thought prompting that elicits verbalized reasoning. However, LLMs often generate text with obvious mistakes and contradictions, raising doubts about their ability to robustly process and utilize generated rationales. In this work, we investigate reasoning in LLMs through the lens of internal representations, focusing on how these representations are influenced by generated rationales. Our preliminary analysis reveals that while generated rationales improve answer accuracy, inconsistencies emerge between the model's internal representations in middle layers and those in final layers, potentially undermining the reliability of their reasoning processes. To address this, we propose internal consistency as a measure of the model's confidence by examining the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhxieml/internal-consistency
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling