Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of   Large Language Models

Loka Li; Zhenhao Chen; Guangyi Chen; Yixuan Zhang; Yusheng Su; Eric; Xing; Kun Zhang

arXiv:2402.12563·cs.CL·May 14, 2024·3 cites

Confidence Matters: Revisiting Intrinsic Self-Correction Capabilities of Large Language Models

Loka Li, Zhenhao Chen, Guangyi Chen, Yixuan Zhang, Yusheng Su, Eric, Xing, Kun Zhang

PDF

Open Access 1 Repo

TL;DR

This paper investigates the intrinsic self-correction abilities of Large Language Models, emphasizing the importance of their confidence levels, and introduces an IoE prompting framework to enhance their self-correction accuracy.

Contribution

It identifies confidence as a key factor in LLM self-correction and proposes a novel IoE prompting method to improve their intrinsic correction capabilities.

Findings

01

IoE prompting improves self-correction accuracy

02

LLMs can assess their own confidence levels

03

Framework demonstrates consistent performance gains

Abstract

The recent success of Large Language Models (LLMs) has catalyzed an increasing interest in their self-correction capabilities. This paper presents a comprehensive investigation into the intrinsic self-correction of LLMs, attempting to address the ongoing debate about its feasibility. Our research has identified an important latent factor - the "confidence" of LLMs - during the self-correction process. Overlooking this factor may cause the models to over-criticize themselves, resulting in unreliable conclusions regarding the efficacy of self-correction. We have experimentally observed that LLMs possess the capability to understand the "confidence" in their own responses. It motivates us to develop an "If-or-Else" (IoE) prompting framework, designed to guide LLMs in assessing their own "confidence", facilitating intrinsic self-corrections. We conduct extensive experiments and demonstrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mbzuai-clear/ioe-prompting
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling