CRITIC: Large Language Models Can Self-Correct with Tool-Interactive   Critiquing

Zhibin Gou; Zhihong Shao; Yeyun Gong; Yelong Shen; Yujiu Yang; Nan; Duan; Weizhu Chen

arXiv:2305.11738·cs.CL·February 22, 2024·57 cites

CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing

Zhibin Gou, Zhihong Shao, Yeyun Gong, Yelong Shen, Yujiu Yang, Nan, Duan, Weizhu Chen

PDF

Open Access 1 Repo 1 Models

TL;DR

CRITIC introduces a framework enabling large language models to self-validate and refine their outputs using external tools, significantly improving accuracy, correctness, and safety across various tasks.

Contribution

This work presents a novel tool-interactive critiquing framework allowing LLMs to self-correct by leveraging external tools, a significant step beyond traditional static models.

Findings

01

CRITIC improves factual accuracy in LLM outputs.

02

CRITIC reduces toxicity and harmful content.

03

Enhanced performance in question answering and code synthesis.

Abstract

Recent developments in large language models (LLMs) have been impressive. However, these models sometimes show inconsistencies and problematic behavior, such as hallucinating facts, generating flawed code, or creating offensive and toxic content. Unlike these models, humans typically utilize external tools to cross-check and refine their initial content, like using a search engine for fact-checking, or a code interpreter for debugging. Inspired by this observation, we introduce a framework called CRITIC that allows LLMs, which are essentially "black boxes" to validate and progressively amend their own outputs in a manner similar to human interaction with tools. More specifically, starting with an initial output, CRITIC interacts with appropriate tools to evaluate certain aspects of the text, and then revises the output based on the feedback obtained during this validation process.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/ProphetNet
pytorchOfficial

Models

🤗
ibm-granite/granite-guardian-3.2-5b-lora-factuality-correction
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques