Small Language Model Can Self-correct

Haixia Han; Jiaqing Liang; Jie Shi; Qianyu He; Yanghua Xiao

arXiv:2401.07301·cs.CL·May 14, 2024·1 cites

Small Language Model Can Self-correct

Haixia Han, Jiaqing Liang, Jie Shi, Qianyu He, Yanghua Xiao

PDF

Open Access

TL;DR

This paper introduces Intrinsic Self-Correction (ISC), a method enabling small language models to spontaneously correct their outputs, significantly improving accuracy in reasoning tasks through fine-tuning with a novel Partial Answer Masking technique.

Contribution

The paper proposes ISC and PAM, a new approach for small LMs to self-correct without external prompts, enhancing their output quality in reasoning tasks.

Findings

01

ISC improves output accuracy over baseline models.

02

Self-correction benefits are consistent across different model sizes.

03

Method is effective for commonsense and factual reasoning tasks.

Abstract

Generative Language Models (LMs) such as ChatGPT have exhibited remarkable performance across various downstream tasks. Nevertheless, one of their most prominent drawbacks is generating inaccurate or false information with a confident tone. Previous studies have devised sophisticated pipelines and prompts to induce large LMs to exhibit the capability for self-correction. However, large LMs are explicitly prompted to verify and modify its answers separately rather than completing all steps spontaneously like humans. Moreover, these complex prompts are extremely challenging for small LMs to follow. In this paper, we introduce the \underline{I}ntrinsic \underline{S}elf-\underline{C}orrection (ISC) in generative language models, aiming to correct the initial output of LMs in a self-triggered manner, even for those small LMs with 6 billion parameters. Specifically, we devise a pipeline for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications