Think Thrice Before You Act: Progressive Thought Refinement in Large   Language Models

Chengyu Du; Jinyi Han; Yizhou Ying; Aili Chen; Qianyu He; Haokun Zhao,; Sirui Xia; Haoran Guo; Jiaqing Liang; Zulong Chen; Liangyue Li; Yanghua Xiao

arXiv:2410.13413·cs.CL·October 18, 2024

Think Thrice Before You Act: Progressive Thought Refinement in Large Language Models

Chengyu Du, Jinyi Han, Yizhou Ying, Aili Chen, Qianyu He, Haokun Zhao,, Sirui Xia, Haoran Guo, Jiaqing Liang, Zulong Chen, Liangyue Li, Yanghua Xiao

PDF

Open Access

TL;DR

This paper introduces Progressive Thought Refinement (PTR), a novel framework enabling large language models to iteratively improve their responses, leading to better accuracy and quality across diverse tasks without task-specific fine-tuning.

Contribution

The paper proposes PTR, a two-phase training method that enhances LLMs' ability to self-refine responses, improving performance and response quality in open-ended scenarios.

Findings

01

Performance improved from 49.6% to 53.5% on average across ten tasks

02

Significant enhancement in response quality for open-ended tasks

03

Effective without task-specific fine-tuning

Abstract

Recent advancements in large language models (LLMs) have demonstrated that progressive refinement, rather than providing a single answer, results in more accurate and thoughtful outputs. However, existing methods often rely heavily on supervision signals to evaluate previous responses, making it difficult to assess output quality in more open-ended scenarios effectively. Additionally, these methods are typically designed for specific tasks, which limits their generalization to new domains. To address these limitations, we propose Progressive Thought Refinement (PTR), a framework that enables LLMs to refine their responses progressively. PTR operates in two phases: (1) Thought data construction stage: We propose a weak and strong model collaborative selection strategy to build a high-quality progressive refinement dataset to ensure logical consistency from thought to answers, and the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling