Toward Adaptive Reasoning in Large Language Models with Thought Rollback

Sijia Chen; Baochun Li

arXiv:2412.19707·cs.AI·December 30, 2024

Toward Adaptive Reasoning in Large Language Models with Thought Rollback

Sijia Chen, Baochun Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces Thought Rollback (TR), a novel framework enabling large language models to adaptively revise their reasoning steps through error correction, significantly improving problem-solving accuracy in complex tasks.

Contribution

The paper proposes Thought Rollback, a new error correction mechanism that allows LLMs to revise their reasoning by rolling back to previous thoughts, enhancing reasoning flexibility and accuracy.

Findings

01

GPT-4 with TR outperforms previous methods by 9% on MATH dataset.

02

TR achieves state-of-the-art problem-solving rates in mathematical and multi-task reasoning.

03

TR reduces interaction costs while improving reasoning reliability.

Abstract

Large language models (LLMs) have been routinely used to solve various tasks using step-by-step reasoning. However, the structure of intermediate reasoning steps, or thoughts, is rigid and unidirectional, such as chains, trees, or acyclic-directed graphs. Consequently, the resulting inflexible and forward-only reasoning may not address challenging tasks and fail when the LLM frequently gives false responses, i.e., ``hallucinations''. This paper proposes a new reasoning framework, called Thought Rollback (TR), allowing LLMs to adaptively build thought structure while maintaining effective reasoning toward problem-solving under ``hallucinations''. The core mechanism of TR is rolling back thoughts, which allows LLMs to perform error analysis on thoughts, and thus roll back to any previously mistaken thought for revision. Subsequently, by including such trial-and-error in the prompt to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

iQua/llmpebase
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques

MethodsAttention Is All You Need · Byte Pair Encoding · Absolute Position Encodings · Linear Layer · Softmax · Dense Connections · Dropout · Residual Connection · Multi-Head Attention · Adam