BoostStep: Boosting mathematical capability of Large Language Models via   improved single-step reasoning

Beichen Zhang; Yuhong Liu; Xiaoyi Dong; Yuhang Zang; Pan Zhang,; Haodong Duan; Yuhang Cao; Dahua Lin; Jiaqi Wang

arXiv:2501.03226·cs.CL·February 18, 2025

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang,, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang

PDF

Open Access 1 Repo

TL;DR

BoostStep introduces step-aligned in-context learning and a 'first-try' strategy to improve reasoning accuracy in large language models, significantly enhancing their performance on complex mathematical problems.

Contribution

It proposes a novel step-aligned ICL mechanism and a 'first-try' strategy, improving reasoning accuracy and performance of LLMs on mathematical benchmarks.

Findings

01

BoostStep improves GPT-4o's CoT performance by 4.6%.

02

It achieves an additional 7.5% gain with tree search.

03

Enhances state-of-the-art LLMs with simpler examples.

Abstract

Large language models (LLMs) have demonstrated impressive ability in solving complex mathematical problems with multi-step reasoning and can be further enhanced with well-designed in-context learning (ICL) examples. However, this potential is often constrained by two major challenges in ICL: granularity mismatch and irrelevant information. We observe that while LLMs excel at decomposing mathematical problems, they often struggle with reasoning errors in fine-grained steps. Moreover, ICL examples retrieved at the question level may omit critical steps or even mislead the model with irrelevant details. To address this issue, we propose BoostStep, a method that enhances reasoning accuracy through step-aligned ICL, a novel mechanism that carefully aligns retrieved reference steps with the corresponding reasoning steps. Additionally, BoostStep incorporates an effective "first-try" strategy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

beichenzbc/booststep
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning

MethodsFocus