BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
Beichen Zhang, Yuhong Liu, Xiaoyi Dong, Yuhang Zang, Pan Zhang,, Haodong Duan, Yuhang Cao, Dahua Lin, Jiaqi Wang

TL;DR
BoostStep introduces step-aligned in-context learning and a 'first-try' strategy to improve reasoning accuracy in large language models, significantly enhancing their performance on complex mathematical problems.
Contribution
It proposes a novel step-aligned ICL mechanism and a 'first-try' strategy, improving reasoning accuracy and performance of LLMs on mathematical benchmarks.
Findings
BoostStep improves GPT-4o's CoT performance by 4.6%.
It achieves an additional 7.5% gain with tree search.
Enhances state-of-the-art LLMs with simpler examples.
Abstract
Large language models (LLMs) have demonstrated impressive ability in solving complex mathematical problems with multi-step reasoning and can be further enhanced with well-designed in-context learning (ICL) examples. However, this potential is often constrained by two major challenges in ICL: granularity mismatch and irrelevant information. We observe that while LLMs excel at decomposing mathematical problems, they often struggle with reasoning errors in fine-grained steps. Moreover, ICL examples retrieved at the question level may omit critical steps or even mislead the model with irrelevant details. To address this issue, we propose BoostStep, a method that enhances reasoning accuracy through step-aligned ICL, a novel mechanism that carefully aligns retrieved reference steps with the corresponding reasoning steps. Additionally, BoostStep incorporates an effective "first-try" strategy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Intelligent Tutoring Systems and Adaptive Learning
MethodsFocus
