Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning

Zezhong Wang; Xingshan Zeng; Weiwen Liu; Yufei Wang; Liangyou Li; Yasheng Wang; Lifeng Shang; Xin Jiang; Qun Liu; Kam-Fai Wong

arXiv:2505.17829 · cs.CL · May 26, 2025

TL;DR

This paper introduces SRCA, a framework that inserts checkpoints between reasoning steps and clusters reasoning paths by their checkpoint answers. It improves the reasoning accuracy of Large Language Models by reducing path homogenization and making fuller use of intermediate results.

Contribution

The paper proposes Stepwise Reasoning Checkpoint Analysis (SRCA), a test-time scaling method for LLMs that introduces checkpoints between reasoning steps, clusters reasoning paths by checkpoint answer to maintain diversity, and uses all intermediate answers in the final decision.

Findings

01 SRCA outperforms existing test-time scaling (TTS) methods in reasoning accuracy.

02 It reduces homogenization among reasoning paths.

03 The method demonstrates robustness and fault tolerance in mathematical reasoning tasks.

Abstract

Mathematical reasoning through Chain-of-Thought (CoT) has emerged as a powerful capability of Large Language Models (LLMs), which can be further enhanced through Test-Time Scaling (TTS) methods like Beam Search and DVTS. However, these methods, despite improving accuracy by allocating more computational resources during inference, often suffer from path homogenization and inefficient use of intermediate results. To address these limitations, we propose Stepwise Reasoning Checkpoint Analysis (SRCA), a framework that introduces checkpoints between reasoning steps. It incorporates two key strategies: (1) Answer-Clustered Search, which groups reasoning paths by their intermediate checkpoint answers to maintain diversity while ensuring quality, and (2) Checkpoint Candidate Augmentation, which leverages all intermediate answers for final decision-making. Our approach effectively reduces path…
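The two strategies named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Path` record, the per-path `score` (standing in for something like a reward-model score), the round-robin selection rule, and the score-summing vote are all assumptions made for illustration, since the abstract does not specify these details.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Path:
    steps: list[str]          # reasoning steps generated so far
    checkpoint_answer: str    # answer elicited at the latest checkpoint
    score: float              # hypothetical quality score (assumption)


def answer_clustered_select(paths: list[Path], keep: int) -> list[Path]:
    """Keep `keep` paths, spreading the budget across distinct checkpoint answers."""
    # Group paths by the answer they produced at the checkpoint.
    clusters: dict[str, list[Path]] = defaultdict(list)
    for p in paths:
        clusters[p.checkpoint_answer].append(p)
    # Best path first inside each cluster.
    for members in clusters.values():
        members.sort(key=lambda p: p.score, reverse=True)
    # Visit clusters in order of total score (assumed ranking rule).
    ordered = sorted(
        clusters.values(), key=lambda m: sum(p.score for p in m), reverse=True
    )
    # Round-robin: take the rank-0 path of every cluster, then rank-1, and so on.
    selected: list[Path] = []
    rank = 0
    while len(selected) < keep and any(rank < len(m) for m in ordered):
        for members in ordered:
            if rank < len(members) and len(selected) < keep:
                selected.append(members[rank])
        rank += 1
    return selected


def augmented_final_answer(checkpoint_history: list[tuple[str, float]]) -> str:
    """Pick the answer with the highest summed score over all recorded checkpoints.

    `checkpoint_history` holds (answer, score) pairs collected at every
    checkpoint, not only from the paths that survived to the end.
    It must be non-empty.
    """
    totals: dict[str, float] = defaultdict(float)
    for answer, score in checkpoint_history:
        totals[answer] += score
    return max(totals, key=totals.get)
```

In this sketch, a plain top-k selection over four paths scored 0.9, 0.8, 0.7 (all answering "42") and 0.6 (answering "17") would keep only "42" paths when k = 2, whereas `answer_clustered_select` keeps one path per answer. That is the kind of homogenization the clustering step is meant to avoid.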
