SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation

Yanwei Ren; Haotian Zhang; Fuxiang Wu; Jiayan Qiu; Jiaxing Huang; Baosheng Yu; Liu Liu

arXiv:2506.06470·cs.AI·June 10, 2025

SIGMA: Refining Large Language Model Reasoning via Sibling-Guided Monte Carlo Augmentation

Yanwei Ren, Haotian Zhang, Fuxiang Wu, Jiayan Qiu, Jiaxing Huang, Baosheng Yu, Liu Liu

PDF

Open Access

TL;DR

SIGMA enhances large language model reasoning by leveraging sibling nodes in Monte Carlo Tree Search, using critique and revision models to refine reasoning paths and improve accuracy with less data.

Contribution

Introduces SIGMA, a novel sibling-guided augmentation framework that reuses sibling nodes in MCTS to refine reasoning and reduce data requirements.

Findings

01

SIGMA improves reasoning accuracy on MATH benchmark.

02

SIGMA achieves 54.92% accuracy with only 30K samples.

03

Outperforms models trained on much larger datasets.

Abstract

Enhancing large language models by simply scaling up datasets has begun to yield diminishing returns, shifting the spotlight to data quality. Monte Carlo Tree Search (MCTS) has emerged as a powerful technique for generating high-quality chain-of-thought data, yet conventional approaches typically retain only the top-scoring trajectory from the search tree, discarding sibling nodes that often contain valuable partial insights, recurrent error patterns, and alternative reasoning strategies. This unconditional rejection of non-optimal reasoning branches may waste vast amounts of informative data in the whole search tree. We propose SIGMA (Sibling Guided Monte Carlo Augmentation), a novel framework that reintegrates these discarded sibling nodes to refine LLM reasoning. SIGMA forges semantic links among sibling nodes along each search path and applies a two-stage refinement: a critique…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Artificial Intelligence in Healthcare and Education