FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models

Kehan Jiang; Haonan Dong; Zhaolu Kang; Zhengzhou Zhu; Guojie Song

arXiv:2604.02967·cs.AI·April 6, 2026

FoE: Forest of Errors Makes the First Solution the Best in Large Reasoning Models

Kehan Jiang, Haonan Dong, Zhaolu Kang, Zhengzhou Zhu, Guojie Song

PDF

TL;DR

This paper reveals that in large reasoning models, the first solution is often the best due to a forest-structured error propagation, and introduces RED to improve reasoning accuracy and efficiency.

Contribution

The paper uncovers the Forest of Errors phenomenon in reasoning models and proposes RED, a novel framework to suppress errors and enhance performance.

Findings

01

RED outperforms 8 baselines with up to 19.0% accuracy gain.

02

RED reduces token consumption by 37.7% to 70.4%.

03

Extensive experiments validate RED's effectiveness across benchmarks.

Abstract

Recent Large Reasoning Models (LRMs) like DeepSeek-R1 have demonstrated remarkable success in complex reasoning tasks, exhibiting human-like patterns in exploring multiple alternative solutions. Upon closer inspection, however, we uncover a surprising phenomenon: The First is The Best, where alternative solutions are not merely suboptimal but potentially detrimental. This observation challenges widely accepted test-time scaling laws, leading us to hypothesize that errors within the reasoning path scale concurrently with test time. Through comprehensive empirical analysis, we characterize errors as a forest-structured Forest of Errors (FoE) and conclude that FoE makes the First the Best, which is underpinned by rigorous theoretical analysis. Leveraging these insights, we propose RED, a self-guided efficient reasoning framework comprising two components: I) Refining First, which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.