SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

Guanghao Li; Wenhao Jiang; Mingfeng Chen; Yan Li; Hao Yu; Shuting Dong; Tao Ren; Ming Tang; Chun Yuan

arXiv:2505.24181·cs.AI·June 2, 2025

SCOUT: Teaching Pre-trained Language Models to Enhance Reasoning via Flow Chain-of-Thought

Guanghao Li, Wenhao Jiang, Mingfeng Chen, Yan Li, Hao Yu, Shuting Dong, Tao Ren, Ming Tang, Chun Yuan

PDF

Open Access

TL;DR

This paper introduces Flow Chain of Thought (Flow CoT), a new reasoning paradigm modeled as a progressive trajectory of latent states, and presents SCOUT, a lightweight fine-tuning framework that enhances reasoning in large language models without pretraining.

Contribution

The paper proposes Flow CoT as a scalable reasoning framework and introduces SCOUT, a novel fine-tuning method that improves reasoning accuracy and explanation quality without requiring pretraining.

Findings

01

SCOUT achieves up to 1.8% accuracy gains on reasoning benchmarks.

02

Flow CoT enables deeper, iterative reasoning and better explanation granularity.

03

SCOUT is effective across eight reasoning benchmarks.

Abstract

Chain of Thought (CoT) prompting improves the reasoning performance of large language models (LLMs) by encouraging step by step thinking. However, CoT-based methods depend on intermediate reasoning steps, which limits scalability and generalization. Recent work explores recursive reasoning, where LLMs reuse internal layers across iterations to refine latent representations without explicit CoT supervision. While promising, these approaches often require costly pretraining and lack a principled framework for how reasoning should evolve across iterations. We address this gap by introducing Flow Chain of Thought (Flow CoT), a reasoning paradigm that models recursive inference as a progressive trajectory of latent cognitive states. Flow CoT frames each iteration as a distinct cognitive stage deepening reasoning across iterations without relying on manual supervision. To realize this, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning

MethodsALIGN