DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Shidong Cao; Hongzhan Lin; Yuxuan Gu; Ziyang Luo; Jing Ma

arXiv:2601.03559·cs.CL·April 21, 2026

DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs

Shidong Cao, Hongzhan Lin, Yuxuan Gu, Ziyang Luo, Jing Ma

PDF

1 Repo

TL;DR

DiffCoT introduces a diffusion-inspired iterative framework for chain-of-thought reasoning in large language models, enhancing robustness and error correction in multi-step problem solving.

Contribution

It reformulates CoT reasoning as a denoising process with a causal diffusion schedule, enabling correction of intermediate errors and improved reasoning performance.

Findings

01

DiffCoT outperforms existing CoT methods on multiple benchmarks.

02

It improves robustness and error correction in multi-step reasoning.

03

Experiments show consistent gains across diverse models.

Abstract

Chain-of-Thought (CoT) reasoning improves multi-step mathematical problem solving in large language models but remains vulnerable to exposure bias and error accumulation, as early mistakes propagate irreversibly through autoregressive decoding. In this work, we propose DiffCoT, a diffusion-styled CoT framework that reformulates CoT reasoning as an iterative denoising process. DiffCoT integrates diffusion principles at the reasoning-step level via a sliding-window mechanism, enabling unified generation and retrospective correction of intermediate steps while preserving token-level autoregression. To maintain causal consistency, we further introduce a causal diffusion noise schedule that respects the temporal structure of reasoning chains. Extensive experiments on three multi-step CoT reasoning benchmarks across diverse model backbones demonstrate that DiffCoT consistently outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

caoshidong66/DiffCoT
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.