SpiralThinker: Latent Reasoning through an Iterative Process with Text-Latent Interleaving
Shengmin Piao, Sanghyun Park

TL;DR
SpiralThinker introduces a stabilized iterative latent reasoning framework that interleaves latent and textual reasoning, achieving state-of-the-art results across various reasoning tasks.
Contribution
It presents a novel method combining progressive alignment and structured annotations to stabilize and improve latent reasoning dynamics.
Findings
Achieves state-of-the-art performance on reasoning tasks.
Both iteration and alignment are essential for effective reasoning.
Proper alignment and iteration numbers vary by dataset.
Abstract
Recent advances in large reasoning models have been driven by reinforcement learning and test-time scaling, accompanied by growing interest in latent rather than purely textual reasoning. However, existing latent reasoning methods lack mechanisms to ensure stable reasoning dynamics in latent space and a systematic way to interleave implicit and explicit reasoning. We introduce SpiralThinker, a stabilized iterative latent reasoning framework that performs iterative updates over latent representations while interleaving latent and textual reasoning steps. At its core, it combines a progressive alignment objective that explicitly regulates latent representations across iterations with structured annotations for text-latent interleaving, thereby stabilizing latent updates and maintaining coherence with textual reasoning. Across mathematical, logical, and commonsense reasoning tasks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
