SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Yige Xu; Xu Guo; Zhiwei Zeng; Chunyan Miao

arXiv:2505.11484·cs.CL·May 28, 2025

SoftCoT++: Test-Time Scaling with Soft Chain-of-Thought Reasoning

Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao

PDF

Open Access 1 Repo

TL;DR

SoftCoT++ enhances reasoning in large language models by enabling diverse exploration of continuous latent thoughts through perturbation and contrastive learning, significantly improving performance across multiple benchmarks.

Contribution

It introduces SoftCoT++ which extends SoftCoT by enabling diverse reasoning paths via latent thought perturbation and contrastive learning, improving test-time scaling methods.

Findings

01

SoftCoT++ outperforms SoftCoT with self-consistency scaling.

02

It significantly boosts reasoning performance across five benchmarks.

03

It is compatible with conventional scaling techniques.

Abstract

Test-Time Scaling (TTS) refers to approaches that improve reasoning performance by allocating extra computation during inference, without altering the model's parameters. While existing TTS methods operate in a discrete token space by generating more intermediate steps, recent studies in Coconut and SoftCoT have demonstrated that thinking in the continuous latent space can further enhance the reasoning performance. Such latent thoughts encode informative thinking without the information loss associated with autoregressive token generation, sparking increased interest in continuous-space reasoning. Unlike discrete decoding, where repeated sampling enables exploring diverse reasoning paths, latent representations in continuous space are fixed for a given input, which limits diverse exploration, as all decoded paths originate from the same latent thought. To overcome this limitation, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuyige/softcot
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning

MethodsContrastive Learning