Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Zhen Zhang; Xuehai He; Weixiang Yan; Ao Shen; Chenyang Zhao; Shuohang Wang; Yelong Shen; Xin Eric Wang

arXiv:2505.15778·cs.CL·May 22, 2025

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space

Zhen Zhang, Xuehai He, Weixiang Yan, Ao Shen, Chenyang Zhao, Shuohang Wang, Yelong Shen, Xin Eric Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

Soft Thinking introduces a continuous concept space approach for reasoning with large language models, enabling smoother, more expressive reasoning paths that improve accuracy and efficiency over traditional discrete token methods.

Contribution

It proposes a training-free, continuous concept space method that enhances reasoning capabilities of LLMs by generating soft, abstract tokens, surpassing discrete token limitations.

Findings

01

Improves pass@1 accuracy by up to 2.48 points

02

Reduces token usage by up to 22.4%

03

Maintains high interpretability and readability

Abstract

Human cognition typically involves thinking through abstract, fluid concepts rather than strictly using discrete linguistic tokens. Current reasoning models, however, are constrained to reasoning within the boundaries of human language, processing discrete token embeddings that represent fixed points in the semantic space. This discrete constraint restricts the expressive power and upper potential of such reasoning models, often causing incomplete exploration of reasoning paths, as standard Chain-of-Thought (CoT) methods rely on sampling one token per step. In this work, we introduce Soft Thinking, a training-free method that emulates human-like "soft" reasoning by generating soft, abstract concept tokens in a continuous concept space. These concept tokens are created by the probability-weighted mixture of token embeddings, which form the continuous concept space, enabling smooth…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eric-ai-lab/soft-thinking
pytorchOfficial

Videos

Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space· slideslive

Taxonomy

TopicsCollaboration in agile enterprises · Scheduling and Optimization Algorithms