Soft Thinking: Unlocking the Reasoning Potential of LLMs in Continuous Concept Space
Zhen Zhang, Xuehai He, Weixiang Yan, Ao Shen, Chenyang Zhao, Shuohang Wang, Yelong Shen, Xin Eric Wang

TL;DR
Soft Thinking introduces a continuous concept space approach for reasoning with large language models, enabling smoother, more expressive reasoning paths that improve accuracy and efficiency over traditional discrete token methods.
Contribution
It proposes a training-free, continuous concept space method that enhances reasoning capabilities of LLMs by generating soft, abstract tokens, surpassing discrete token limitations.
Findings
Improves pass@1 accuracy by up to 2.48 points
Reduces token usage by up to 22.4%
Maintains high interpretability and readability
Abstract
Human cognition typically involves thinking through abstract, fluid concepts rather than strictly using discrete linguistic tokens. Current reasoning models, however, are constrained to reasoning within the boundaries of human language, processing discrete token embeddings that represent fixed points in the semantic space. This discrete constraint restricts the expressive power and upper potential of such reasoning models, often causing incomplete exploration of reasoning paths, as standard Chain-of-Thought (CoT) methods rely on sampling one token per step. In this work, we introduce Soft Thinking, a training-free method that emulates human-like "soft" reasoning by generating soft, abstract concept tokens in a continuous concept space. These concept tokens are created by the probability-weighted mixture of token embeddings, which form the continuous concept space, enabling smooth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsCollaboration in agile enterprises · Scheduling and Optimization Algorithms
