Efficient Reasoning via Thought-Training and Thought-Free Inference
Canhui Wu, Qiong Cao, Chao Xue, Wei Xi, Xiaodong He

TL;DR
This paper introduces 3TF, a framework that trains models to perform complex reasoning internally while producing concise, thought-free outputs externally, improving efficiency without sacrificing reasoning quality.
Contribution
The paper proposes a hybrid training approach enabling models to internalize reasoning processes and generate short, thought-free outputs at inference, distinct from compression-based methods.
Findings
3TF-trained models outperform baselines on reasoning benchmarks.
Models can perform implicit reasoning without explicit step-by-step outputs.
Thought-free inference maintains high reasoning accuracy.
Abstract
Recent advances in large language models (LLMs) have leveraged explicit Chain-of-Thought (CoT) prompting to improve reasoning accuracy. However, most existing methods primarily focus on compressing verbose reasoning outputs. These Long-to-Short transformations aim to improve efficiency, but require a large amount of short CoT data. In this work, we introduce \textbf{3TF} (\textbf{T}hought-\textbf{T}raining and \textbf{T}hought-\textbf{F}ree inference), a framework for efficient reasoning that takes a Short-to-Long perspective. We first train a hybrid model that can operate in both reasoning and non-reasoning modes, and then further train it on CoT-annotated data to internalize structured reasoning, while enforcing concise, thought-free outputs at inference time using the no-reasoning mode. Unlike compression-based approaches, 3TF improves the reasoning quality of non-reasoning outputs,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)
