System-1.5 Reasoning: Traversal in Language and Latent Spaces with Dynamic Shortcuts
Xiaoqiang Wang, Suyuchen Wang, Yun Zhu, Bang Liu

TL;DR
System-1.5 Reasoning introduces an adaptive latent-space reasoning framework with dynamic shortcuts, significantly improving efficiency and maintaining high reasoning performance in large language models.
Contribution
It proposes a novel adaptive reasoning method with dynamic shortcuts in latent space, combining early exiting and hidden state reuse to enhance efficiency and reasoning quality.
Findings
Achieves comparable accuracy to traditional CoT methods on GSM8K.
Speeds up inference by over 20 times.
Reduces token generation by 92.31% on average.
Abstract
Chain-of-thought (CoT) reasoning enables large language models (LLMs) to move beyond fast System-1 responses and engage in deliberative System-2 reasoning. However, this comes at the cost of significant inefficiency due to verbose intermediate output. Recent latent-space reasoning methods improve efficiency by operating on hidden states without decoding into language, yet they treat all steps uniformly, failing to distinguish critical deductions from auxiliary steps and resulting in suboptimal use of computational resources. In this paper, we propose System-1.5 Reasoning, an adaptive reasoning framework that dynamically allocates computation across reasoning steps through shortcut paths in latent space. Specifically, System-1.5 Reasoning introduces two types of dynamic shortcuts. The model depth shortcut (DS) adaptively reasons along the vertical depth by early exiting non-critical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Natural Language Processing Techniques
MethodsAttention Is All You Need · Linear Layer · Layer Normalization · Byte Pair Encoding · Residual Connection · Dense Connections · Adapter · Softmax · Early exiting using confidence measures · Position-Wise Feed-Forward Layer
