Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning

Yang He; Xiao Ding; Bibo Cai; Yufei Zhang; Kai Xiong; Zhouhao Sun; Bing Qin; Ting Liu

arXiv:2505.20664·cs.CL·May 28, 2025

Self-Route: Automatic Mode Switching via Capability Estimation for Efficient Reasoning

Yang He, Xiao Ding, Bibo Cai, Yufei Zhang, Kai Xiong, Zhouhao Sun, Bing Qin, Ting Liu

PDF

Open Access

TL;DR

Self-Route is a dynamic reasoning framework that automatically switches between modes in large language models to improve efficiency, reducing token usage by 30-55% without sacrificing accuracy.

Contribution

It introduces a capability estimation-based router and a new dataset for training, enabling real-time mode switching in reasoning models.

Findings

01

Reduces token consumption by up to 55%.

02

Maintains comparable accuracy to reasoning models.

03

Effective across various model sizes and paradigms.

Abstract

While reasoning-augmented large language models (RLLMs) significantly enhance complex task performance through extended reasoning chains, they inevitably introduce substantial unnecessary token consumption, particularly for simpler problems where Short Chain-of-Thought (Short CoT) suffices. This overthinking phenomenon leads to inefficient resource usage without proportional accuracy gains. To address this issue, we propose Self-Route, a dynamic reasoning framework that automatically selects between general and reasoning modes based on model capability estimation. Our approach introduces a lightweight pre-inference stage to extract capability-aware embeddings from hidden layer representations, enabling real-time evaluation of the model's ability to solve problems. We further construct Gradient-10K, a model difficulty estimation-based dataset with dense complexity sampling, to train the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFuzzy Logic and Control Systems