Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning

Jinghui Lu; Haiyang Yu; Siliang Xu; Shiwei Ran; Guozhi Tang; Siqi Wang; Bin Shan; Teng Fu; Hao Feng; Jingqun Tang; Han Wang; Can Huang

arXiv:2505.15154·cs.CL·May 22, 2025

TL;DR

This paper introduces Certainty-based Adaptive Reasoning (CAR), a framework that dynamically switches between short answers and long-form reasoning in LLMs/MLLMs based on the model's confidence, improving efficiency and accuracy over fixed reasoning strategies.

Contribution

The paper proposes an adaptive reasoning framework that uses the perplexity of a short answer to decide when long-form reasoning is needed, avoiding the longer outputs and possible accuracy loss of always reasoning at length.

Findings

01. CAR outperforms fixed reasoning strategies on multimodal and text reasoning tasks.

02. Using perplexity to trigger reasoning improves efficiency without sacrificing accuracy.

03. Prolonged reasoning can degrade performance on simpler tasks.

Abstract

Recent advancements in reasoning have significantly enhanced the capabilities of Large Language Models (LLMs) and Multimodal Large Language Models (MLLMs) across diverse tasks. However, excessive reliance on chain-of-thought (CoT) reasoning can impair model performance and produce unnecessarily long outputs, reducing efficiency. Our work reveals that prolonged reasoning does not universally improve accuracy and can even degrade performance on simpler tasks. To address this, we propose Certainty-based Adaptive Reasoning (CAR), a novel framework that dynamically switches between short answers and long-form reasoning based on model perplexity. CAR first generates a short answer and evaluates its perplexity, triggering reasoning only when the model exhibits low confidence (i.e., high perplexity). Experiments across diverse multimodal VQA/KIE benchmarks and text reasoning datasets show…
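
The routing step described in the abstract can be illustrated with a minimal sketch. It assumes perplexity is computed as the exponential of the negative mean token log-probability of the short answer, and that a single fixed threshold separates confident from unconfident answers; the abstract does not say how the threshold is chosen. The names `perplexity`, `route`, `short_answer_fn`, `long_reasoning_fn`, and `ppl_threshold`, along with the two stub models, are hypothetical and are not taken from the paper.

```python
import math
from typing import Callable, List, Tuple

# Hypothetical interface: a short-answer model returns the answer text
# together with the log-probability of each generated token.
ShortAnswerFn = Callable[[str], Tuple[str, List[float]]]
LongReasoningFn = Callable[[str], str]


def perplexity(token_logprobs: List[float]) -> float:
    """Exponential of the negative mean token log-probability."""
    if not token_logprobs:
        raise ValueError("need at least one token log-probability")
    return math.exp(-sum(token_logprobs) / len(token_logprobs))


def route(
    question: str,
    short_answer_fn: ShortAnswerFn,
    long_reasoning_fn: LongReasoningFn,
    ppl_threshold: float,
) -> Tuple[str, str, float]:
    """Answer briefly first; fall back to long reasoning if perplexity is high.

    Returns (answer, mode, perplexity_of_short_answer).
    The fixed threshold is an assumption made for this sketch.
    """
    answer, logprobs = short_answer_fn(question)
    ppl = perplexity(logprobs)
    if ppl <= ppl_threshold:
        # Low perplexity: the model is confident, so keep the short answer.
        return answer, "short", ppl
    # High perplexity: low confidence, so trigger long-form reasoning.
    return long_reasoning_fn(question), "long", ppl


# Stub models for illustration only; a real system would call an LLM/MLLM.
def confident_short(question: str) -> Tuple[str, List[float]]:
    return "42", [math.log(0.9)] * 3


def unsure_short(question: str) -> Tuple[str, List[float]]:
    return "7", [math.log(0.2)] * 3


def stub_long_reasoning(question: str) -> str:
    return "step-by-step answer"
```

With these stubs, `route("q", confident_short, stub_long_reasoning, 2.0)` keeps the short answer, while the same call with `unsure_short` falls through to long-form reasoning. The cost saving comes from skipping the long generation whenever the short answer is already confident.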

Taxonomy

Topics: Natural Language Processing Techniques · Semantic Web and Ontologies · Multi-Agent Systems and Negotiation