CP-Router: An Uncertainty-Aware Router Between LLM and LRM
Jiayuan Su, Fulin Lin, Zhaopeng Feng, Han Zheng, Teng Wang, Zhenyu Xiao, Xinlong Zhao, Zuozhu Liu, Lu Cheng, Hongwei Wang

TL;DR
CP-Router is a model-agnostic, uncertainty-based routing framework that dynamically chooses between LLMs and LRMs for efficient and accurate question answering, reducing token usage while maintaining or improving performance.
Contribution
It introduces a training-free, uncertainty-guided routing method using conformal prediction and entropy-based criteria, enhancing LLM and LRM collaboration for diverse QA tasks.
Findings
Reduces token usage while maintaining accuracy in MCQA tasks.
Demonstrates robustness across diverse model pairings and open-ended QA.
Achieves improved efficiency and performance in multiple benchmarks.
Abstract
Recent advances in Large Reasoning Models (LRMs) have significantly improved long-chain reasoning capabilities over Large Language Models (LLMs). However, LRMs often produce unnecessarily lengthy outputs even for simple queries, leading to inefficiencies or even accuracy degradation compared to LLMs. To overcome this, we propose CP-Router, a training-free and model-agnostic routing framework that dynamically selects between an LLM and an LRM, demonstrated with multiple-choice question answering (MCQA) prompts. The routing decision is guided by the prediction uncertainty estimates derived via Conformal Prediction (CP), which provides rigorous coverage guarantees. To further refine the uncertainty differentiation across inputs, we introduce Full and Binary Entropy (FBE), a novel entropy-based criterion that adaptively selects the appropriate CP threshold. Experiments across diverse MCQA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Data Storage Technologies · Network Packet Processing and Optimization
