Think When Needed: Model-Aware Reasoning Routing for LLM-based Ranking
Huizhong Guo, Tianjun Wei, Dongxia Wang, Yingpeng Du, Ziyan Wang, Jie Zhang, Zhu Sun

TL;DR
This paper introduces a reasoning routing framework for LLM-based ranking that dynamically decides when to apply reasoning, improving ranking performance while reducing computational costs by using pre-generation signals and adaptive policy selection.
Contribution
It proposes a lightweight, plug-and-play router head that determines whether to reason or not before generation, optimizing the accuracy-efficiency trade-off in LLM ranking tasks.
Findings
Achieves +6.3% NDCG@10 on MovieLens with fewer tokens
Reduces token consumption by 49.5% while maintaining performance
Demonstrates effectiveness across multiple datasets and models
Abstract
Large language models (LLMs) are increasingly applied to ranking tasks in retrieval and recommendation. Although reasoning prompting can enhance ranking utility, our preliminary exploration reveals that its benefits are inconsistent and come at a substantial computational cost, suggesting that when to reason is as crucial as how to reason. To address this issue, we propose a reasoning routing framework that employs a lightweight, plug-and-play router head to decide whether to use direct inference (Non-Think) or reasoning (Think) for each instance before generation. The router head relies solely on pre-generation signals: i) compact ranking-aware features (e.g., candidate dispersion) and ii) model-aware difficulty signals derived from a diagnostic checklist reflecting the model's estimated need for reasoning. By leveraging these features before generation, the router outputs a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Expert finding and Q&A systems
