Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection
Tianyi Niu, Justin Chih-Yao Chen, Genta Indra Winata, Shi-Xiong Zhang, Supriyo Chakraborty, Sambit Sahu, Yue Zhang, Elias Stengel-Eskin, Mohit Bansal

TL;DR
This paper introduces Routing with Generated Data (RGD), a method for LLM model selection using only generated queries and answers, and proposes CASCAL, a robust query-only router that outperforms existing methods especially with lower-quality generators.
Contribution
The paper presents RGD for training routers without ground-truth data and introduces CASCAL, a new query-only routing approach that is more robust to generator quality variations.
Findings
Query-answer routers degrade faster with lower generator quality.
Effective generators must produce diverse and accurate responses.
CASCAL outperforms existing routers by 4.6% accuracy with weak generator data.
Abstract
Large Language Model (LLM) routers dynamically select optimal models for given inputs. Existing approaches typically assume access to ground-truth labeled data, which is often unavailable in practice, especially when user request distributions are heterogeneous and unknown. We introduce Routing with Generated Data (RGD), a challenging setting in which routers are trained exclusively on generated queries and answers produced from high-level task descriptions by generator LLMs. We evaluate query-answer routers (using both queries and labels) and query-only routers across four diverse benchmarks and 12 models, finding that query-answer routers degrade faster than query-only routers as generator quality decreases. Our analysis reveals two crucial characteristics of effective generators: they must accurately respond to their own questions, and their questions must produce sufficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Big Data and Digital Economy · Natural Language Processing Techniques
