When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models
Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok

TL;DR
This paper investigates how routing decisions in Mixture-of-Experts language models can be suboptimal, especially on fragile tokens, and proposes a minimal router update to improve routing utility and task performance.
Contribution
It introduces a counterfactual routing analysis revealing misrouting issues and demonstrates that simple router updates can significantly enhance model performance.
Findings
Standard routing aligns well on confident tokens but poorly on fragile tokens.
Lower-loss alternative routes exist inside the model but are not selected by the current router.
Minimal router-only updates improve task performance on benchmark datasets.
Abstract
Mixture-of-Experts (MoE) language models route each token to a small subset of experts, but whether the routes selected by a trained top- router are good ones is rarely evaluated directly. Holding the model fixed, we compare each standard route against sampled equal-compute alternatives for the same token and score each by the next-token probability it assigns to the realized token in a verified reasoning trajectory. The result is sharply token-conditional: the standard router is well-aligned with route utility on confident tokens but uninformative on the fragile tokens that drive hard reasoning, where lower-loss equal-compute routes consistently exist inside the frozen model but are not selected. The same pattern holds across Qwen3-30B-A3B, GPT-OSS-20B, DeepSeek-V2-Lite, and OLMoE-1B-7B, and follows structurally from how standard top- training evaluates routing decisions: the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
