When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models

Youngsik Yoon; Siwei Wang; Wei Chen; Jungseul Ok

arXiv:2605.07260·cs.LG·May 11, 2026

When Are Experts Misrouted? Counterfactual Routing Analysis in Mixture-of-Experts Language Models

Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok

PDF

TL;DR

This paper investigates how routing decisions in Mixture-of-Experts language models can be suboptimal, especially on fragile tokens, and proposes a minimal router update to improve routing utility and task performance.

Contribution

It introduces a counterfactual routing analysis revealing misrouting issues and demonstrates that simple router updates can significantly enhance model performance.

Findings

01

Standard routing aligns well on confident tokens but poorly on fragile tokens.

02

Lower-loss alternative routes exist inside the model but are not selected by the current router.

03

Minimal router-only updates improve task performance on benchmark datasets.

Abstract

Mixture-of-Experts (MoE) language models route each token to a small subset of experts, but whether the routes selected by a trained top- $k$ router are good ones is rarely evaluated directly. Holding the model fixed, we compare each standard route against sampled equal-compute alternatives for the same token and score each by the next-token probability it assigns to the realized token in a verified reasoning trajectory. The result is sharply token-conditional: the standard router is well-aligned with route utility on confident tokens but uninformative on the fragile tokens that drive hard reasoning, where lower-loss equal-compute routes consistently exist inside the frozen model but are not selected. The same pattern holds across Qwen3-30B-A3B, GPT-OSS-20B, DeepSeek-V2-Lite, and OLMoE-1B-7B, and follows structurally from how standard top- $k$ training evaluates routing decisions: the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.