The Illusion of Specialization: Unveiling the Domain-Invariant "Standing Committee" in Mixture-of-Experts Models
Yan Wang, Yitao Xu, Nanhan Shen, Jinyan Su, Jimin Huang, Zining Zhu

TL;DR
This paper reveals that Mixture-of-Experts models rely on a domain-invariant 'Standing Committee' of experts, challenging the assumption of domain-specific specialization and highlighting a structural bias toward centralized computation.
Contribution
Introduces COMMITTEEAUDIT, a framework for analyzing expert routing, and uncovers a universal, domain-invariant 'Standing Committee' that dominates routing across models and domains.
Findings
Standing Committees consistently capture most routing mass across domains
Qualitative analysis shows Committees anchor reasoning and syntax
Peripheral experts handle domain-specific knowledge
Abstract
Mixture of Experts models are widely assumed to achieve domain specialization through sparse routing. In this work, we question this assumption by introducing COMMITTEEAUDIT, a post hoc framework that analyzes routing behavior at the level of expert groups rather than individual experts. Across three representative models and the MMLU benchmark, we uncover a domain-invariant Standing Committee. This is a compact coalition of routed experts that consistently captures the majority of routing mass across domains, layers, and routing budgets, even when architectures already include shared experts. Qualitative analysis further shows that Standing Committees anchor reasoning structure and syntax, while peripheral experts handle domain-specific knowledge. These findings reveal a strong structural bias toward centralized computation, suggesting that specialization in Mixture of Experts models…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Advanced Graph Neural Networks · Complex Network Analysis Techniques
