Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis
Junzhuo Li, Bo Wang, Xiuze Zhou, Peijie Jiang, Jia Liu, Xuming Hu

TL;DR
This paper introduces a framework for understanding knowledge attribution in sparse Mixture-of-Experts models, revealing how dynamic routing and collaboration improve efficiency, robustness, and task-specific specialization.
Contribution
It proposes a cross-level attribution algorithm and a basic-refinement framework, providing new insights into MoE interpretability, efficiency, and robustness across different architectures and tasks.
Findings
MoE models achieve 37% higher per-layer efficiency.
Deep MoE architectures mitigate expert failures through shared redundancy.
Semantic-driven routing correlates strongly with expert specialization.
Abstract
The interpretability of Mixture-of-Experts (MoE) models, especially those with heterogeneous designs, remains underexplored. Existing attribution methods for dense models fail to capture dynamic routing-expert interactions in sparse MoE architectures. To address this issue, we propose a cross-level attribution algorithm to analyze sparse MoE architectures (Qwen 1.5-MoE, OLMoE, Mixtral-8x7B) against dense models (Qwen 1.5-7B, Llama-7B, Mistral-7B). Results show MoE models achieve 37% higher per-layer efficiency via a "mid-activation, late-amplification" pattern: early layers screen experts, while late layers refine knowledge collaboratively. Ablation studies reveal a "basic-refinement" framework--shared experts handle general tasks (entity recognition), while routed experts specialize in domain-specific processing (geographic attributes). Semantic-driven routing is evidenced by strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsSoftmax · Attention Is All You Need · Mixture of Experts
