Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis

Junzhuo Li; Bo Wang; Xiuze Zhou; Peijie Jiang; Jia Liu; Xuming Hu

arXiv:2505.24593·cs.CL·June 12, 2025

Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis

Junzhuo Li, Bo Wang, Xiuze Zhou, Peijie Jiang, Jia Liu, Xuming Hu

PDF

Open Access 1 Video

TL;DR

This paper introduces a framework for understanding knowledge attribution in sparse Mixture-of-Experts models, revealing how dynamic routing and collaboration improve efficiency, robustness, and task-specific specialization.

Contribution

It proposes a cross-level attribution algorithm and a basic-refinement framework, providing new insights into MoE interpretability, efficiency, and robustness across different architectures and tasks.

Findings

01

MoE models achieve 37% higher per-layer efficiency.

02

Deep MoE architectures mitigate expert failures through shared redundancy.

03

Semantic-driven routing correlates strongly with expert specialization.

Abstract

The interpretability of Mixture-of-Experts (MoE) models, especially those with heterogeneous designs, remains underexplored. Existing attribution methods for dense models fail to capture dynamic routing-expert interactions in sparse MoE architectures. To address this issue, we propose a cross-level attribution algorithm to analyze sparse MoE architectures (Qwen 1.5-MoE, OLMoE, Mixtral-8x7B) against dense models (Qwen 1.5-7B, Llama-7B, Mistral-7B). Results show MoE models achieve 37% higher per-layer efficiency via a "mid-activation, late-amplification" pattern: early layers screen experts, while late layers refine knowledge collaboratively. Ablation studies reveal a "basic-refinement" framework--shared experts handle general tasks (entity recognition), while routed experts specialize in domain-specific processing (geographic attributes). Semantic-driven routing is evidenced by strong…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis· underline

Taxonomy

TopicsMobile Crowdsensing and Crowdsourcing · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsSoftmax · Attention Is All You Need · Mixture of Experts