Awakening Dormant Experts: Counterfactual Routing to Mitigate MoE Hallucinations

Wentao Hu; Yanbo Zhai; Xiaohui Hu; Mingkuan Zhao; Shanhong yu; Xue Liu; Kaidong Yu; Shuangyong Song; Xuelong Li

arXiv:2604.14246 · cs.LG · April 30, 2026

TL;DR

This paper introduces Counterfactual Routing (CoR), a method that activates dormant experts in sparse MoE models so their long-tail knowledge is used at inference time, improving factual accuracy without extra inference cost.

Contribution

The paper proposes a training-free inference framework that awakens dormant experts in MoE models using layer-wise perturbation and counterfactual impact analysis.

Findings

01. CoR improves factual accuracy by 3.1% on average.

02. It maintains a constant inference budget while enhancing knowledge retrieval.

03. Experiments on multiple datasets validate the effectiveness of CoR.

Abstract

Sparse Mixture-of-Experts (MoE) models have achieved remarkable scalability, yet they remain vulnerable to hallucinations, particularly when processing long-tail knowledge. We identify that this fragility stems from static Top-$k$ routing: routers tend to favor high-frequency patterns over rare factual associations. Consequently, "specialist experts" possessing critical long-tail knowledge are often assigned low gating scores and remain "dormant": under-prioritized for specific tokens despite their proven causal importance on other inputs. To address this, we propose Counterfactual Routing (CoR), a training-free inference framework designed to awaken these dormant experts. CoR integrates layer-wise perturbation analysis with the Counterfactual Expert Impact (CEI) metric to dynamically shift computational resources from syntax-dominant to knowledge-intensive layers while…
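For readers unfamiliar with the baseline the abstract criticizes, the following is a minimal sketch of generic static Top-$k$ routing for a single token. It is not the CoR method or the CEI metric, neither of which is specified in the text above. The function names, the example logits, and the choice to renormalize weights over the selected experts are all assumptions made for illustration; real MoE implementations differ in these details.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of router logits."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_route(router_logits, k):
    """Generic static Top-k routing for one token (illustrative only).

    Returns the selected expert indices, their gate weights renormalized
    over the selected set (an assumption; conventions vary), and the
    indices of experts left unused ("dormant") for this token.
    """
    probs = softmax(router_logits)
    # Rank experts by gating score, highest first.
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    selected = ranked[:k]
    dormant = ranked[k:]
    norm = sum(probs[i] for i in selected)
    weights = {i: probs[i] / norm for i in selected}
    return selected, weights, dormant


# Hypothetical router logits for one token over four experts.
logits = [2.0, 1.5, 0.2, -1.0]
selected, weights, dormant = top_k_route(logits, k=2)
print("selected:", selected)
print("dormant:", dormant)
```

In this sketch, experts 2 and 3 contribute nothing to the token's output regardless of what knowledge they hold, because selection depends only on the gating score. That is the behavior the abstract describes as leaving specialist experts dormant.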
