Adaptive-expert-weight-based load balance scheme for dynamic routing of MoE
Jialin Wen, Xiaojun Li, Junping Yao, Xinyan Kong, Peng Cheng

TL;DR
This paper introduces a new routing strategy for MoE models that balances expert loads dynamically, improving efficiency without sacrificing accuracy.
Contribution
A threshold-based dynamic routing algorithm that adjusts expert weights to balance loads without using auxiliary loss functions.
Findings
The proposed method achieves comparable accuracy to top-2 routing while significantly reducing load standard deviation.
Threshold-based dynamic expert activation reduces model parameters and mitigates load imbalance.
Experiments on NLU benchmarks show improved computational efficiency and balanced expert usage.
Abstract
Load imbalance is a major performance bottleneck in training mixture-of-experts (MoE) models, as unbalanced expert loads can lead to routing collapse. Most existing approaches address this issue by introducing auxiliary loss functions to balance the load; however, the hyperparameters within these loss functions often need to be tuned for different tasks. Furthermore, increasing the number of activated experts tends to exacerbate load imbalance, while fixing the activation count can reduce the model’s confidence in handling difficult tasks. To address these challenges, this paper proposes a dynamically balanced routing strategy that employs a threshold-based dynamic routing algorithm. After each routing step, the method adjusts expert weights to influence the load distribution in the subsequent routing. Unlike loss-function-based balancing methods, our approach operates directly at the…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Agent-Based Network Management · Network Traffic and Congestion Control · Software-Defined Networks and 5G
