$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts
Guanjie Chen, Xinyu Zhao, Tianlong Chen, Yu Cheng

TL;DR
This paper introduces MoE-RBench, a comprehensive framework for assessing the reliability of sparse mixture-of-experts language models across safety, robustness, and adversarial resilience, highlighting how proper training improves their dependability.
Contribution
It provides the first systematic evaluation of MoE models' reliability, comparing them to dense models, and offers insights into training practices for more dependable language models.
Findings
MoE models can be more reliable than dense models with proper training.
Robustness of MoE is highly sensitive to training settings.
Proper hyperparameters and inference techniques improve MoE reliability.
Abstract
Mixture-of-Experts (MoE) has gained increasing popularity as a promising framework for scaling up large language models (LLMs). However, the reliability assessment of MoE lags behind its surging applications. Moreover, when transferred to new domains such as in fine-tuning MoE models sometimes underperform their dense counterparts. Motivated by the research gap and counter-intuitive phenomenon, we propose , the first comprehensive assessment of SMoE reliability from three aspects: safety and hallucination, resilience to adversarial attacks, and out-of-distribution robustness. Extensive models and datasets are tested to compare the MoE to dense networks from these reliability dimensions. Our empirical observations suggest that with appropriate hyperparameters, training recipes, and inference techniques, we can build the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Machine Learning in Healthcare
MethodsMixture of Experts
