$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts

Guanjie Chen; Xinyu Zhao; Tianlong Chen; Yu Cheng

arXiv:2406.11353·cs.LG·June 18, 2024

TL;DR

This paper introduces MoE-RBench, a comprehensive framework for assessing the reliability of sparse mixture-of-experts language models across safety, robustness, and adversarial resilience, highlighting how proper training improves their dependability.

Contribution

It provides the first systematic evaluation of MoE models' reliability, comparing them to dense models, and offers insights into training practices for more dependable language models.

Findings

01. MoE models can be more reliable than dense models with proper training.

02. Robustness of MoE is highly sensitive to training settings.

03. Proper hyperparameters and inference techniques improve MoE reliability.

Abstract

Mixture-of-Experts (MoE) has gained increasing popularity as a promising framework for scaling up large language models (LLMs). However, reliability assessment of MoE lags behind its surging applications. Moreover, when transferred to new domains, such as in fine-tuning, MoE models sometimes underperform their dense counterparts. Motivated by this research gap and counter-intuitive phenomenon, we propose $\texttt{MoE-RBench}$, the first comprehensive assessment of sparse MoE (SMoE) reliability from three aspects: (i) safety and hallucination, (ii) resilience to adversarial attacks, and (iii) out-of-distribution robustness. Extensive models and datasets are tested to compare MoE to dense networks along these reliability dimensions. Our empirical observations suggest that with appropriate hyperparameters, training recipes, and inference techniques, we can build the…

Code & Models

Repositories

unites-lab/moe-rbench (PyTorch, official)

Videos

$\texttt{MoE-RBench}$: Towards Building Reliable Language Models with Sparse Mixture-of-Experts · SlidesLive

Taxonomy

Topics: Topic Modeling · Machine Learning in Healthcare

Methods: Mixture of Experts