SEUF: Is Unlearning One Expert Enough for Mixture-of-Experts LLMs?
Haomin Zhuang, Yihua Zhang, Kehan Guo, Jinghan Jia, Gaowen Liu, Sijia Liu, and Xiangliang Zhang

TL;DR
This paper introduces SEUF, a novel framework for effective and controlled unlearning in Mixture-of-Experts LLMs, addressing challenges posed by their dynamic routing nature and improving unlearning quality and utility.
Contribution
The paper proposes SEUF, a new unlearning framework tailored for MoE LLMs that concentrates on active experts and stabilizes routing, enhancing unlearning effectiveness and utility.
Findings
SEUF improves forget quality by up to 5%.
SEUF increases model utility by 35%.
SEUF unlearns only 0.06% of parameters.
Abstract
Recent advancements in LLMs unlearning have shown remarkable success in removing unwanted data-model influences while preserving the model's utility for legitimate knowledge. Despite these strides, sparse Mixture-of-Experts (MoE) LLMs--a key subset of the LLM family--have remained unexplored in the context of unlearning. As MoE LLMs are celebrated for their exceptional performance, we ask:How can unlearning be performed effectively and efficiently on MoE LLMs? Our pilot study shows that the dynamic routing nature of MoE LLMs introduces unique challenges, leading to excessive forgetting, uncontrolled knowledge erasure and substantial utility drops when existing unlearning methods are applied. To address this, we propose a novel Selected-Expert Unlearning Framework (SEUF). Through expert attribution, unlearning is concentrated on the most actively engaged experts for the specified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSemantic Web and Ontologies
MethodsSoftmax · Attention Is All You Need · Mixture of Experts
