R^2MoE: Redundancy-Removal Mixture of Experts for Lifelong Concept Learning
Xiaohan Guo, Yusong Cai, Zejia Liu, Zhengning Wang, Lili Pan, Hongliang Li

TL;DR
R^2MoE introduces a parameter-efficient mixture-of-experts framework that effectively mitigates catastrophic forgetting and reduces parameter growth in lifelong visual concept learning, enabling high-fidelity image generation.
Contribution
It presents a novel redundancy-removal mixture of experts framework with routing distillation and hierarchical attention, advancing continual learning with minimal parameter overhead.
Findings
Achieves 87.8% reduction in forgetting rates.
Reduces expert parameters by 63.3%.
Outperforms state-of-the-art in conceptual fidelity.
Abstract
Enabling large-scale generative models to continuously learn new visual concepts is essential for personalizing pre-trained models to meet individual user preferences. Existing approaches for continual visual concept learning are constrained by two fundamental challenges: catastrophic forgetting and parameter expansion. In this paper, we propose Redundancy-Removal Mixture of Experts (R^2MoE), a parameter-efficient framework for lifelong visual concept learning that effectively learns new concepts while incurring minimal parameter overhead. Our framework includes three key innovative contributions: First, we propose a mixture-of-experts framework with a routing distillation mechanism that enables experts to acquire concept-specific knowledge while preserving the gating network's routing capability, thereby effectively mitigating catastrophic forgetting. Second, we propose a strategy for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Retrieval and Classification Techniques · Machine Learning and Data Classification · Text and Document Classification Technologies
