SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity   Mixture

Jiayi Han; Liang Du; Hongwei Du; Xiangguo Zhou; Yiwen Wu; Weibo Zheng,; Donghong Han

arXiv:2410.07739·cs.LG·January 29, 2025

SLIM: Let LLM Learn More and Forget Less with Soft LoRA and Identity Mixture

Jiayi Han, Liang Du, Hongwei Du, Xiangguo Zhou, Yiwen Wu, Weibo Zheng,, Donghong Han

PDF

Open Access

TL;DR

SLIM introduces a novel mixture of expert framework combining Soft LoRA and Identity Mixture to enable efficient fine-tuning of large language models, reducing forgetting while maintaining high downstream task performance.

Contribution

The paper proposes SLIM, a mixture of experts approach with dynamic routing and fast adapter merging, to improve PEFT by balancing task adaptation and retention of general capabilities.

Findings

01

SLIM achieves comparable performance to state-of-the-art PEFT methods.

02

SLIM significantly mitigates catastrophic forgetting.

03

SLIM maintains model generality while adapting to downstream tasks.

Abstract

Although many efforts have been made, it is still a challenge to balance the training budget, downstream performance, and the general capabilities of the LLMs in many applications. Training the whole model for downstream tasks is expensive, and could easily result in catastrophic forgetting. By introducing parameter-efficient fine-tuning (PEFT), the training cost could be reduced, but it still suffers from forgetting, and limits the learning on the downstream tasks. To efficiently fine-tune the LLMs with less limitation to their downstream performance while mitigating the forgetting of general capabilities, we propose a novel mixture of expert (MoE) framework based on Soft LoRA and Identity Mixture (SLIM), that allows dynamic routing between LoRA adapters and skipping connection, enables the suppression of forgetting. We adopt weight-yielding with sliding clustering for better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Handwritten Text Recognition Techniques

MethodsBalanced Selection