CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting   Mitigation

Muhammad Fawi

arXiv:2408.14572·cs.LG·August 28, 2024

CURLoRA: Stable LLM Continual Fine-Tuning and Catastrophic Forgetting Mitigation

Muhammad Fawi

PDF

1 Repo

TL;DR

CURLoRA is a novel fine-tuning method for large language models that effectively mitigates catastrophic forgetting and reduces trainable parameters by leveraging a modified CUR matrix decomposition with implicit regularization.

Contribution

It introduces a unique CUR decomposition-based approach with inverted probability selection and zero-initialized U matrix, enhancing continual learning stability and parameter efficiency.

Findings

01

Outperforms standard LoRA in catastrophic forgetting mitigation

02

Maintains model stability and performance across tasks

03

Reduces the number of trainable parameters significantly

Abstract

This paper introduces CURLoRA, a novel approach to fine-tuning large language models (LLMs) that leverages CUR matrix decomposition in the context of Low-Rank Adaptation (LoRA). Our method addresses two critical challenges in LLM fine-tuning: mitigating catastrophic forgetting during continual learning and reducing the number of trainable parameters. We propose a unique modification to the CUR decomposition process, utilizing inverted probabilities for column and row selection which acts as an implicit regularization, and initializing the $U$ matrix as a zero matrix, and only fine-tuning it. We demonstrate through experiments on multiple datasets that CURLoRA outperforms standard LoRA in mitigating catastrophic forgetting. It maintains model stability and performance across tasks while significantly reducing the number of trainable parameters. Our results show that CURLoRA achieves very…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mnoorfawi/curlora
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection