InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning   Large Language Models with Instructions

Yifan Wang; Yafei Liu; Chufan Shi; Haoling Li; Chen Chen; Haonan Lu,; Yujiu Yang

arXiv:2403.11435·cs.CL·March 19, 2024·1 cites

InsCL: A Data-efficient Continual Learning Paradigm for Fine-tuning Large Language Models with Instructions

Yifan Wang, Yafei Liu, Chufan Shi, Haoling Li, Chen Chen, Haonan Lu,, Yujiu Yang

PDF

Open Access

TL;DR

InsCL introduces a novel instruction-based continual learning paradigm that dynamically replays data based on task similarity and instruction complexity, significantly improving LLM fine-tuning efficiency and performance across multiple tasks.

Contribution

The paper proposes InsCL, a new replay strategy utilizing instruction similarity and complexity metrics to enhance continual learning for large language models.

Findings

01

Achieves 3.0% relative gain over random replay

02

Achieves 27.96% relative gain over no replay

03

Consistent performance improvements across 16 tasks

Abstract

Instruction tuning effectively optimizes Large Language Models (LLMs) for downstream tasks. Due to the changing environment in real-life applications, LLMs necessitate continual task-specific adaptation without catastrophic forgetting. Considering the heavy computational cost, replay-based Continual Learning (CL) methods are the simplest and most widely used for LLMs to address the forgetting issue. However, traditional replay-based methods do not fully utilize instructions to customize the replay strategy. In this work, we propose a novel paradigm called Instruction-based Continual Learning (InsCL). InsCL dynamically replays previous data based on task similarity, calculated by Wasserstein Distance with instructions. Moreover, we further introduce an Instruction Information Metric (InsInfo) to quantify the complexity and diversity of instructions. According to InsInfo, InsCL guides the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech Recognition and Synthesis