COPAL: Continual Pruning in Large Language Generative Models

Srikanth Malla; Joon Hee Choi; Chiho Choi

arXiv:2405.02347·cs.LG·June 18, 2024

COPAL: Continual Pruning in Large Language Generative Models

Srikanth Malla, Joon Hee Choi, Chiho Choi

PDF

Open Access

TL;DR

COPAL is a novel pruning algorithm that enables efficient continual adaptation of large language models to new domains without retraining, using sensitivity analysis to identify relevant weights.

Contribution

COPAL introduces a sensitivity-guided pruning method for continual domain adaptation in large language models, avoiding costly retraining.

Findings

01

Outperforms baseline models in efficiency and adaptability

02

Enables seamless domain adaptation without retraining

03

Effective across various model sizes

Abstract

Adapting pre-trained large language models to different domains in natural language processing requires two key considerations: high computational demands and model's inability to continual adaptation. To simultaneously address both issues, this paper presents COPAL (COntinual Pruning in Adaptive Language settings), an algorithm developed for pruning large language generative models under a continual model adaptation setting. While avoiding resource-heavy finetuning or retraining, our pruning process is guided by the proposed sensitivity analysis. The sensitivity effectively measures model's ability to withstand perturbations introduced by the new dataset and finds model's weights that are relevant for all encountered datasets. As a result, COPAL allows seamless model adaptation to new domains while enhancing the resource efficiency. Our empirical evaluation on a various size of LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Speech and dialogue systems

MethodsPruning