Task-specific Compression for Multi-task Language Models using   Attribution-based Pruning

Nakyeong Yang; Yunah Jang; Hwanhee Lee; Seohyeong Jung; Kyomin Jung

arXiv:2205.04157·cs.CL·February 14, 2023·1 cites

Task-specific Compression for Multi-task Language Models using Attribution-based Pruning

Nakyeong Yang, Yunah Jang, Hwanhee Lee, Seohyeong Jung, Kyomin Jung

PDF

Open Access

TL;DR

This paper introduces a training-free, attribution-based pruning method to compress multi-task language models by removing unnecessary neurons for specific tasks, preserving performance and reducing resource usage.

Contribution

It presents a novel, training-free pruning approach that selectively removes neurons based on attribution scores for task-specific compression of multi-task models.

Findings

01

Outperforms baseline pruning methods on six datasets.

02

Preserves performance in unseen domain settings.

03

Effective in low-resource and unsupervised scenarios.

Abstract

Multi-task language models show outstanding performance for various natural language understanding tasks with only a single model. However, these language models utilize an unnecessarily large number of model parameters, even when used only for a specific task. This paper proposes a novel training-free compression method for multi-task language models using a pruning method. Specifically, we use an attribution method to determine which neurons are essential for performing a specific task. We task-specifically prune unimportant neurons and leave only task-specific parameters. Furthermore, we extend our method to be applicable in low-resource and unsupervised settings. Since our compression method is training-free, it uses few computing resources and does not destroy the pre-trained knowledge of language models. Experimental results on the six widely-used datasets show that our proposed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

MethodsPruning