One Network, Many Masks: Towards More Parameter-Efficient Transfer   Learning

Guangtao Zeng; Peiyuan Zhang; Wei Lu

arXiv:2305.17682·cs.CL·June 13, 2023·1 cites

One Network, Many Masks: Towards More Parameter-Efficient Transfer Learning

Guangtao Zeng, Peiyuan Zhang, Wei Lu

PDF

Open Access 1 Repo

TL;DR

This paper introduces PROPETL, a parameter-efficient transfer learning method that shares a single prototype network across tasks and layers, using binary masks to select sub-networks, significantly reducing storage while maintaining performance.

Contribution

Proposes PROPETL, a novel PETL approach that shares a prototype network across tasks and layers with binary masks for sub-network selection, enhancing efficiency.

Findings

01

Outperforms existing PETL methods in various tasks.

02

Uses approximately 10% of the parameter storage of previous methods.

03

Demonstrates the effectiveness of binary masks in identifying crucial network information.

Abstract

Fine-tuning pre-trained language models for multiple tasks tends to be expensive in terms of storage. To mitigate this, parameter-efficient transfer learning (PETL) methods have been proposed to address this issue, but they still require a significant number of parameters and storage when being applied to broader ranges of tasks. To achieve even greater storage reduction, we propose PROPETL, a novel method that enables efficient sharing of a single PETL module which we call prototype network (e.g., adapter, LoRA, and prefix-tuning) across layers and tasks. We then learn binary masks to select different sub-networks from the shared prototype network and apply them as PETL modules into different layers. We find that the binary masks can determine crucial information from the network, which is often ignored in previous studies. Our work can also be seen as a type of pruning method, where…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chaoscodes/propetl
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsPruning