TL;DR
AdapterHub is a framework that simplifies the sharing, integration, and application of adapter layers in pre-trained transformer models, enabling efficient multi-task and multilingual NLP adaptations without full model fine-tuning.
Contribution
It introduces a comprehensive framework built on HuggingFace Transformers for dynamic adapter integration, facilitating easy sharing and task-specific adaptation of large pre-trained models.
Findings
Enables quick and seamless adaptation of models across tasks and languages.
Supports sharing and integrating adapters with minimal code changes.
Facilitates scalable, low-resource NLP applications.
Abstract
The current modus operandi in NLP involves downloading and fine-tuning pre-trained models consisting of millions or billions of parameters. Storing and sharing such large trained models is expensive, slow, and time-consuming, which impedes progress towards more general and versatile NLP methods that learn from and for many tasks. Adapters -- small learnt bottleneck layers inserted within each layer of a pre-trained model -- ameliorate this issue by avoiding full fine-tuning of the entire model. However, sharing and integrating adapter layers is not straightforward. We propose AdapterHub, a framework that allows dynamic "stitching-in" of pre-trained adapters for different tasks and languages. The framework, built on top of the popular HuggingFace Transformers library, enables extremely easy and quick adaptations of state-of-the-art pre-trained models (e.g., BERT, RoBERTa, XLM-R) across…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗AdapterHub/bert-base-uncased_sts_qqp_houlsbymodel
- 🤗AdapterHub/bert-base-uncased_nli_multinli_pfeiffermodel· 6 dl6 dl
- 🤗AdapterHub/bert-base-uncased_nli_qnli_pfeiffermodel
- 🤗AdapterHub/bert-base-uncased_sentiment_sst-2_pfeiffermodel· 15 dl15 dl
- 🤗AdapterHub/bert-base-uncased_nli_rte_houlsbymodel· 2 dl2 dl
- 🤗AdapterHub/bert-base-uncased_sentiment_sst-2_houlsbymodel· 1 dl1 dl
- 🤗AdapterHub/bert-base-uncased_nli_rte_pfeiffermodel· 5 dl5 dl
- 🤗AdapterHub/bert-base-uncased_sts_qqp_pfeiffermodel· 6 dl6 dl
- 🤗AdapterHub/bert-base-uncased_nli_qnli_houlsbymodel· 1 dl1 dl
- 🤗AdapterHub/bert-base-uncased_sts_mrpc_houlsbymodel· 10 dl10 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Attention Dropout · Adam · Dense Connections · Linear Warmup With Linear Decay · Residual Connection · Dropout · Refunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Layer Normalization
