Variational Information Bottleneck for Effective Low-Resource   Fine-Tuning

Rabeeh Karimi Mahabadi; Yonatan Belinkov; James Henderson

arXiv:2106.05469·cs.CL·June 11, 2021·39 cites

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a Variational Information Bottleneck approach to improve low-resource fine-tuning of large language models by reducing overfitting and enhancing out-of-domain generalization.

Contribution

The paper proposes a novel VIB-based method for low-resource fine-tuning that suppresses irrelevant features and improves robustness and transferability.

Findings

01

Significantly improves transfer learning in low-resource settings

02

Enhances out-of-domain generalization on NLI benchmarks

03

Reduces overfitting in low-resource fine-tuning scenarios

Abstract

While large-scale pretrained language models have obtained impressive results when fine-tuned on a wide variety of tasks, they still often suffer from overfitting in low-resource scenarios. Since such models are general-purpose feature extractors, many of these features are inevitably irrelevant for a given target task. We propose to use Variational Information Bottleneck (VIB) to suppress irrelevant features when fine-tuning on low-resource target tasks, and show that our method successfully reduces overfitting. Moreover, we show that our VIB model finds sentence representations that are more robust to biases in natural language inference datasets, and thereby obtains better generalization to out-of-domain datasets. Evaluation on seven low-resource datasets in different tasks shows that our method significantly improves transfer learning in low-resource scenarios, surpassing prior…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rabeehk/vibert
pytorchOfficial

Videos

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis