On the Importance of Effectively Adapting Pretrained Language Models for   Active Learning

Katerina Margatina; Lo\"ic Barrault; Nikolaos Aletras

arXiv:2104.08320·cs.CL·March 3, 2022·1 cites

On the Importance of Effectively Adapting Pretrained Language Models for Active Learning

Katerina Margatina, Lo\"ic Barrault, Nikolaos Aletras

PDF

Open Access 1 Repo

TL;DR

This paper emphasizes the importance of properly adapting pretrained language models for active learning in NLP, proposing a method to improve data efficiency by better fine-tuning strategies.

Contribution

It introduces a novel adaptation approach that involves further training the pretrained model on unlabeled data before active learning, enhancing performance over standard fine-tuning.

Findings

01

Significant improvements in data efficiency with the proposed adaptation method

02

Poor training strategies can severely impair active learning performance

03

The approach is effective across low and high resource scenarios

Abstract

Recent Active Learning (AL) approaches in Natural Language Processing (NLP) proposed using off-the-shelf pretrained language models (LMs). In this paper, we argue that these LMs are not adapted effectively to the downstream task during AL and we explore ways to address this issue. We suggest to first adapt the pretrained LM to the target task by continuing training with all the available unlabeled data and then use it for AL. We also propose a simple yet effective fine-tuning method to ensure that the adapted LM is properly trained in both low and high resource scenarios during AL. Our experiments demonstrate that our approach provides substantial data efficiency improvements compared to the standard fine-tuning approach, suggesting that a poor training strategy can be catastrophic for AL.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mourga/contrastive-active-learning
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Machine Learning and Algorithms · Natural Language Processing Techniques

MethodsMonte Carlo Dropout · Dropout