On the Importance of Effectively Adapting Pretrained Language Models for Active Learning
Katerina Margatina, Lo\"ic Barrault, Nikolaos Aletras

TL;DR
This paper emphasizes the importance of properly adapting pretrained language models for active learning in NLP, proposing a method to improve data efficiency by better fine-tuning strategies.
Contribution
It introduces a novel adaptation approach that involves further training the pretrained model on unlabeled data before active learning, enhancing performance over standard fine-tuning.
Findings
Significant improvements in data efficiency with the proposed adaptation method
Poor training strategies can severely impair active learning performance
The approach is effective across low and high resource scenarios
Abstract
Recent Active Learning (AL) approaches in Natural Language Processing (NLP) proposed using off-the-shelf pretrained language models (LMs). In this paper, we argue that these LMs are not adapted effectively to the downstream task during AL and we explore ways to address this issue. We suggest to first adapt the pretrained LM to the target task by continuing training with all the available unlabeled data and then use it for AL. We also propose a simple yet effective fine-tuning method to ensure that the adapted LM is properly trained in both low and high resource scenarios during AL. Our experiments demonstrate that our approach provides substantial data efficiency improvements compared to the standard fine-tuning approach, suggesting that a poor training strategy can be catastrophic for AL.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning and Algorithms · Natural Language Processing Techniques
MethodsMonte Carlo Dropout · Dropout
