Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers
Christopher Schr\"oder, Andreas Niekler, Martin Potthast

TL;DR
This paper evaluates uncertainty-based query strategies for active learning with transformers, showing that several outperform the traditional prediction entropy approach across multiple text classification benchmarks.
Contribution
It revisits and demonstrates the effectiveness of uncertainty-based query strategies specifically in the context of transformer fine-tuning, challenging prior assumptions.
Findings
Several uncertainty-based methods outperform prediction entropy.
Transformers can be effectively integrated with active learning.
Performance varies across different text classification tasks.
Abstract
Active learning is the iterative construction of a classification model through targeted labeling, enabling significant labeling cost savings. As most research on active learning has been carried out before transformer-based language models ("transformers") became popular, despite its practical importance, comparably few papers have investigated how transformers can be combined with active learning to date. This can be attributed to the fact that using state-of-the-art query strategies for transformers induces a prohibitive runtime overhead, which effectively nullifies, or even outweighs the desired cost savings. For this reason, we revisit uncertainty-based query strategies, which had been largely outperformed before, but are particularly suited in the context of fine-tuning transformers. In an extensive evaluation, we connect transformers to experiments from previous research,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Topic Modeling · Machine Learning in Materials Science
MethodsLinear Layer · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Softmax · Multi-Head Attention · Transformer
