A Survey of Active Learning for Text Classification using Deep Neural Networks
Christopher Schr\"oder, Andreas Niekler

TL;DR
This survey reviews the use of deep neural networks in active learning for text classification, discussing challenges, recent advances, and future research directions to improve model performance and reduce annotation efforts.
Contribution
It provides a comprehensive taxonomy of query strategies, analyzes recent NLP advances in the context of active learning, and identifies gaps and open questions in current research.
Findings
Deep neural networks can enhance active learning for text classification.
Current challenges include unreliable uncertainty estimates and small data training issues.
The survey highlights research gaps and proposes future directions.
Abstract
Natural language processing (NLP) and neural networks (NNs) have both undergone significant changes in recent years. For active learning (AL) purposes, NNs are, however, less commonly used -- despite their current popularity. By using the superior text classification performance of NNs for AL, we can either increase a model's performance using the same amount of data or reduce the data and therefore the required annotation efforts while keeping the same performance. We review AL for text classification using deep neural networks (DNNs) and elaborate on two main causes which used to hinder the adoption: (a) the inability of NNs to provide reliable uncertainty estimates, on which the most commonly used query strategies rely, and (b) the challenge of training DNNs on small data. To investigate the former, we construct a taxonomy of query strategies, which distinguishes between data-based,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Topic Modeling · Natural Language Processing Techniques
