TL;DR
This scoping review comprehensively analyzes active learning strategies for entity recognition in NLP, highlighting their evaluation environments, datasets, and open research questions to guide future empirical comparisons.
Contribution
It categorizes 106 active learning strategies, assesses their evaluation environments, and identifies gaps in hardware and dataset accessibility for NLP entity recognition.
Findings
All studies used F1-score as an evaluation metric.
Most datasets contained newspaper or biomedical data.
Significant open questions remain in strategy selection and evaluation.
Abstract
We conducted a scoping review for active learning in the domain of natural language processing (NLP), which we summarize in accordance with the PRISMA-ScR guidelines as follows: Objective: Identify active learning strategies that were proposed for entity recognition and their evaluation environments (datasets, metrics, hardware, execution time). Design: We used Scopus and ACM as our search engines. We compared the results with two literature surveys to assess the search quality. We included peer-reviewed English publications introducing or comparing active learning strategies for entity recognition. Results: We analyzed 62 relevant papers and identified 106 active learning strategies. We grouped them into three categories: exploitation-based (60x), exploration-based (14x), and hybrid strategies (32x). We found that all studies used the F1-score as an evaluation metric. Information…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
