Multi-task Active Learning for Pre-trained Transformer-based Models
Guy Rotman, Roi Reichart

TL;DR
This paper investigates multi-task active learning for pre-trained Transformer models, demonstrating that it can reduce annotation efforts while maintaining performance across multiple NLP tasks.
Contribution
It introduces multi-task active learning strategies for Transformer models and evaluates their effectiveness across different task relations.
Findings
Multi-task active learning outperforms single-task methods in annotation efficiency.
Effectiveness varies depending on task relatedness.
Proposed criteria improve selection of valuable unlabeled data.
Abstract
Multi-task learning, in which several tasks are jointly learned by a single model, allows NLP models to share information from multiple annotations and may facilitate better predictions when the tasks are inter-related. This technique, however, requires annotating the same text with multiple annotation schemes which may be costly and laborious. Active learning (AL) has been demonstrated to optimize annotation processes by iteratively selecting unlabeled examples whose annotation is most valuable for the NLP model. Yet, multi-task active learning (MT-AL) has not been applied to state-of-the-art pre-trained Transformer-based NLP models. This paper aims to close this gap. We explore various multi-task selection criteria in three realistic multi-task scenarios, reflecting different relations between the participating tasks, and demonstrate the effectiveness of multi-task compared to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Topic Modeling · Machine Learning and Data Classification
