Compute-Efficient Active Learning
G\'abor N\'emeth, Tam\'as Matuszka

TL;DR
This paper introduces a simple, method-agnostic framework for active learning that significantly reduces computational costs on large datasets while maintaining or improving model performance.
Contribution
The paper proposes a novel, efficient framework for active learning that minimizes computational resources needed without sacrificing accuracy.
Findings
Reduces computational costs in active learning
Maintains or improves model performance
Applicable to large-scale datasets
Abstract
Active learning, a powerful paradigm in machine learning, aims at reducing labeling costs by selecting the most informative samples from an unlabeled dataset. However, the traditional active learning process often demands extensive computational resources, hindering scalability and efficiency. In this paper, we address this critical issue by presenting a novel method designed to alleviate the computational burden associated with active learning on massive datasets. To achieve this goal, we introduce a simple, yet effective method-agnostic framework that outlines how to strategically choose and annotate data points, optimizing the process for efficiency while maintaining model performance. Through case studies, we demonstrate the effectiveness of our proposed method in reducing computational costs while maintaining or, in some cases, even surpassing baseline model outcomes. Code is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Data Stream Mining Techniques
