Cost-minimising strategies for data labelling : optimal stopping and active learning
Christos Dimitrakakis, Christian Savu-Krohn

TL;DR
This paper introduces cost-aware strategies for data labeling in supervised learning, focusing on optimal stopping and active learning to minimize total labeling costs while maintaining model performance.
Contribution
It proposes a cost-based framework for optimal stopping and evaluation in active learning, addressing the trade-off between labeling costs and model accuracy.
Findings
Developed a general optimal stopping strategy based on expected cost.
Provided a framework for empirical evaluation using cost as a performance metric.
Discussed the potential for further research in cost-efficient active learning.
Abstract
Supervised learning deals with the inference of a distribution over an output or label space conditioned on points in an observation space , given a training dataset of pairs in . However, in a lot of applications of interest, acquisition of large amounts of observations is easy, while the process of generating labels is time-consuming or costly. One way to deal with this problem is {\em active} learning, where points to be labelled are selected with the aim of creating a model with better performance than that of an model trained on an equal number of randomly sampled points. In this paper, we instead propose to deal with the labelling cost directly: The learning goal is defined as the minimisation of a cost which is a function of the expected model performance and the total cost of the labels used. This allows the development of general strategies and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Machine Learning and Data Classification · Algorithms and Data Compression
