A Survey on Cost Types, Interaction Schemes, and Annotator Performance   Models in Selection Algorithms for Active Learning in Classification

Marek Herde; Denis Huseljic; Bernhard Sick; Adrian Calma

arXiv:2109.11301·cs.LG·January 6, 2022

A Survey on Cost Types, Interaction Schemes, and Annotator Performance Models in Selection Algorithms for Active Learning in Classification

Marek Herde, Denis Huseljic, Bernhard Sick, Adrian Calma

PDF

TL;DR

This survey reviews real-world active learning strategies that incorporate human annotator variability, complex interaction schemes, and cost considerations to improve classification performance efficiently.

Contribution

It categorizes around 60 real-world AL strategies based on their handling of annotator performance, interaction types, and cost schemes, highlighting their differences from traditional approaches.

Findings

01

Addresses human annotator variability and errors.

02

Classifies diverse real-world AL strategies.

03

Outlines future research directions in AL.

Abstract

Pool-based active learning (AL) aims to optimize the annotation process (i.e., labeling) as the acquisition of annotations is often time-consuming and therefore expensive. For this purpose, an AL strategy queries annotations intelligently from annotators to train a high-performance classification model at a low annotation cost. Traditional AL strategies operate in an idealized framework. They assume a single, omniscient annotator who never gets tired and charges uniformly regardless of query difficulty. However, in real-world applications, we often face human annotators, e.g., crowd or in-house workers, who make annotation mistakes and can be reluctant to respond if tired or faced with complex queries. Recently, a wide range of novel AL strategies has been proposed to address these issues. They differ in at least one of the following three central aspects from traditional AL: (1) They…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.