Annotator-Centric Active Learning for Subjective NLP Tasks

Michiel van der Meer; Neele Falk; Pradeep K. Murukannaiah; Enrico; Liscio

arXiv:2404.15720·cs.CL·October 24, 2024·1 cites

Annotator-Centric Active Learning for Subjective NLP Tasks

Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico, Liscio

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper proposes Annotator-Centric Active Learning (ACAL), a method that improves data efficiency and captures diverse human judgments in subjective NLP tasks by strategically selecting annotators.

Contribution

It introduces a novel annotator selection strategy within active learning tailored for subjective NLP tasks, emphasizing diversity and annotator-centric evaluation.

Findings

01

ACAL enhances data efficiency in subjective NLP tasks.

02

ACAL performs well in annotator-centric evaluation metrics.

03

Success depends on having a large, diverse pool of annotators.

Abstract

Active Learning (AL) addresses the high costs of collecting human annotations by strategically annotating the most informative samples. However, for subjective NLP tasks, incorporating a wide range of perspectives in the annotation process is crucial to capture the variability in human judgments. We introduce Annotator-Centric Active Learning (ACAL), which incorporates an annotator selection strategy following data sampling. Our objective is two-fold: 1) to efficiently approximate the full diversity of human judgments, and 2) to assess model performance using annotator-centric metrics, which value minority and majority perspectives equally. We experiment with multiple annotator selection strategies across seven subjective NLP tasks, employing both traditional and novel, human-centered evaluation metrics. Our findings indicate that ACAL improves data efficiency and excels in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

m0re4u/acal-subjective
pytorchOfficial

Videos

Annotator-Centric Active Learning for Subjective NLP Tasks· underline

Taxonomy

TopicsMachine Learning and Algorithms · Topic Modeling · Natural Language Processing Techniques