DEXA: Supporting Non-Expert Annotators with Dynamic Examples from   Experts

Markus Zlabinger; Marta Sabou; Sebastian Hofst\"atter; Mete Sertkan,; and Allan Hanbury

arXiv:2005.08367·cs.IR·May 19, 2020

DEXA: Supporting Non-Expert Annotators with Dynamic Examples from Experts

Markus Zlabinger, Marta Sabou, Sebastian Hofst\"atter, Mete Sertkan,, and Allan Hanbury

PDF

1 Repo

TL;DR

This paper introduces DEXA, a method that enhances crowdworker training by dynamically providing semantically similar expert examples during annotation tasks, significantly improving accuracy in biomedical sentence annotation.

Contribution

The paper proposes a novel dynamic example retrieval approach (DEXA) that outperforms traditional static instructions in crowd annotation accuracy and agreement with experts.

Findings

01

Higher agreement with experts using DEXA (average Cohen's Kappa 0.68 vs. 0.40)

02

Substantial agreement in aggregated annotations (up to 0.78/0.75/0.69)

03

Majority of workers find dynamic examples useful (72%)

Abstract

The success of crowdsourcing based annotation of text corpora depends on ensuring that crowdworkers are sufficiently well-trained to perform the annotation task accurately. To that end, a frequent approach to train annotators is to provide instructions and a few example cases that demonstrate how the task should be performed (referred to as the CONTROL approach). These globally defined "task-level examples", however, (i) often only cover the common cases that are encountered during an annotation task; and (ii) require effort from crowdworkers during the annotation process to find the most relevant example for the currently annotated sample. To overcome these limitations, we propose to support workers in addition to task-level examples, also with "task-instance level" examples that are semantically similar to the currently annotated data sample (referred to as Dynamic Examples for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Markus-Zlabinger/pico-annotation
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.