TL;DR
This paper introduces DEXA, a method that enhances crowdworker training by dynamically providing semantically similar expert examples during annotation tasks, significantly improving accuracy in biomedical sentence annotation.
Contribution
The paper proposes a novel dynamic example retrieval approach (DEXA) that outperforms traditional static instructions in crowd annotation accuracy and agreement with experts.
Findings
Higher agreement with experts using DEXA (average Cohen's Kappa 0.68 vs. 0.40)
Substantial agreement in aggregated annotations (up to 0.78/0.75/0.69)
Majority of workers find dynamic examples useful (72%)
Abstract
The success of crowdsourcing based annotation of text corpora depends on ensuring that crowdworkers are sufficiently well-trained to perform the annotation task accurately. To that end, a frequent approach to train annotators is to provide instructions and a few example cases that demonstrate how the task should be performed (referred to as the CONTROL approach). These globally defined "task-level examples", however, (i) often only cover the common cases that are encountered during an annotation task; and (ii) require effort from crowdworkers during the annotation process to find the most relevant example for the currently annotated sample. To overcome these limitations, we propose to support workers in addition to task-level examples, also with "task-instance level" examples that are semantically similar to the currently annotated data sample (referred to as Dynamic Examples for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
