Investigating label suggestions for opinion mining in German Covid-19 social media

Tilman Beck; Ji-Ung Lee; Christina Viehmann; Marcus Maurer; Oliver Quiring; Iryna Gurevych

arXiv:2105.12980·cs.CL·June 9, 2021

TL;DR

This study explores whether interactively updated label suggestions make annotation for opinion mining in German Covid-19 social media data more efficient. It shows that suggestions from a model trained on a small, expert-annotated dataset substantially improve inter-annotator agreement and annotation quality.

Contribution

It demonstrates that a model trained on a small, expert-annotated dataset provides label suggestions that improve inter-annotator agreement and annotation quality in opinion mining tasks.

Findings

01

Suggestions from a model trained on a small, expert-annotated dataset improve inter-annotator agreement (+.14 Fleiss' κ)

02

Suggestions from a static model are as effective as suggestions from interactively trained models

03

Annotated data is suitable for transfer learning experiments
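
The agreement gain in finding 01 is reported in Fleiss' κ, which compares the observed agreement among a fixed number of annotators with the agreement expected by chance. The following is a minimal sketch of the standard formula in plain Python, not the authors' evaluation code. The function name `fleiss_kappa` and the label names `support` and `refute` are hypothetical and chosen only for illustration.

```python
from collections import Counter


def fleiss_kappa(ratings, categories):
    """Compute Fleiss' kappa.

    ratings: list of items; each item is a list with one label per annotator.
             Every item must be labeled by the same number of annotators (>= 2).
    categories: list of all possible labels.
    """
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if n_raters < 2 or any(len(item) != n_raters for item in ratings):
        raise ValueError("every item needs the same number (>= 2) of annotations")

    # How often each category was assigned to each item.
    counts = [Counter(item) for item in ratings]

    # Observed agreement per item, averaged over all items.
    per_item = [
        (sum(c[cat] ** 2 for cat in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall share of each category.
    shares = [
        sum(c[cat] for c in counts) / (n_items * n_raters) for cat in categories
    ]
    p_expected = sum(s ** 2 for s in shares)

    if p_expected == 1.0:
        # All annotations fall into a single category; kappa is undefined.
        raise ValueError("kappa is undefined when only one category is used")

    return (p_observed - p_expected) / (1 - p_expected)


# Hypothetical toy data: two posts, three annotators each, full agreement.
labels = ["support", "refute"]
toy = [["support"] * 3, ["refute"] * 3]
print(fleiss_kappa(toy, labels))
```

A value of 1 indicates perfect agreement, 0 indicates agreement at chance level, and negative values indicate agreement below chance. A difference such as +.14 is obtained by computing κ separately for each annotator group and subtracting.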

Abstract

This work investigates the use of interactively updated label suggestions to improve the efficiency of gathering annotations for opinion mining in German Covid-19 social media data. We develop guidelines to conduct a controlled annotation study with social science students and find that suggestions from a model trained on a small, expert-annotated dataset already lead to a substantial improvement, in terms of inter-annotator agreement (+.14 Fleiss' $κ$) and annotation quality, compared to students who do not receive any label suggestions. We further find that label suggestions from interactively trained models do not lead to an improvement over suggestions from a static model. Nonetheless, our analysis of suggestion bias shows that annotators generally remain capable of reflecting upon the suggested label. Finally, we confirm the quality of the annotated data in…

Code & Models

Repositories

UKPLab/acl2021-label-suggestions-german-covid19
PyTorch · Official