Measuring Annotator Agreement Generally across Complex Structured,   Multi-object, and Free-text Annotation Tasks

Alexander Braylan; Omar Alonso; Matthew Lease

arXiv:2212.09503·cs.CL·December 20, 2022

Measuring Annotator Agreement Generally across Complex Structured, Multi-object, and Free-text Annotation Tasks

Alexander Braylan, Omar Alonso, Matthew Lease

PDF

1 Repo

TL;DR

This paper evaluates inter-annotator agreement measures for complex annotation tasks, identifying challenges with existing methods and proposing two new, more interpretable measures that improve consistency across diverse annotation types.

Contribution

The paper introduces two novel IAA measures designed for complex annotation tasks, addressing interpretability issues and enhancing consistency across various data types.

Findings

01

Existing IAA measures struggle with complex tasks

02

Proposed measures are more interpretable and consistent

03

Evaluation across seven diverse annotation tasks

Abstract

When annotators label data, a key metric for quality assurance is inter-annotator agreement (IAA): the extent to which annotators agree on their labels. Though many IAA measures exist for simple categorical and ordinal labeling tasks, relatively little work has considered more complex labeling tasks, such as structured, multi-object, and free-text annotations. Krippendorff's alpha, best known for use with simpler labeling tasks, does have a distance-based formulation with broader applicability, but little work has studied its efficacy and consistency across complex annotation tasks. We investigate the design and evaluation of IAA measures for complex annotation tasks, with evaluation spanning seven diverse tasks: image bounding boxes, image keypoints, text sequence tagging, ranked lists, free text translations, numeric vectors, and syntax trees. We identify the difficulty of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

praznat/annotationmodeling
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.