Sparse Probability of Agreement

Jeppe Nørregaard; Leon Derczynski

arXiv:2208.06161 · cs.CL · February 28, 2023

TL;DR

The paper introduces Sparse Probability of Agreement (SPA), a metric that estimates inter-annotator agreement when not every annotator has labeled every item. Under certain conditions, SPA is an unbiased estimator.

Contribution

The paper proposes SPA, an agreement metric for sparsely annotated data that is unbiased under certain conditions, together with multiple weighting schemes for data with various degrees of annotation.

Findings

01 SPA is an unbiased estimator under certain conditions.

02 Multiple weighting schemes let SPA handle data with various degrees of annotation.

03 SPA estimates the probability of agreement when not all annotator-item pairs are available.

Abstract

Measuring inter-annotator agreement is important for annotation tasks, but many metrics require a fully-annotated set of data, where all annotators annotate all samples. We define Sparse Probability of Agreement, SPA, which estimates the probability of agreement when not all annotator-item pairs are available. We show that under certain conditions, SPA is an unbiased estimator, and we provide multiple weighting schemes for handling data with various degrees of annotation.
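To make the idea concrete, the following is a minimal sketch rather than the paper's reference implementation. It assumes that agreement on an item is the fraction of annotation pairs that share a label, and that the overall estimate is a weighted average over items with at least two annotations. The function names (`item_agreement`, `sparse_agreement`) and the two weighting options (`"flat"`, `"pairs"`) are hypothetical and chosen for illustration. The abstract does not specify which weighting schemes the paper defines.

```python
from collections import Counter


def item_agreement(labels):
    """Fraction of ordered annotation pairs on one item that agree.

    Returns None when the item has fewer than two annotations,
    because no pair exists to compare.
    """
    n = len(labels)
    if n < 2:
        return None
    counts = Counter(labels)
    agreeing = sum(c * (c - 1) for c in counts.values())
    return agreeing / (n * (n - 1))


def sparse_agreement(items, weighting="flat"):
    """Weighted average of per-item agreement over sparsely annotated items.

    `items` is a list of label lists, one per item; items may have
    different numbers of annotations. The weighting names are
    illustrative assumptions, not taken from the paper:
      "flat"  - every usable item counts equally
      "pairs" - each item is weighted by its number of annotation pairs
    """
    numerator = 0.0
    denominator = 0.0
    for labels in items:
        p = item_agreement(labels)
        if p is None:
            continue  # skip items with a single annotation or none
        n = len(labels)
        if weighting == "flat":
            w = 1.0
        elif weighting == "pairs":
            w = n * (n - 1) / 2
        else:
            raise ValueError(f"unknown weighting: {weighting!r}")
        numerator += w * p
        denominator += w
    if denominator == 0:
        raise ValueError("no item has at least two annotations")
    return numerator / denominator


# Toy data: four items with 2, 3, 1, and 2 annotations respectively.
items = [["a", "a"], ["a", "b", "a"], ["b"], ["a", "b"]]
flat_estimate = sparse_agreement(items, weighting="flat")
pair_estimate = sparse_agreement(items, weighting="pairs")
```

In this toy example the single-annotation item is skipped, and the two weightings give different estimates because the three-annotation item contributes more pairs than the others.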

Taxonomy

Topics: Reliability and Agreement in Measurement · Mobile Crowdsensing and Crowdsourcing · Air Traffic Management and Optimization