Aggregating Soft Labels from Crowd Annotations Improves Uncertainty Estimation Under Distribution Shift

Dustin Wright; Isabelle Augenstein

arXiv:2212.09409 · cs.CL · April 23, 2025

Open Access

TL;DR

This paper empirically evaluates methods for deriving soft labels from crowd annotations, measuring their effect on uncertainty estimation under distribution shift, and proposes averaging the resulting soft labels to obtain more consistent performance across tasks.

Contribution

It provides the first large-scale empirical comparison of soft-labeling methods in the out-of-domain setting, covering 8 methods on 4 language and vision tasks, and proposes aggregating their soft labels via a simple average to improve uncertainty estimation.

Findings

01

Aggregation improves uncertainty estimation in most settings.

02

Simple averaging yields consistent performance across tasks.

03

Method selection is less critical with abundant or minimal data.

Abstract

Selecting an effective training signal for machine learning tasks is difficult: expert annotations are expensive, and crowd-sourced annotations may not be reliable. Recent work has demonstrated that learning from a distribution over labels acquired from crowd annotations can be effective for both performance and uncertainty estimation. However, this has mainly been studied using a limited set of soft-labeling methods in an in-domain setting. Additionally, no single method has been shown to perform consistently well across tasks, making it difficult to know a priori which to choose. To fill these gaps, this paper provides the first large-scale empirical study on learning from crowd labels in the out-of-domain setting, systematically analyzing 8 soft-labeling methods on 4 language and vision tasks. Additionally, we propose to aggregate soft labels via a simple average in order to achieve…
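The aggregation idea in the abstract can be illustrated with a minimal sketch. It assumes each soft-labeling method yields one probability distribution over classes per item, and that aggregation is a plain element-wise mean of those distributions. The two methods shown (`normalize_counts` and `softmax_counts`) and the vote counts are hypothetical stand-ins chosen for illustration; this page does not list the 8 methods the paper studies.

```python
import math


def normalize_counts(counts):
    """Hypothetical method A: divide each class's vote count by the total."""
    total = sum(counts)
    return [c / total for c in counts]


def softmax_counts(counts):
    """Hypothetical method B: softmax over the raw vote counts."""
    peak = max(counts)  # subtract the max for numerical stability
    exps = [math.exp(c - peak) for c in counts]
    norm = sum(exps)
    return [e / norm for e in exps]


def average_soft_labels(distributions):
    """Element-wise mean of several distributions over the same classes."""
    num_classes = len(distributions[0])
    if any(len(d) != num_classes for d in distributions):
        raise ValueError("all distributions must cover the same classes")
    n = len(distributions)
    return [sum(d[i] for d in distributions) / n for i in range(num_classes)]


# Illustrative data: 5 annotators voted 3 / 1 / 1 across three classes.
counts = [3, 1, 1]
soft_a = normalize_counts(counts)
soft_b = softmax_counts(counts)
aggregated = average_soft_labels([soft_a, soft_b])
```

Because each input is a valid probability distribution, their mean is one as well, so `aggregated` can be used directly as a training target with a cross-entropy or KL-divergence loss.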

Taxonomy

Topics: Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

Methods: Test