Collective Human Opinions in Semantic Textual Similarity

Yuxia Wang; Shimin Tao; Ning Xie; Hao Yang; Timothy Baldwin; Karin; Verspoor

arXiv:2308.04114·cs.CL·August 9, 2023

Collective Human Opinions in Semantic Textual Similarity

Yuxia Wang, Shimin Tao, Ning Xie, Hao Yang, Timothy Baldwin, Karin, Verspoor

PDF

Open Access 1 Repo

TL;DR

This paper introduces USTS, a new Chinese dataset for semantic textual similarity that captures human disagreement and semantic vagueness, highlighting limitations of current models in representing opinion variance.

Contribution

The paper presents USTS, the first uncertainty-aware STS dataset with detailed annotations, and analyzes the inadequacy of existing models to capture human opinion variance.

Findings

01

Existing benchmarks mask opinion disagreement by averaging ratings.

02

Current models do not effectively capture individual opinion variance.

03

USTS dataset reveals the complexity of human semantic judgments.

Abstract

Despite the subjective nature of semantic textual similarity (STS) and pervasive disagreements in STS annotation, existing benchmarks have used averaged human ratings as the gold standard. Averaging masks the true distribution of human opinions on examples of low agreement, and prevents models from capturing the semantic vagueness that the individual ratings represent. In this work, we introduce USTS, the first Uncertainty-aware STS dataset with ~15,000 Chinese sentence pairs and 150,000 labels, to study collective human opinions in STS. Analysis reveals that neither a scalar nor a single Gaussian fits a set of observed judgements adequately. We further show that current STS models cannot capture the variance caused by human disagreement on individual instances, but rather reflect the predictive confidence over the aggregate dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuxiaw/usts
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Advanced Text Analysis Techniques