AmbiCoref: Evaluating Human and Model Sensitivity to Ambiguous Coreference

Yuewei Yuan; Chaitanya Malaviya; Mark Yatskar

arXiv:2302.00762·cs.CL·February 6, 2023

TL;DR

This paper introduces AmbiCoref, a diagnostic corpus for evaluating whether coreference resolution models are sensitive to ambiguous pronoun references. Humans are less certain about referents in the ambiguous examples, while most models produce nearly the same output for ambiguous and unambiguous sentences.

Contribution

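The paper presents AmbiCoref, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents, built by generalizing psycholinguistic studies of how humans perceive ambiguity, to test model sensitivity to ambiguity in coreference resolution.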

Findings

01 Humans are less sure of referents in ambiguous AmbiCoref examples than in unambiguous ones.

02 Most coreference models show little difference in output between ambiguous and unambiguous pairs.

03 AmbiCoref makes it possible to test whether models are as sensitive to ambiguity as humans are.

Abstract

Given a sentence "Abby told Brittney that she upset Courtney", one would struggle to understand who "she" refers to, and ask for clarification. However, if the word "upset" were replaced with "hugged", "she" unambiguously refers to Abby. We study if modern coreference resolution models are sensitive to such pronominal ambiguity. To this end, we construct AmbiCoref, a diagnostic corpus of minimal sentence pairs with ambiguous and unambiguous referents. Our examples generalize psycholinguistic studies of human perception of ambiguity around particular arrangements of verbs and their arguments. Analysis shows that (1) humans are less sure of referents in ambiguous AmbiCoref examples than unambiguous ones, and (2) most coreference models show little difference in output between ambiguous and unambiguous pairs. We release AmbiCoref as a diagnostic corpus for testing whether models treat…
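The minimal-pair idea in the abstract can be illustrated with a short Python sketch. This is not the authors' generation code or evaluation metric: the template string, the `MinimalPair` class, and the `make_pair` and `confidence_gap` helpers are hypothetical names, and the gap measure is only one simple way to compare how confident a model is on the two members of a pair.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MinimalPair:
    """Two sentences that differ only in the verb (hypothetical container)."""
    ambiguous: str
    unambiguous: str


# Assumed template, modeled on the example sentence in the abstract.
TEMPLATE = "{a} told {b} that she {verb} {c}"


def make_pair(a, b, c, ambiguous_verb, unambiguous_verb):
    """Fill the same template twice, swapping only the verb."""
    return MinimalPair(
        ambiguous=TEMPLATE.format(a=a, b=b, c=c, verb=ambiguous_verb),
        unambiguous=TEMPLATE.format(a=a, b=b, c=c, verb=unambiguous_verb),
    )


def confidence_gap(p_ambiguous, p_unambiguous):
    """Illustrative measure (an assumption, not the paper's metric).

    Each argument maps candidate antecedents to the probability a model
    assigns them. A model that is sensitive to ambiguity should be less
    confident on the ambiguous sentence, giving a positive gap.
    """
    return max(p_unambiguous.values()) - max(p_ambiguous.values())


pair = make_pair("Abby", "Brittney", "Courtney", "upset", "hugged")
print(pair.ambiguous)
print(pair.unambiguous)
```

A gap close to zero across many such pairs would correspond to the behavior the abstract reports for most models: little difference in output between the ambiguous and unambiguous versions.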

Code & Models

Repositories

lucyyyw/ambicoref (Official)

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification