Unifying the Scope of Bridging Anaphora Types in English: Bridging   Annotations in ARRAU and GUM

Lauren Levine; Amir Zeldes

arXiv:2410.01170·cs.CL·October 3, 2024

Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM

Lauren Levine, Amir Zeldes

PDF

Open Access

TL;DR

This paper compares bridging annotations across multiple coreference resources to address standardization issues, analyzing differences and releasing harmonized datasets for improved evaluation of bridging resolution.

Contribution

It introduces a unified schema for bridging annotations and provides harmonized test sets across GUM, GENTLE, and ARRAU corpora to facilitate cross-domain evaluation.

Findings

01

Large differences in bridging phenomena annotations across resources

02

Harmonized datasets enable more reliable cross-domain evaluation

03

Analysis reveals diverse types of bridging phenomena

Abstract

Comparing bridging annotations across coreference resources is difficult, largely due to a lack of standardization across definitions and annotation schemas and narrow coverage of disparate text domains across resources. To alleviate domain coverage issues and consolidate schemas, we compare guidelines and use interpretable predictive models to examine the bridging instances annotated in the GUM, GENTLE and ARRAU corpora. Examining these cases, we find that there is a large difference in types of phenomena annotated as bridging. Beyond theoretical results, we release a harmonized, subcategorized version of the test sets of GUM, GENTLE and the ARRAU Wall Street Journal data to promote meaningful and reliable evaluation of bridging resolution across domains.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Syntax, Semantics, Linguistic Variation · Speech and dialogue systems