Unifying the Scope of Bridging Anaphora Types in English: Bridging Annotations in ARRAU and GUM
Lauren Levine, Amir Zeldes

TL;DR
This paper compares bridging annotations across multiple coreference resources to address standardization issues, analyzing differences and releasing harmonized datasets for improved evaluation of bridging resolution.
Contribution
It introduces a unified schema for bridging annotations and provides harmonized test sets across GUM, GENTLE, and ARRAU corpora to facilitate cross-domain evaluation.
Findings
Large differences in bridging phenomena annotations across resources
Harmonized datasets enable more reliable cross-domain evaluation
Analysis reveals diverse types of bridging phenomena
Abstract
Comparing bridging annotations across coreference resources is difficult, largely due to a lack of standardization across definitions and annotation schemas and narrow coverage of disparate text domains across resources. To alleviate domain coverage issues and consolidate schemas, we compare guidelines and use interpretable predictive models to examine the bridging instances annotated in the GUM, GENTLE and ARRAU corpora. Examining these cases, we find that there is a large difference in types of phenomena annotated as bridging. Beyond theoretical results, we release a harmonized, subcategorized version of the test sets of GUM, GENTLE and the ARRAU Wall Street Journal data to promote meaningful and reliable evaluation of bridging resolution across domains.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Syntax, Semantics, Linguistic Variation · Speech and dialogue systems
