Coreference Reasoning in Machine Reading Comprehension

Mingzhu Wu; Nafise Sadat Moosavi; Dan Roth; Iryna Gurevych

arXiv:2012.15573·cs.CL·June 10, 2021

Coreference Reasoning in Machine Reading Comprehension

Mingzhu Wu, Nafise Sadat Moosavi, Dan Roth, Iryna Gurevych

PDF

Open Access 1 Repo

TL;DR

This paper highlights the challenges of coreference reasoning in machine reading comprehension, proposes a new dataset reflecting these challenges, and demonstrates improved model performance using natural coreference data.

Contribution

It introduces a methodology for creating more representative coreference reasoning datasets and shows how to enhance models with natural coreference data.

Findings

01

State-of-the-art models struggle with coreference reasoning.

02

New dataset better reflects coreference challenges in MRC.

03

Training with natural coreference data improves model performance.

Abstract

Coreference resolution is essential for natural language understanding and has been long studied in NLP. In recent years, as the format of Question Answering (QA) became a standard for machine reading comprehension (MRC), there have been data collection efforts, e.g., Dasigi et al. (2019), that attempt to evaluate the ability of MRC models to reason about coreference. However, as we show, coreference reasoning in MRC is a greater challenge than earlier thought; MRC datasets do not reflect the natural distribution and, consequently, the challenges of coreference reasoning. Specifically, success on these datasets does not reflect a model's proficiency in coreference reasoning. We propose a methodology for creating MRC datasets that better reflect the challenges of coreference reasoning and use it to create a sample evaluation set. The results on our dataset show that state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

UKPLab/coref-reasoning-in-qa
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications