Neural Models for Reasoning over Multiple Mentions using Coreference

Bhuwan Dhingra; Qiao Jin; Zhilin Yang; William W. Cohen; Ruslan; Salakhutdinov

arXiv:1804.05922·cs.CL·April 18, 2018

Neural Models for Reasoning over Multiple Mentions using Coreference

Bhuwan Dhingra, Qiao Jin, Zhilin Yang, William W. Cohen, Ruslan, Salakhutdinov

PDF

TL;DR

This paper introduces a novel recurrent neural layer that leverages coreference annotations to better aggregate information from multiple mentions of the same entity, improving reading comprehension performance especially with limited training data.

Contribution

The paper proposes a coreference-biased recurrent layer that enhances neural reasoning over multiple entity mentions in text.

Findings

01

Improved performance on Wikihop, LAMBADA, and bAbi datasets

02

Significant gains when training data is scarce

03

Effective integration of coreference information into RNNs

Abstract

Many problems in NLP require aggregating information from multiple mentions of the same entity which may be far apart in the text. Existing Recurrent Neural Network (RNN) layers are biased towards short-term dependencies and hence not suited to such tasks. We present a recurrent layer which is instead biased towards coreferent dependencies. The layer uses coreference annotations extracted from an external system to connect entity mentions belonging to the same cluster. Incorporating this layer into a state-of-the-art reading comprehension model improves performance on three datasets -- Wikihop, LAMBADA and the bAbi AI tasks -- with large gains when training data is scarce.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.