Using Linguistic Features to Improve the Generalization Capability of   Neural Coreference Resolvers

Nafise Sadat Moosavi; Michael Strube

arXiv:1708.00160·cs.CL·October 15, 2018

Using Linguistic Features to Improve the Generalization Capability of Neural Coreference Resolvers

Nafise Sadat Moosavi, Michael Strube

PDF

Open Access 1 Repo

TL;DR

This paper explores how incorporating specific linguistic features into neural coreference resolvers enhances their ability to generalize across different domains, leading to state-of-the-art out-of-domain performance.

Contribution

The study demonstrates that selecting informative linguistic features significantly improves the generalization of neural coreference resolvers beyond the training domain.

Findings

01

Incorporating linguistic features slightly improves generalization.

02

Using informative feature subsets greatly enhances out-of-domain performance.

03

Achieves state-of-the-art results on WikiCoref without domain-specific training.

Abstract

Coreference resolution is an intermediate step for text understanding. It is used in tasks and domains for which we do not necessarily have coreference annotated corpora. Therefore, generalization is of special importance for coreference resolution. However, while recent coreference resolvers have notable improvements on the CoNLL dataset, they struggle to generalize properly to new domains or datasets. In this paper, we investigate the role of linguistic features in building more generalizable coreference resolvers. We show that generalization improves only slightly by merely using a set of additional linguistic features. However, employing features and subsets of their values that are informative for coreference resolution, considerably improves generalization. Thanks to better generalization, our system achieves state-of-the-art results in out-of-domain evaluations, e.g., on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ns-moosavi/epm
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Neural Networks and Applications