Improving Generalization in Coreference Resolution via Adversarial   Training

Sanjay Subramanian; Dan Roth

arXiv:1908.04728·cs.CL·August 14, 2019·1 cites

Improving Generalization in Coreference Resolution via Adversarial Training

Sanjay Subramanian, Dan Roth

PDF

Open Access

TL;DR

This paper introduces adversarial training to improve the generalization of coreference resolution systems, making them more robust to unseen named entities and enhancing their performance across datasets.

Contribution

It applies adversarial gradient-based training to a state-of-the-art coreference system, significantly boosting its ability to generalize to new named entities and datasets.

Findings

01

Improved performance on CoNLL dataset with unseen entities

02

Enhanced robustness to entity name variations

03

Higher accuracy on the GAP dataset

Abstract

In order for coreference resolution systems to be useful in practice, they must be able to generalize to new text. In this work, we demonstrate that the performance of the state-of-the-art system decreases when the names of PER and GPE named entities in the CoNLL dataset are changed to names that do not occur in the training set. We use the technique of adversarial gradient-based training to retrain the state-of-the-art system and demonstrate that the retrained system achieves higher performance on the CoNLL dataset (both with and without the change of named entities) and the GAP dataset.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Domain Adaptation and Few-Shot Learning