Deep Reinforcement Learning for Mention-Ranking Coreference Models

Kevin Clark; Christopher D. Manning

arXiv:1609.08667·cs.CL·November 2, 2016

Deep Reinforcement Learning for Mention-Ranking Coreference Models

Kevin Clark, Christopher D. Manning

PDF

1 Repo

TL;DR

This paper introduces reinforcement learning techniques to train neural mention-ranking models for coreference resolution, directly optimizing evaluation metrics and achieving state-of-the-art results on CoNLL 2012 datasets.

Contribution

It applies reinforcement learning, specifically reward-rescaled max-margin, to coreference models, improving over heuristic loss functions and setting new performance benchmarks.

Findings

01

Reward-rescaled max-margin outperforms REINFORCE in experiments.

02

Significant improvements over state-of-the-art on CoNLL 2012 datasets.

03

Effective direct optimization of coreference evaluation metrics.

Abstract

Coreference resolution systems are typically trained with heuristic loss functions that require careful tuning. In this paper we instead apply reinforcement learning to directly optimize a neural mention-ranking model for coreference evaluation metrics. We experiment with two approaches: the REINFORCE policy gradient algorithm and a reward-rescaled max-margin objective. We find the latter to be more effective, resulting in significant improvements over the current state-of-the-art on the English and Chinese portions of the CoNLL 2012 Shared Task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

clarkkev/deep-coref
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.