Training with Exploration Improves a Greedy Stack-LSTM Parser

Miguel Ballesteros; Yoav Goldberg; Chris Dyer; Noah A. Smith

arXiv:1603.03793·cs.CL·September 14, 2016·21 cites

Training with Exploration Improves a Greedy Stack-LSTM Parser

Miguel Ballesteros, Yoav Goldberg, Chris Dyer, Noah A. Smith

PDF

Open Access

TL;DR

This paper enhances a greedy Stack-LSTM dependency parser by incorporating a training-with-exploration approach using dynamic oracles, leading to improved parsing accuracy for English and Chinese.

Contribution

It introduces a novel training method with exploration for neural parsers, replacing traditional cross-entropy training with dynamic oracle-based exploration.

Findings

01

Significant accuracy improvements on English and Chinese parsing tasks

02

Effective adaptation of training with exploration to neural network models

03

Demonstrates the benefits of dynamic oracle-based training in dependency parsing

Abstract

We adapt the greedy Stack-LSTM dependency parser of Dyer et al. (2015) to support a training-with-exploration procedure using dynamic oracles(Goldberg and Nivre, 2013) instead of cross-entropy minimization. This form of training, which accounts for model predictions at training time rather than assuming an error-free action history, improves parsing accuracies for both English and Chinese, obtaining very strong results for both languages. We discuss some modifications needed in order to get training with exploration to work well for a probabilistic neural-network.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications