Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing

Daniel Fried; Dan Klein

arXiv:1806.03290·cs.CL·June 11, 2018·1 cites

Policy Gradient as a Proxy for Dynamic Oracles in Constituency Parsing

Daniel Fried, Dan Klein

PDF

Open Access

TL;DR

This paper proposes using policy gradient methods as a parser-agnostic alternative to dynamic oracles for training constituency parsers, improving performance and reducing the need for custom supervision.

Contribution

It introduces a policy gradient approach that directly optimizes tree-level metrics and can replace dynamic oracles across different parser architectures.

Findings

01

Policy gradient outperforms static likelihood training in most settings.

02

It recaptures much of the performance gain of dynamic oracles when available.

03

The method is effective across multiple languages and parser types.

Abstract

Dynamic oracles provide strong supervision for training constituency parsers with exploration, but must be custom defined for a given parser's transition system. We explore using a policy gradient method as a parser-agnostic alternative. In addition to directly optimizing for a tree-level metric such as F1, policy gradient has the potential to reduce exposure bias by allowing exploration during training; moreover, it does not require a dynamic oracle for supervision. On four constituency parsers in three languages, the method substantially outperforms static oracle likelihood training in almost all settings. For parsers where a dynamic oracle is available (including a novel oracle which we define for the transition system of Dyer et al. 2016), policy gradient typically recaptures a substantial fraction of the performance gain afforded by the dynamic oracle.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification