An Empirical Investigation of Beam-Aware Training in Supertagging

Renato Negrinho; Matthew R. Gormley; Geoffrey J. Gordon

arXiv:2010.04980·cs.CL·October 13, 2020

An Empirical Investigation of Beam-Aware Training in Supertagging

Renato Negrinho, Matthew R. Gormley, Geoffrey J. Gordon

PDF

1 Repo

TL;DR

This paper empirically investigates beam-aware training in supertagging, demonstrating its performance benefits and providing insights into its design choices and stability, especially for simpler models managing uncertainty.

Contribution

It provides the first empirical analysis of beam-aware training in supertagging, exploring its impact, stability, and optimal design choices.

Findings

01

Beam-aware training improves supertagging performance.

02

Large gains observed for simpler models with uncertainty management.

03

Search-aware training is crucial for maximizing model effectiveness.

Abstract

Structured prediction is often approached by training a locally normalized model with maximum likelihood and decoding approximately with beam search. This approach leads to mismatches as, during training, the model is not exposed to its mistakes and does not use beam search. Beam-aware training aims to address these problems, but unfortunately, it is not yet widely used due to a lack of understanding about how it impacts performance, when it is most useful, and whether it is stable. Recently, Negrinho et al. (2018) proposed a meta-algorithm that captures beam-aware training algorithms and suggests new ones, but unfortunately did not provide empirical results. In this paper, we begin an empirical investigation: we train the supertagging model of Vaswani et al. (2016) and a simpler model with instantiations of the meta-algorithm. We explore the influence of various design choices and make…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

negrinho/beam_learn_supertagging
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.