Benchmarking Approximate Inference Methods for Neural Structured   Prediction

Lifu Tu; Kevin Gimpel

arXiv:1904.01138·cs.CL·July 9, 2019·1 cites

Benchmarking Approximate Inference Methods for Neural Structured Prediction

Lifu Tu, Kevin Gimpel

PDF

Open Access 1 Repo

TL;DR

This paper compares approximate inference methods for neural structured prediction, showing inference networks outperform gradient descent in speed and accuracy trade-offs across sequence labeling tasks.

Contribution

It provides a comprehensive benchmark of inference networks versus gradient descent for neural structured prediction, highlighting the advantages of inference networks.

Findings

01

Inference networks outperform gradient descent in speed and accuracy.

02

Inference networks are faster than exact inference at similar accuracy.

03

Combining inference networks with gradient descent improves results.

Abstract

Exact structured inference with neural network scoring functions is computationally challenging but several methods have been proposed for approximating inference. One approach is to perform gradient descent with respect to the output structure directly (Belanger and McCallum, 2016). Another approach, proposed recently, is to train a neural network (an "inference network") to perform inference (Tu and Gimpel, 2018). In this paper, we compare these two families of inference methods on three sequence labeling datasets. We choose sequence labeling because it permits us to use exact inference as a benchmark in terms of speed, accuracy, and search error. Across datasets, we demonstrate that inference networks achieve a better speed/accuracy/search error trade-off than gradient descent, while also being faster than exact inference at similar accuracy levels. We find further benefit by…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lifu-tu/BenchmarkingApproximateInference
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Topic Modeling