Benchmarking Approximate Inference Methods for Neural Structured Prediction
Lifu Tu, Kevin Gimpel

TL;DR
This paper compares approximate inference methods for neural structured prediction, showing inference networks outperform gradient descent in speed and accuracy trade-offs across sequence labeling tasks.
Contribution
It provides a comprehensive benchmark of inference networks versus gradient descent for neural structured prediction, highlighting the advantages of inference networks.
Findings
Inference networks outperform gradient descent in speed and accuracy.
Inference networks are faster than exact inference at similar accuracy.
Combining inference networks with gradient descent improves results.
Abstract
Exact structured inference with neural network scoring functions is computationally challenging but several methods have been proposed for approximating inference. One approach is to perform gradient descent with respect to the output structure directly (Belanger and McCallum, 2016). Another approach, proposed recently, is to train a neural network (an "inference network") to perform inference (Tu and Gimpel, 2018). In this paper, we compare these two families of inference methods on three sequence labeling datasets. We choose sequence labeling because it permits us to use exact inference as a benchmark in terms of speed, accuracy, and search error. Across datasets, we demonstrate that inference networks achieve a better speed/accuracy/search error trade-off than gradient descent, while also being faster than exact inference at similar accuracy levels. We find further benefit by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning and Data Classification · Topic Modeling
