Learning Constraints for Structured Prediction Using Rectifier Networks

Xingyuan Pan; Maitrey Mehta; Vivek Srikumar

arXiv:2006.01209·cs.CL·June 3, 2020·1 cites

Learning Constraints for Structured Prediction Using Rectifier Networks

Xingyuan Pan, Maitrey Mehta, Vivek Srikumar

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to learn constraints for structured prediction in NLP tasks by training rectifier networks, which can improve accuracy especially with limited training data.

Contribution

It presents a novel approach to automatically learn output constraints using rectifier networks, reducing reliance on domain expertise.

Findings

01

Learned constraints improve prediction accuracy.

02

Method is effective with small training datasets.

03

Constraints can be converted into linear forms for inference.

Abstract

Various natural language processing tasks are structured prediction problems where outputs are constructed with multiple interdependent decisions. Past work has shown that domain knowledge, framed as constraints over the output space, can help improve predictive accuracy. However, designing good constraints often relies on domain expertise. In this paper, we study the problem of learning such constraints. We frame the problem as that of training a two-layer rectifier network to identify valid structures or substructures, and show a construction for converting a trained network into a system of linear constraints over the inference variables. Our experiments on several NLP tasks show that the learned constraints can improve the prediction accuracy, especially when the number of training examples is small.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

utahnlp/learning-constraints
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification