Learning Soft Linear Constraints with Application to Citation Field Extraction

Sam Anzaroot; Alexandre Passos; David Belanger; Andrew McCallum

arXiv:1403.1349 · cs.CL · October 20, 2014 · 2 cites

TL;DR

This paper introduces a method for learning and applying soft linear constraints to improve citation field extraction, enabling automatic constraint generation and cost learning, resulting in significant accuracy gains.

Contribution

It extends dual decomposition to handle soft constraints, allowing automatic generation and learning of constraints for better citation segmentation.

Findings

01. Substantial accuracy gains on a new, challenging citation field extraction dataset

02. Automatic generation of large families of candidate constraints

03. Constraint costs learned by solving a simple convex optimization problem during training

Abstract

Accurately segmenting a citation string into fields for authors, titles, etc. is a challenging task because the output typically obeys various global constraints. Previous work has shown that modeling soft constraints, where the model is encouraged, but not required, to obey the constraints, can substantially improve segmentation performance. On the other hand, for imposing hard constraints, dual decomposition is a popular technique for efficient prediction given existing algorithms for unconstrained inference. We extend the technique to perform prediction subject to soft constraints. Moreover, with a technique for performing inference given soft constraints, it is easy to automatically generate large families of constraints and learn their costs with a simple convex optimization problem during training. This allows us to obtain substantial gains in accuracy on a new, challenging citation…
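
The idea of prediction under a soft constraint can be illustrated with a small sketch. It is not the authors' implementation, and it makes several assumptions not stated above: a single linear constraint of the form "label `label` appears at most `bound` times", a hinge penalty of `cost` per unit of violation, and a per-position argmax standing in for the unconstrained inference routine (a real sequence model would call Viterbi there). The function name, parameters, and toy scores are all hypothetical. Under a hinge penalty the dual variable is bounded above by the cost, so the only change from the hard-constraint subgradient update is clipping to the interval [0, cost].

```python
def soft_dual_decomposition(scores, label, bound, cost, step=0.5, iters=50):
    """Toy projected-subgradient loop for one soft linear constraint.

    scores[i][k] is the local score of label k at position i (hypothetical input).
    Soft constraint: count(label) <= bound; each unit of violation costs `cost`.
    Returns the last labeling and the final dual variable.
    """
    lam = 0.0  # dual variable for the constraint
    y = []
    for t in range(iters):
        # Unconstrained inference on penalized scores. Positions are
        # independent here, so this is a per-position argmax (assumption).
        y = []
        for row in scores:
            adjusted = [s - lam if k == label else s for k, s in enumerate(row)]
            y.append(max(range(len(row)), key=lambda k: adjusted[k]))

        violation = y.count(label) - bound

        # Subgradient step, then projection onto [0, cost].
        # A hard constraint would project onto [0, infinity) instead.
        new_lam = min(cost, max(0.0, lam + step / (1 + t) * violation))

        # If the dual variable no longer moves, the labeling is optimal
        # for the penalized objective in this toy setting.
        if new_lam == lam:
            break
        lam = new_lam
    return y, lam


# Toy example: label 1 should appear at most once.
toy_scores = [[0.0, 2.0], [0.0, 1.0], [1.0, 0.0]]
expensive = soft_dual_decomposition(toy_scores, label=1, bound=1, cost=5.0)
cheap = soft_dual_decomposition(toy_scores, label=1, bound=1, cost=0.5)
```

With a high cost the constraint is obeyed and the second position gives up label 1. With a low cost the dual variable saturates at the cost and the model keeps both occurrences, because paying the penalty is cheaper than losing the local score. If the loop runs out of iterations before the dual variable stops moving, the returned labeling is not guaranteed to be optimal.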

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Rough Sets and Fuzzy Logic