Embedded-State Latent Conditional Random Fields for Sequence Labeling

Dung Thai; Sree Harsha Ramesh; Shikhar Murty; Luke Vilnis; Andrew; McCallum

arXiv:1809.10835·cs.LG·October 1, 2018

Embedded-State Latent Conditional Random Fields for Sequence Labeling

Dung Thai, Sree Harsha Ramesh, Shikhar Murty, Luke Vilnis, Andrew, McCallum

PDF

TL;DR

This paper introduces a novel sequence labeling model combining RNNs with a latent variable graphical model that captures complex non-local constraints, enabling more accurate information extraction.

Contribution

It proposes an embedded-state latent CRF model with low-rank transition matrices, integrating rich RNN features with a global graphical model for complex output constraints.

Findings

01

Outperforms baseline CRF+RNN models on multiple datasets.

02

Effectively models complex non-local output constraints.

03

Provides interpretable latent structures.

Abstract

Complex textual information extraction tasks are often posed as sequence labeling or \emph{shallow parsing}, where fields are extracted using local labels made consistent through probabilistic inference in a graphical model with constrained transitions. Recently, it has become common to locally parametrize these models using rich features extracted by recurrent neural networks (such as LSTM), while enforcing consistent outputs through a simple linear-chain model, representing Markovian dependencies between successive labels. However, the simple graphical model structure belies the often complex non-local constraints between output labels. For example, many fields, such as a first name, can only occur a fixed number of times, or in the presence of other fields. While RNNs have provided increasingly powerful context-aware local features for sequence tagging, they have yet to be integrated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsConditional Random Field