Embedded-State Latent Conditional Random Fields for Sequence Labeling
Dung Thai, Sree Harsha Ramesh, Shikhar Murty, Luke Vilnis, Andrew, McCallum

TL;DR
This paper introduces a novel sequence labeling model combining RNNs with a latent variable graphical model that captures complex non-local constraints, enabling more accurate information extraction.
Contribution
It proposes an embedded-state latent CRF model with low-rank transition matrices, integrating rich RNN features with a global graphical model for complex output constraints.
Findings
Outperforms baseline CRF+RNN models on multiple datasets.
Effectively models complex non-local output constraints.
Provides interpretable latent structures.
Abstract
Complex textual information extraction tasks are often posed as sequence labeling or \emph{shallow parsing}, where fields are extracted using local labels made consistent through probabilistic inference in a graphical model with constrained transitions. Recently, it has become common to locally parametrize these models using rich features extracted by recurrent neural networks (such as LSTM), while enforcing consistent outputs through a simple linear-chain model, representing Markovian dependencies between successive labels. However, the simple graphical model structure belies the often complex non-local constraints between output labels. For example, many fields, such as a first name, can only occur a fixed number of times, or in the presence of other fields. While RNNs have provided increasingly powerful context-aware local features for sequence tagging, they have yet to be integrated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsConditional Random Field
