An Investigation of Potential Function Designs for Neural CRF

Zechuan Hu; Yong Jiang; Nguyen Bach; Tao Wang; Zhongqiang Huang; Fei; Huang; Kewei Tu

arXiv:2011.05604·cs.CL·April 26, 2021

An Investigation of Potential Function Designs for Neural CRF

Zechuan Hu, Yong Jiang, Nguyen Bach, Tao Wang, Zhongqiang Huang, Fei, Huang, Kewei Tu

PDF

Open Access

TL;DR

This paper explores various potential function designs for neural CRF models, demonstrating that a decomposed quadrilinear potential function leveraging contextual word representations yields superior sequence labeling performance.

Contribution

It introduces and evaluates a novel quadrilinear potential function that explicitly models interactions between labels and contextual words in neural CRF models.

Findings

01

Quadrilinear potential function outperforms other designs.

02

Explicit modeling of contextual words improves accuracy.

03

Best performance achieved with the proposed potential function.

Abstract

The neural linear-chain CRF model is one of the most widely-used approach to sequence labeling. In this paper, we investigate a series of increasingly expressive potential functions for neural CRF models, which not only integrate the emission and transition functions, but also explicitly take the representations of the contextual words as input. Our extensive experiments show that the decomposed quadrilinear potential function based on the vector representations of two neighboring labels and two neighboring words consistently achieves the best performance.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Blind Source Separation Techniques · Machine Learning in Bioinformatics

MethodsConditional Random Field