Position-Aware Self-Attention based Neural Sequence Labeling

Wei Wei; Zanbo Wang; Xianling Mao; Guangyou Zhou; Pan Zhou; Sheng; Jiang

arXiv:1908.09128·cs.CL·October 19, 2021

Position-Aware Self-Attention based Neural Sequence Labeling

Wei Wei, Zanbo Wang, Xianling Mao, Guangyou Zhou, Pan Zhou, Sheng, Jiang

PDF

TL;DR

This paper introduces a position-aware self-attention model for sequence labeling tasks that effectively captures both successive and discrete token dependencies, outperforming existing models without external knowledge.

Contribution

The paper proposes a novel position-aware self-attention mechanism and context fusion layer to improve sequence labeling by modeling complex token relations.

Findings

01

Outperforms state-of-the-art models on POS tagging, NER, and phrase chunking

02

Effectively captures both continuous and discrete token dependencies

03

No external knowledge required for high performance

Abstract

Sequence labeling is a fundamental task in natural language processing and has been widely studied. Recently, RNN-based sequence labeling models have increasingly gained attentions. Despite superior performance achieved by learning the long short-term (i.e., successive) dependencies, the way of sequentially processing inputs might limit the ability to capture the non-continuous relations over tokens within a sentence. To tackle the problem, we focus on how to effectively model successive and discrete dependencies of each token for enhancing the sequence labeling performance. Specifically, we propose an innovative attention-based model (called position-aware selfattention, i.e., PSA) as well as a well-designed self-attentional context fusion layer within a neural network architecture, to explore the positional information of an input sequence for capturing the latent relations among…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.