Contextualized Non-local Neural Networks for Sequence Learning

Pengfei Liu; Shuaichen Chang; Xuanjing Huang; Jian Tang; Jackie Chi; Kit Cheung

arXiv:1811.08600·cs.CL·November 22, 2018·6 cites

Contextualized Non-local Neural Networks for Sequence Learning

Pengfei Liu, Shuaichen Chang, Xuanjing Huang, Jian Tang, Jackie Chi, Kit Cheung

PDF

Open Access

TL;DR

This paper introduces contextualized non-local neural networks (CN3) that combine self-attention and graph neural networks to improve sequence learning by dynamically modeling task-specific structures and local dependencies.

Contribution

The paper presents CN3, a novel model that integrates the strengths of self-attention and GNNs for enhanced sequence learning and interpretability.

Findings

01

Outperforms competitive baselines on ten NLP tasks.

02

Effectively models task-specific dependency structures.

03

Provides improved interpretability of learned structures.

Abstract

Recently, a large number of neural mechanisms and models have been proposed for sequence learning, of which self-attention, as exemplified by the Transformer model, and graph neural networks (GNNs) have attracted much attention. In this paper, we propose an approach that combines and draws on the complementary strengths of these two methods. Specifically, we propose contextualized non-local neural networks (CN $^{3}$ ), which can both dynamically construct a task-specific structure of a sentence and leverage rich local dependencies within a particular neighborhood. Experimental results on ten NLP tasks in text classification, semantic matching, and sequence labeling show that our proposed model outperforms competitive baselines and discovers task-specific dependency structures, thus providing better interpretability to users.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Natural Language Processing Techniques

MethodsInterpretability