Investigating how well contextual features are captured by   bi-directional recurrent neural network models

Kushal Chawla; Sunil Kumar Sahu; Ashish Anand

arXiv:1709.00659·cs.CL·November 30, 2017·2 cites

Investigating how well contextual features are captured by bi-directional recurrent neural network models

Kushal Chawla, Sunil Kumar Sahu, Ashish Anand

PDF

Open Access

TL;DR

This paper evaluates how effectively bi-directional RNNs automatically learn and represent contextual features in sequence tagging tasks across general and biomedical domains, highlighting interpretability issues and analysis methods.

Contribution

It introduces three methods to analyze the ability of bi-directional RNNs to capture contextual features in sequence tagging, including positional effects and error analysis.

Findings

01

RNNs effectively capture important contextual words

02

Positional effects influence feature representation

03

Methods facilitate error analysis and interpretability

Abstract

Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using an only distributional representation of words have been successfully applied for several NLP tasks. Such models learn features automatically and avoid explicit feature engineering. Across several domains, neural models become a natural choice specifically when limited characteristics of data are known. However, this flexibility comes at the cost of interpretability. In this paper, we define three different methods to investigate ability of bi-directional recurrent neural networks (RNNs) in capturing contextual features. In particular, we analyze RNNs for sequence tagging tasks. We perform a comprehensive analysis on general as well as biomedical domain datasets. Our experiments focus on important contextual…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification