Investigating how well contextual features are captured by bi-directional recurrent neural network models
Kushal Chawla, Sunil Kumar Sahu, Ashish Anand

TL;DR
This paper evaluates how effectively bi-directional RNNs automatically learn and represent contextual features in sequence tagging tasks across general and biomedical domains, highlighting interpretability issues and analysis methods.
Contribution
It introduces three methods to analyze the ability of bi-directional RNNs to capture contextual features in sequence tagging, including positional effects and error analysis.
Findings
RNNs effectively capture important contextual words
Positional effects influence feature representation
Methods facilitate error analysis and interpretability
Abstract
Learning algorithms for natural language processing (NLP) tasks traditionally rely on manually defined relevant contextual features. On the other hand, neural network models using an only distributional representation of words have been successfully applied for several NLP tasks. Such models learn features automatically and avoid explicit feature engineering. Across several domains, neural models become a natural choice specifically when limited characteristics of data are known. However, this flexibility comes at the cost of interpretability. In this paper, we define three different methods to investigate ability of bi-directional recurrent neural networks (RNNs) in capturing contextual features. In particular, we analyze RNNs for sequence tagging tasks. We perform a comprehensive analysis on general as well as biomedical domain datasets. Our experiments focus on important contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Machine Learning and Data Classification
