Understanding Hidden Memories of Recurrent Neural Networks
Yao Ming, Shaozu Cao, Ruixiang Zhang, Zhen Li, Yuanzhe, Chen, Yangqiu Song, Huamin Qu

TL;DR
This paper introduces a visual analytics approach to interpret and compare RNN hidden states in NLP, enhancing understanding of their internal mechanisms and aiding model improvement.
Contribution
It presents a novel visualization technique for explaining RNN hidden units and their responses, facilitating deeper insights into model behavior.
Findings
Effective visualization of hidden states and word responses.
Improved understanding of RNN internal mechanisms.
Validated through case studies with domain experts.
Abstract
Recurrent neural networks (RNNs) have been successfully applied to various natural language processing (NLP) tasks and achieved better results than conventional methods. However, the lack of understanding of the mechanisms behind their effectiveness limits further improvements on their architectures. In this paper, we present a visual analytics method for understanding and comparing RNN models for NLP tasks. We propose a technique to explain the function of individual hidden state units based on their expected response to input texts. We then co-cluster hidden state units and words based on the expected response and visualize co-clustering results as memory chips and word clouds to provide more structured knowledge on RNNs' hidden states. We also propose a glyph-based sequence visualization based on aggregate information to analyze the behavior of an RNN's hidden state at the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)
