Increasing the Interpretability of Recurrent Neural Networks Using   Hidden Markov Models

Viktoriya Krakovna; Finale Doshi-Velez

arXiv:1606.05320·stat.ML·October 4, 2016·58 cites

Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models

Viktoriya Krakovna, Finale Doshi-Velez

PDF

Open Access

TL;DR

This paper proposes combining recurrent neural networks with hidden Markov models to enhance interpretability in speech recognition and translation tasks, demonstrating complementary learning of features.

Contribution

It introduces methods for integrating HMMs with RNNs, including training strategies and hybrid models, to improve model transparency and understanding.

Findings

01

HMMs and LSTMs learn complementary information

02

Hybrid models improve interpretability

03

Combining models enhances understanding of RNN decisions

Abstract

As deep neural networks continue to revolutionize various application domains, there is increasing interest in making these powerful models more understandable and interpretable, and narrowing down the causes of good and bad predictions. We focus on recurrent neural networks (RNNs), state of the art models in speech recognition and translation. Our approach to increasing interpretability is by combining an RNN with a hidden Markov model (HMM), a simpler and more transparent model. We explore various combinations of RNNs and HMMs: an HMM trained on LSTM states; a hybrid model where an HMM is trained first, then a small LSTM is given HMM state distributions and trained to fill in gaps in the HMM's performance; and a jointly trained hybrid model. We find that the LSTM and HMM learn complementary information about the features in the text.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Topic Modeling

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory