Increasing the Interpretability of Recurrent Neural Networks Using   Hidden Markov Models

Viktoriya Krakovna; Finale Doshi-Velez

arXiv:1611.05934·stat.ML·November 21, 2016

Increasing the Interpretability of Recurrent Neural Networks Using Hidden Markov Models

Viktoriya Krakovna, Finale Doshi-Velez

PDF

Open Access

TL;DR

This paper proposes a hybrid approach combining LSTM neural networks with Hidden Markov Models to improve interpretability and performance in speech recognition and translation tasks.

Contribution

It introduces a method to integrate HMMs with LSTMs, enhancing interpretability and often outperforming standalone LSTMs, especially on smaller datasets.

Findings

01

Hybrid models outperform standalone LSTMs on small datasets.

02

LSTM and HMM learn complementary information.

03

The approach improves interpretability of neural networks.

Abstract

As deep neural networks continue to revolutionize various application domains, there is increasing interest in making these powerful models more understandable and interpretable, and narrowing down the causes of good and bad predictions. We focus on recurrent neural networks, state of the art models in speech recognition and translation. Our approach to increasing interpretability is by combining a long short-term memory (LSTM) model with a hidden Markov model (HMM), a simpler and more transparent model. We add the HMM state probabilities to the output layer of the LSTM, and then train the HMM and LSTM either sequentially or jointly. The LSTM can make use of the information from the HMM, and fill in the gaps when the HMM is not performing well. A small hybrid model usually performs better than a standalone LSTM of the same size, especially on smaller data sets. We test the algorithms on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Topic Modeling · Natural Language Processing Techniques

MethodsSigmoid Activation · Tanh Activation · Long Short-Term Memory