# Interpretable Neural Predictions with Differentiable Binary Variables

**Authors:** Jasmijn Bastings, Wilker Aziz, Ivan Titov

arXiv: 1905.08160 · 2020-06-22

## TL;DR

This paper introduces a novel neural network approach that jointly learns to select interpretable rationales from text and classify based on them, using a differentiable method that avoids complex gradient estimators.

## Contribution

It proposes a mixed discrete-continuous latent model enabling gradient-based training of binary rationale selectors with explicit control over selection sparsity.

## Key findings

- Competitive performance on rationale extraction tasks
- Effective gradient-based training of binary selectors
- Potential applications in attention mechanisms

## Abstract

The success of neural networks comes hand in hand with a desire for more interpretability. We focus on text classifiers and make them more interpretable by having them provide a justification, a rationale, for their predictions. We approach this problem by jointly training two neural network models: a latent model that selects a rationale (i.e. a short and informative part of the input text), and a classifier that learns from the words in the rationale alone. Previous work proposed to assign binary latent masks to input positions and to promote short selections via sparsity-inducing penalties such as L0 regularisation. We propose a latent model that mixes discrete and continuous behaviour allowing at the same time for binary selections and gradient-based training without REINFORCE. In our formulation, we can tractably compute the expected value of penalties such as L0, which allows us to directly optimise the model towards a pre-specified text selection rate. We show that our approach is competitive with previous work on rationale extraction, and explore further uses in attention mechanisms.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.08160/full.md

## Figures

16 figures with captions in the complete paper: https://tomesphere.com/paper/1905.08160/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/1905.08160/full.md

---
Source: https://tomesphere.com/paper/1905.08160