# Do Human Rationales Improve Machine Explanations?

**Authors:** Julia Strout, Ye Zhang, Raymond J. Mooney

arXiv: 1905.13714 · 2019-06-03

## TL;DR

This paper demonstrates that incorporating human rationales in training improves the quality of machine explanations, making them more understandable to humans, especially in CNN-based text classification.

## Contribution

It connects learning with rationales to explainable AI, showing that human-provided explanations enhance machine explanation quality evaluated by humans.

## Key findings

- Supervised attention yields better explanations than unsupervised attention.
- Learning with rationales improves explanation quality as judged by humans.
- The approach enhances interpretability in CNN-based text classification.

## Abstract

Work on "learning with rationales" shows that humans providing explanations to a machine learning system can improve the system's predictive accuracy. However, this work has not been connected to work in "explainable AI" which concerns machines explaining their reasoning to humans. In this work, we show that learning with rationales can also improve the quality of the machine's explanations as evaluated by human judges. Specifically, we present experiments showing that, for CNN- based text classification, explanations generated using "supervised attention" are judged superior to explanations generated using normal unsupervised attention.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1905.13714/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/1905.13714/full.md

## References

25 references — full list in the complete paper: https://tomesphere.com/paper/1905.13714/full.md

---
Source: https://tomesphere.com/paper/1905.13714