Reinforcement Explanation Learning

Siddhant Agarwal; Owais Iqbal; Sree Aditya Buridi; Madda Manjusha,; Abir Das

arXiv:2111.13406·cs.CV·November 29, 2021

Reinforcement Explanation Learning

Siddhant Agarwal, Owais Iqbal, Sree Aditya Buridi, Madda Manjusha,, Abir Das

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning-based method for generating saliency maps as explanations for deep learning classifiers, improving efficiency and accuracy over existing black-box explanation techniques.

Contribution

It formulates saliency map generation as a sequential search problem and leverages reinforcement learning to produce high-quality explanations more efficiently.

Findings

01

Outperforms state-of-the-art methods in inference time

02

Maintains explanation quality while reducing computational cost

03

Validated on three benchmark datasets

Abstract

Deep Learning has become overly complicated and has enjoyed stellar success in solving several classical problems like image classification, object detection, etc. Several methods for explaining these decisions have been proposed. Black-box methods to generate saliency maps are particularly interesting due to the fact that they do not utilize the internals of the model to explain the decision. Most black-box methods perturb the input and observe the changes in the output. We formulate saliency map generation as a sequential search problem and leverage upon Reinforcement Learning (RL) to accumulate evidence from input images that most strongly support decisions made by a classifier. Such a strategy encourages to search intelligently for the perturbations that will lead to high-quality explanations. While successful black box explanation approaches need to rely on heavy computations and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning