A Survey Of Methods For Explaining Black Box Models

Riccardo Guidotti; Anna Monreale; Salvatore Ruggieri; Franco Turini,; Dino Pedreschi; Fosca Giannotti

arXiv:1802.01933·cs.CY·June 22, 2018

A Survey Of Methods For Explaining Black Box Models

Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Franco Turini,, Dino Pedreschi, Fosca Giannotti

PDF

TL;DR

This survey reviews various methods for explaining black box models, categorizing approaches based on problem type, black box system, and explanation goals to guide researchers in selecting suitable interpretability techniques.

Contribution

It provides a comprehensive classification of existing explanation methods for black box models, clarifying their applicability and highlighting open research questions.

Findings

01

Classifies explanation approaches based on problem and model types

02

Helps researchers identify relevant interpretability methods

03

Highlights open challenges in black box explainability

Abstract

In the last years many accurate decision support systems have been constructed as black boxes, that is as systems that hide their internal logic to the user. This lack of explanation constitutes both a practical and an ethical issue. The literature reports many approaches aimed at overcoming this crucial weakness sometimes at the cost of scarifying accuracy for interpretability. The applications in which black box decision systems can be used are various, and each approach is typically developed to provide a solution for a specific problem and, as a consequence, delineating explicitly or implicitly its own definition of interpretability and explanation. The aim of this paper is to provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box system. Given a problem definition, a black box type, and a desired…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsInterpretability