A Survey Of Methods For Explaining Black Box Models
Riccardo Guidotti, Anna Monreale, Salvatore Ruggieri, Franco Turini,, Dino Pedreschi, Fosca Giannotti

TL;DR
This survey reviews various methods for explaining black box models, categorizing approaches based on problem type, black box system, and explanation goals to guide researchers in selecting suitable interpretability techniques.
Contribution
It provides a comprehensive classification of existing explanation methods for black box models, clarifying their applicability and highlighting open research questions.
Findings
Classifies explanation approaches based on problem and model types
Helps researchers identify relevant interpretability methods
Highlights open challenges in black box explainability
Abstract
In the last years many accurate decision support systems have been constructed as black boxes, that is as systems that hide their internal logic to the user. This lack of explanation constitutes both a practical and an ethical issue. The literature reports many approaches aimed at overcoming this crucial weakness sometimes at the cost of scarifying accuracy for interpretability. The applications in which black box decision systems can be used are various, and each approach is typically developed to provide a solution for a specific problem and, as a consequence, delineating explicitly or implicitly its own definition of interpretability and explanation. The aim of this paper is to provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box system. Given a problem definition, a black box type, and a desired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
