Explainable Deep Learning: A Field Guide for the Uninitiated
Gabrielle Ras, Ning Xie, Marcel van Gerven, Derek Doran

TL;DR
This paper provides an accessible overview of explainable deep learning, detailing foundational methods, evaluation strategies, related research areas, and future directions to help newcomers understand and navigate the field.
Contribution
It offers a structured, beginner-friendly field guide that clarifies core concepts, evaluation methods, and research context in explainable deep learning.
Findings
Introduces three dimensions of explainable deep learning methods
Discusses evaluation strategies for model explanations
Highlights future research directions in explainability
Abstract
Deep neural networks (DNNs) have become a proven and indispensable machine learning tool. As a black-box model, it remains difficult to diagnose what aspects of the model's input drive the decisions of a DNN. In countless real-world domains, from legislation and law enforcement to healthcare, such diagnosis is essential to ensure that DNN decisions are driven by aspects appropriate in the context of its use. The development of methods and studies enabling the explanation of a DNN's decisions has thus blossomed into an active, broad area of research. A practitioner wanting to study explainable deep learning may be intimidated by the plethora of orthogonal directions the field has taken. This complexity is further exacerbated by competing definitions of what it means ``to explain'' the actions of a DNN and to evaluate an approach's ``ability to explain''. This article offers a field guide…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
