Aligning Characteristic Descriptors with Images for Human-Expert-like Explainability
Bharat Chandra Yalavarthi, Nalini Ratha

TL;DR
This paper introduces a novel explainability method for deep learning models in critical domains, using characteristic descriptors and a concept bottleneck to generate expert-like natural language explanations based on image features.
Contribution
The paper proposes a new approach that employs characteristic descriptors and a concept bottleneck layer to produce faithful, natural language explanations aligned with human expert reasoning.
Findings
Outperforms saliency map-based explanations in experiments
Provides inherently faithful and interpretable model decisions
Effective in face recognition and chest X-ray diagnosis
Abstract
In mission-critical domains such as law enforcement and medical diagnosis, the ability to explain and interpret the outputs of deep learning models is crucial for ensuring user trust and supporting informed decision-making. Despite advancements in explainability, existing methods often fall short in providing explanations that mirror the depth and clarity of those given by human experts. Such expert-level explanations are essential for the dependable application of deep learning models in law enforcement and medical contexts. Additionally, we recognize that most explanations in real-world scenarios are communicated primarily through natural language. Addressing these needs, we propose a novel approach that utilizes characteristic descriptors to explain model decisions by identifying their presence in images, thereby generating expert-like explanations. Our method incorporates a concept…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI)
