Towards Counterfactual and Contrastive Explainability and Transparency   of DCNN Image Classifiers

Syed Ali Tariq; Tehseen Zia; Mubeen Ghafoor

arXiv:2501.06831·cs.CV·January 14, 2025

Towards Counterfactual and Contrastive Explainability and Transparency of DCNN Image Classifiers

Syed Ali Tariq, Tehseen Zia, Mubeen Ghafoor

PDF

TL;DR

This paper introduces a model-intrusive method for generating contrastive and counterfactual explanations of DCNN image classifiers, enhancing interpretability and transparency by analyzing internal filters and concepts without altering input images.

Contribution

The paper presents a novel approach that probes internal DCNN filters for explanations, offering contrastive and counterfactual insights without modifying input images, improving transparency.

Findings

01

Effective in identifying key filters and concepts influencing decisions

02

Provides meaningful contrastive and counterfactual explanations

03

Evaluated successfully on CUB 2011 dataset

Abstract

Explainability of deep convolutional neural networks (DCNNs) is an important research topic that tries to uncover the reasons behind a DCNN model's decisions and improve their understanding and reliability in high-risk environments. In this regard, we propose a novel method for generating interpretable counterfactual and contrastive explanations for DCNN models. The proposed method is model intrusive that probes the internal workings of a DCNN instead of altering the input image to generate explanations. Given an input image, we provide contrastive explanations by identifying the most important filters in the DCNN representing features and concepts that separate the model's decision between classifying the image to the original inferred class or some other specified alter class. On the other hand, we provide counterfactual explanations by specifying the minimal changes necessary in such…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion-Convolutional Neural Networks