DD-CAM: Minimal Sufficient Explanations for Vision Models Using Delta Debugging
Krishna Khadka, Yu Lei, Raghu N. Kacker, D. Richard Kuhn

TL;DR
DD-CAM introduces a gradient-free, delta debugging-based method to generate minimal, decision-preserving explanations for vision models, resulting in more faithful and accurate saliency maps compared to existing CAM-based methods.
Contribution
The paper presents DD-CAM, a novel framework that isolates minimal sufficient feature subsets for explanations using delta debugging, improving faithfulness and localization in vision model explanations.
Findings
Produces more faithful explanations than existing methods.
Achieves higher localization accuracy.
Generates minimal, decision-preserving saliency maps.
Abstract
We introduce a gradient-free framework for identifying minimal, sufficient, and decision-preserving explanations in vision models by isolating the smallest subset of representational units whose joint activation preserves predictions. Unlike existing approaches that aggregate all units, often leading to cluttered saliency maps, our approach, DD-CAM, identifies a 1-minimal subset whose joint activation suffices to preserve the prediction (i.e., removing any unit from the subset alters the prediction). To efficiently isolate minimal sufficient subsets, we adapt delta debugging, a systematic reduction strategy from software debugging, and configure its search strategy based on unit interactions in the classifier head: testing individual units for models with non-interacting units and testing unit combinations for models in which unit interactions exist. We then generate minimal,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Multimodal Machine Learning Applications
