Fill in the blanks: Rethinking Interpretability in vision
Pathirage N. Deelaka, Tharindu Wickremasinghe, Devin Y. De Silva,, Lisara N. Gajaweera

TL;DR
This paper proposes a novel, model-agnostic approach to interpretability in vision models by analyzing how they fill in masked images, revealing consistent input structure patterns and enhancing explainability tools.
Contribution
It introduces a new perspective on vision model interpretability by examining fill-in-the-blank behavior, which is more adaptable across models than traditional saliency-based methods.
Findings
Consistent patterns in model fill-in behavior across datasets
Potential integration as a model-agnostic explainability tool
Improved understanding of learned input structures
Abstract
Model interpretability is a key challenge that has yet to align with the advancements observed in contemporary state-of-the-art deep learning models. In particular, deep learning aided vision tasks require interpretability, in order for their adoption in more specialized domains such as medical imaging. Although the field of explainable AI (XAI) developed methods for interpreting vision models along with early convolutional neural networks, recent XAI research has mainly focused on assigning attributes via saliency maps. As such, these methods are restricted to providing explanations at a sample level, and many explainability methods suffer from low adaptability across a wide range of vision models. In our work, we re-think vision-model explainability from a novel perspective, to probe the general input structure that a model has learnt during its training. To this end, we ask the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterpreting and Communication in Healthcare
MethodsALIGN
