Domain aware medical image classifier interpretation by counterfactual impact analysis
Dimitrios Lenis, David Major, Maria Wimmer, Astrid Berg, Gert Sluiter,, and Katja B\"uhler

TL;DR
This paper introduces a neural-network based attribution method for medical image classification that improves interpretability by accurately identifying salient regions without heuristic artifacts, enhancing clinical relevance.
Contribution
It presents a novel, efficient attribution technique using neighborhood conditioned inpainting, surpassing heuristic methods in clarity and anatomical plausibility.
Findings
Significantly reduces localization ambiguity in medical images.
Achieves clearer and more accurate attribution results.
Demonstrates generalizability across mammography and chest X-ray data.
Abstract
The success of machine learning methods for computer vision tasks has driven a surge in computer assisted prediction for medicine and biology. Based on a data-driven relationship between input image and pathological classification, these predictors deliver unprecedented accuracy. Yet, the numerous approaches trying to explain the causality of this learned relationship have fallen short: time constraints, coarse, diffuse and at times misleading results, caused by the employment of heuristic techniques like Gaussian noise and blurring, have hindered their clinical adoption. In this work, we discuss and overcome these obstacles by introducing a neural-network based attribution method, applicable to any trained predictor. Our solution identifies salient regions of an input image in a single forward-pass by measuring the effect of local image-perturbations on a predictor's score. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
