TL;DR
This paper introduces new feature attribution methods based on Information Bottleneck Attribution to interpret neural network predictions on chest X-rays, highlighting all informative regions and explaining regression models.
Contribution
It proposes Inverse IBA, Regression IBA, and Multi-layer IBA to improve interpretability of chest X-ray models, capturing all predictive cues and regional severity.
Findings
Inverse IBA identifies all informative regions for pathologies.
Regression IBA reveals models implicitly learn regional severity.
Multi-layer IBA produces detailed, high-resolution attribution maps.
Abstract
Neural networks have demonstrated remarkable performance in classification and regression tasks on chest X-rays. In order to establish trust in the clinical routine, the networks' prediction mechanism needs to be interpretable. One principal approach to interpretation is feature attribution. Feature attribution methods identify the importance of input features for the output prediction. Building on Information Bottleneck Attribution (IBA) method, for each prediction we identify the chest X-ray regions that have high mutual information with the network's output. Original IBA identifies input regions that have sufficient predictive information. We propose Inverse IBA to identify all informative regions. Thus all predictive cues for pathologies are highlighted on the X-rays, a desirable property for chest X-ray diagnosis. Moreover, we propose Regression IBA for explaining regression…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
