Accurate and Robust Feature Importance Estimation under Distribution Shifts
Jayaraman J. Thiagarajan, Vivek Narayanaswamy, Rushil Anirudh,, Peer-Timo Bremer, Andreas Spanias

TL;DR
PRoFILE is a new post-hoc feature importance estimation method that accurately and robustly handles distribution shifts without re-training, improving interpretability of black-box models in real-world scenarios.
Contribution
It introduces PRoFILE, a novel approach combining a loss estimator and causal objectives to improve feature importance estimation under distribution shifts without additional re-training.
Findings
Significant improvements over state-of-the-art methods in fidelity and robustness.
Effective detection of distribution shifts using proposed learning strategies.
Maintains accuracy of feature importance estimates under complex domain shifts.
Abstract
With increasing reliance on the outcomes of black-box models in critical applications, post-hoc explainability tools that do not require access to the model internals are often used to enable humans understand and trust these models. In particular, we focus on the class of methods that can reveal the influence of input features on the predicted outputs. Despite their wide-spread adoption, existing methods are known to suffer from one or more of the following challenges: computational complexities, large uncertainties and most importantly, inability to handle real-world domain shifts. In this paper, we propose PRoFILE, a novel feature importance estimation method that addresses all these challenges. Through the use of a loss estimator jointly trained with the predictive model and a causal objective, PRoFILE can accurately estimate the feature importance scores even under complex…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMedical Imaging Techniques and Applications · Machine Learning in Healthcare · Explainable Artificial Intelligence (XAI)
MethodsDropout
