SAM: The Sensitivity of Attribution Methods to Hyperparameters
Naman Bansal, Chirag Agarwal, Anh Nguyen

TL;DR
This paper investigates how attribution methods' explanations are highly sensitive to hyperparameter choices, which affects their reliability, reproducibility, and user trust, revealing that robustness varies across classifiers.
Contribution
It provides a comprehensive empirical analysis showing the high sensitivity of attribution methods to hyperparameters and highlights the unexpected robustness of explanations for robust classifiers.
Findings
Many attribution methods are highly sensitive to hyperparameters.
Changing a random seed can significantly alter explanations.
Explanations for robust classifiers are more stable.
Abstract
Attribution methods can provide powerful insights into the reasons for a classifier's decision. We argue that a key desideratum of an explanation method is its robustness to input hyperparameters which are often randomly set or empirically tuned. High sensitivity to arbitrary hyperparameter choices does not only impede reproducibility but also questions the correctness of an explanation and impairs the trust of end-users. In this paper, we provide a thorough empirical study on the sensitivity of existing attribution methods. We found an alarming trend that many methods are highly sensitive to changes in their common hyperparameters e.g. even changing a random seed can yield a different explanation! Interestingly, such sensitivity is not reflected in the average explanation accuracy scores over the dataset as commonly reported in the literature. In addition, explanations generated for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
SAM: The Sensitivity of Attribution Methods to Hyperparameters· youtube
SAM: The Sensitivity of Attribution Methods to Hyperparameters· youtube
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning and Data Classification
