Feature Attribution Stability Suite: How Stable Are Post-Hoc Attributions?

Kamalasankari Subramaniakuppusamy; Jugal Gajjar

arXiv:2604.02532 · cs.CV · April 6, 2026

TL;DR

The paper introduces FASS, a benchmark suite that measures how stable post-hoc feature attributions remain under realistic geometric, photometric, and compression perturbations, comparing attributions only when the model's prediction is unchanged.

Contribution

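FASS provides an evaluation framework that conditions on prediction invariance and reports three complementary stability metrics (structural similarity, rank correlation, and top-k Jaccard overlap) across several perturbation families, four architectures, and three datasets.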

Findings

01

Geometric perturbations cause more attribution instability than photometric ones.

02

Without prediction-invariance filtering, up to 99% of attribution pairs involve prediction changes.

03

Grad-CAM shows the highest stability among evaluated attribution methods.
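
The prediction-invariance filtering behind finding 02 can be illustrated with a minimal sketch. It assumes the filter keeps an (original, perturbed) pair only when the predicted class (argmax) is the same for both inputs; the paper's exact criterion is not given here, and the function names below are hypothetical, not taken from FASS.

```python
from typing import List, Sequence


def argmax(scores: Sequence[float]) -> int:
    """Index of the largest score (the predicted class)."""
    return max(range(len(scores)), key=lambda i: scores[i])


def prediction_invariant_indices(
    clean_scores: Sequence[Sequence[float]],
    perturbed_scores: Sequence[Sequence[float]],
) -> List[int]:
    """Indices of samples whose predicted class survives the perturbation.

    Assumption: "prediction invariance" means an unchanged argmax class.
    """
    return [
        i
        for i, (clean, perturbed) in enumerate(zip(clean_scores, perturbed_scores))
        if argmax(clean) == argmax(perturbed)
    ]


def changed_fraction(
    clean_scores: Sequence[Sequence[float]],
    perturbed_scores: Sequence[Sequence[float]],
) -> float:
    """Fraction of pairs that an unfiltered comparison would mix in."""
    kept = prediction_invariant_indices(clean_scores, perturbed_scores)
    return 1.0 - len(kept) / len(clean_scores)


# Toy class scores for three images, before and after a perturbation.
clean = [[0.1, 0.9], [0.8, 0.2], [0.3, 0.7]]
perturbed = [[0.2, 0.8], [0.4, 0.6], [0.45, 0.55]]

kept = prediction_invariant_indices(clean, perturbed)
print(kept)  # only these pairs would enter the stability metrics
print(changed_fraction(clean, perturbed))
```

In this toy data the second image flips class under the perturbation, so its attribution pair is dropped: any difference between its two attribution maps could reflect the changed prediction rather than an unstable explanation.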

Abstract

Post-hoc feature attribution methods are widely deployed in safety-critical vision systems, yet their stability under realistic input perturbations remains poorly characterized. Existing metrics evaluate explanations primarily under additive noise, collapse stability to a single scalar, and fail to condition on prediction preservation, conflating explanation fragility with model sensitivity. We introduce the Feature Attribution Stability Suite (FASS), a benchmark that enforces prediction-invariance filtering, decomposes stability into three complementary metrics (structural similarity, rank correlation, and top-k Jaccard overlap), and evaluates across geometric, photometric, and compression perturbations. Evaluating four attribution methods (Integrated Gradients, GradientSHAP, Grad-CAM, LIME) across four architectures and three datasets (ImageNet-1K, MS COCO, and CIFAR-10), FASS shows that…
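
To make the three metrics concrete, here is a rough sketch on flattened attribution maps. It rests on several assumptions that the abstract does not confirm: rank correlation is taken to be Spearman's rho, structural similarity is reduced to a single global window (standard SSIM uses local windows), and all function names are hypothetical rather than the FASS implementation.

```python
from statistics import mean
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; undefined (division by zero) for constant input."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    return cov / (vx * vy) ** 0.5


def spearman(a: Sequence[float], b: Sequence[float]) -> float:
    """Rank correlation (assumed here to be Spearman's rho)."""
    return pearson(average_ranks(a), average_ranks(b))


def topk_jaccard(a: Sequence[float], b: Sequence[float], k: int) -> float:
    """Jaccard overlap of the k highest-attribution positions in each map."""
    top_a = set(sorted(range(len(a)), key=lambda i: a[i], reverse=True)[:k])
    top_b = set(sorted(range(len(b)), key=lambda i: b[i], reverse=True)[:k])
    return len(top_a & top_b) / len(top_a | top_b)


def global_ssim(a: Sequence[float], b: Sequence[float], data_range: float = 1.0) -> float:
    """Single-window SSIM over the whole map (a simplification)."""
    c1, c2 = (0.01 * data_range) ** 2, (0.03 * data_range) ** 2
    ma, mb = mean(a), mean(b)
    va = mean([(x - ma) ** 2 for x in a])
    vb = mean([(y - mb) ** 2 for y in b])
    cov = mean([(x - ma) * (y - mb) for x, y in zip(a, b)])
    return ((2 * ma * mb + c1) * (2 * cov + c2)) / ((ma**2 + mb**2 + c1) * (va + vb + c2))


# Toy flattened attribution maps for one image before and after a perturbation.
original = [0.1, 0.5, 0.9, 0.3]
perturbed = [0.9, 0.5, 0.1, 0.3]

print(spearman(original, perturbed))
print(topk_jaccard(original, perturbed, k=2))
print(global_ssim(original, perturbed))
```

Reporting the three numbers separately, rather than as one scalar, is what lets them disagree: a shifted map can keep its overall ranking while losing most of its top-k overlap, or the reverse.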
