Manifold Integrated Gradients: Riemannian Geometry for Feature   Attribution

Eslam Zaher; Maciej Trzaskowski; Quan Nguyen; Fred Roosta

arXiv:2405.09800·cs.LG·May 17, 2024

Manifold Integrated Gradients: Riemannian Geometry for Feature Attribution

Eslam Zaher, Maciej Trzaskowski, Quan Nguyen, Fred Roosta

PDF

Open Access 1 Repo

TL;DR

This paper enhances Integrated Gradients by incorporating Riemannian geometry to produce more reliable, perceptually intuitive feature attributions that are robust against adversarial attacks in deep learning models.

Contribution

It introduces a manifold-aware adaptation of Integrated Gradients that aligns attribution paths with the data's intrinsic geometry, improving explanation quality and robustness.

Findings

01

Geodesic-based IG produces more intuitive visualizations.

02

Method increases robustness to attributional attacks.

03

Experiments on real-world datasets validate effectiveness.

Abstract

In this paper, we dive into the reliability concerns of Integrated Gradients (IG), a prevalent feature attribution method for black-box deep learning models. We particularly address two predominant challenges associated with IG: the generation of noisy feature visualizations for vision models and the vulnerability to adversarial attributional attacks. Our approach involves an adaptation of path-based feature attribution, aligning the path of attribution more closely to the intrinsic geometry of the data manifold. Our experiments utilise deep generative models applied to several real-world image datasets. They demonstrate that IG along the geodesics conforms to the curved geometry of the Riemannian data manifold, generating more perceptually intuitive explanations and, subsequently, substantially increasing robustness to targeted attributional attacks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eszaher/manifold-integrated-gradients
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Numerical Analysis Techniques · Medical Imaging and Analysis · Medical Image Segmentation Techniques