GRALIS: A Unified Canonical Framework for Linear Attribution Methods via Riesz Representation
Raimondo Fanale

TL;DR
GRALIS introduces a unified mathematical framework for linear attribution methods in deep learning, enabling formal comparison and guarantees across techniques like SHAP, IG, and LIME.
Contribution
It establishes a canonical representation theory for additive, linear, continuous attribution functionals, unifying several popular XAI methods under a rigorous mathematical foundation.
Findings
GRALIS encompasses SHAP, IG, LIME, and linearized GradCAM but excludes nonlinear functionals.
Seven formal theorems provide guarantees like completeness, convergence, and interaction values.
Preliminary validation shows improved faithfulness and consistency on histology images.
Abstract
The main XAI attribution methods for deep neural networks -- GradCAM, SHAP, LIME, Integrated Gradients -- operate on separate theoretical foundations and are not formally comparable. We present GRALIS (Gradient-Riesz Averaged Locally-Integrated Shapley), a mathematical framework establishing a representation theory for attributions: every additive, linear, and continuous attribution functional on L^2(Q,mu) admits a unique canonical representation (Q, w, Delta), proved necessary by the Riesz Representation Theorem. This class encompasses SHAP, IG, LIME and linearized GradCAM, but excludes nonlinear functionals such as standard GradCAM or attention maps. Seven formal theorems provide simultaneous guarantees absent in any individual method: (T1) necessary canonical form; (T2) exact completeness; (T3) Monte Carlo convergence O(1/sqrt(m))+O(1/k); (T4) exact Shapley Interaction Values; (T5)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
