Always Keep Your Promises: A Model-Agnostic Attribution Algorithm for Neural Networks
Kevin Lee, Duncan Smith-Halverson, Pablo Millan Arias

TL;DR
DynamicLRP is a novel, model-agnostic attribution method for neural networks that operates at the tensor operation level, providing broad applicability and maintaining theoretical guarantees without requiring model modifications.
Contribution
We introduce DynamicLRP, a framework that achieves architecture-agnostic attribution by decomposing at the operation level and using a Promise System for deferred activation resolution.
Findings
Matches or exceeds specialized LRP implementations in faithfulness.
Achieves 99.92% node coverage across diverse architectures.
Operates efficiently on models with up to 1 billion parameters.
Abstract
Layer-wise Relevance Propagation (LRP) provides principled attribution for neural networks through conservation properties and foundations in Deep Taylor Decomposition. However, existing implementations operate at the module level, requiring architecture-specific propagation rules and model modifications. These limit the generality of target model and sustainability of implementations as architectures evolve. We introduce DynamicLRP, a model-agnostic LRP framework operating at the tensor operation level. By decomposing attribution to individual operations within computation graphs and introducing a novel mechanism for deferred activation resolution, named the Promise System, our approach achieves true architecture agnosticity while maintaining LRP's theoretical guarantees. This design operates independently of backpropagation machinery, requiring no model modification, enabling…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Advanced Graph Neural Networks · Generative Adversarial Networks and Image Synthesis
