Illuminating Salient Contributions in Neuron Activation with Attribution Equilibrium
Woo-Jeoung Nam, Seong-Whan Lee

TL;DR
This paper introduces Attribution Equilibrium, a new method for decomposing neural network outputs into detailed attributions, improving interpretability by balancing positive and negative evidence and considering neuron inactivation.
Contribution
We propose Attribution Equilibrium, a novel attribution method that enhances interpretability by balancing evidence and incorporating antagonistic elements and neuron inactivation effects.
Findings
Outperforms existing attribution methods in identifying key input features.
Provides clearer visualization of evidence behind network decisions.
Effective across multiple datasets including PASCAL VOC, MS COCO, and ImageNet.
Abstract
With the remarkable success of deep neural networks, there is a growing interest in research aimed at providing clear interpretations of their decision-making processes. In this paper, we introduce Attribution Equilibrium, a novel method to decompose output predictions into fine-grained attributions, balancing positive and negative relevance for clearer visualization of the evidence behind a network decision. We carefully analyze conventional approaches to decision explanation and present a different perspective on the conservation of evidence. We define the evidence as a gap between positive and negative influences among gradient-derived initial contribution maps. Then, we incorporate antagonistic elements and a user-defined criterion for the degree of positive attribution during propagation. Additionally, we consider the role of inactivated neurons in the propagation rule, thereby…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Machine Learning in Materials Science · Market Dynamics and Volatility
