Explaining Explanations: Axiomatic Feature Interactions for Deep Networks
Joseph D. Janizek, Pascal Sturmfels, Su-In Lee

TL;DR
This paper introduces Integrated Hessians, a novel method extending Integrated Gradients to explain pairwise feature interactions in neural networks, providing more comprehensive insights into model behavior.
Contribution
The paper presents Integrated Hessians, a new, architecture-agnostic method for explaining feature interactions in neural networks, addressing limitations of previous approaches.
Findings
Overcomes theoretical limitations of prior interaction explanations
Faster than existing methods with many features
Outperforms previous methods on quantitative benchmarks
Abstract
Recent work has shown great promise in explaining neural network behavior. In particular, feature attribution methods explain which features were most important to a model's prediction on a given input. However, for many tasks, simply knowing which features were important to a model's prediction may not provide enough insight to understand model behavior. The interactions between features within the model may better help us understand not only the model, but also why certain features are more important than others. In this work, we present Integrated Hessians, an extension of Integrated Gradients that explains pairwise feature interactions in neural networks. Integrated Hessians overcomes several theoretical limitations of previous methods to explain interactions, and unlike such previous methods is not limited to a specific architecture or class of neural network. Additionally, we find…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Machine Learning in Healthcare
