Optimizing Explanations by Network Canonization and Hyperparameter Search
Frederik Pahde, Galip \"Umit Yolcu, Alexander Binder, Wojciech Samek,, Sebastian Lapuschkin

TL;DR
This paper introduces model canonization techniques for complex neural networks to improve explainable AI methods and proposes an evaluation framework to quantify their benefits across multiple architectures and tasks.
Contribution
It presents new canonization methods for modern neural network blocks and a framework to evaluate and optimize XAI explanations through hyperparameter search.
Findings
Canonizations improve XAI explanation quality.
Evaluation framework quantifies canonization benefits.
Hyperparameter search enhances explanation effectiveness.
Abstract
Explainable AI (XAI) is slowly becoming a key component for many AI applications. Rule-based and modified backpropagation XAI approaches however often face challenges when being applied to modern model architectures including innovative layer building blocks, which is caused by two reasons. Firstly, the high flexibility of rule-based XAI methods leads to numerous potential parameterizations. Secondly, many XAI methods break the implementation-invariance axiom because they struggle with certain model components, e.g., BatchNorm layers. The latter can be addressed with model canonization, which is the process of re-structuring the model to disregard problematic components without changing the underlying function. While model canonization is straightforward for simple architectures (e.g., VGG, ResNet), it can be challenging for more complex and highly interconnected models (e.g.,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
MethodsDepthwise Convolution · Pointwise Convolution · Depthwise Separable Convolution · Inverted Residual Block · Squeeze-and-Excitation Block · Sigmoid Activation · RMSProp · Global Average Pooling · Average Pooling · Residual Connection
