Scalable Partial Explainability in Neural Networks via Flexible Activation Functions
Schyler C. Sun, Chen Li, Zhuangkun Wei, Antonios Tsourdos, Weisi Guo

TL;DR
This paper introduces a scalable neural network architecture with adaptive activation functions modeled as Gaussian Processes, enabling partial interpretability by explaining neuron roles within the network structure.
Contribution
It proposes a novel scalable NN topology based on the Kolmogorov-Arnold theorem, where activation functions are tunable via Gaussian Processes during training, enhancing interpretability.
Findings
Demonstrated interpretability on a banknote authentication dataset
Showed trade-off between model complexity and interpretability
Potential to serve as an interpretation layer for deep networks
Abstract
Achieving transparency in black-box deep learning algorithms is still an open challenge. High dimensional features and decisions given by deep neural networks (NN) require new algorithms and methods to expose its mechanisms. Current state-of-the-art NN interpretation methods (e.g. Saliency maps, DeepLIFT, LIME, etc.) focus more on the direct relationship between NN outputs and inputs rather than the NN structure and operations itself. In current deep NN operations, there is uncertainty over the exact role played by neurons with fixed activation functions. In this paper, we achieve partially explainable learning model by symbolically explaining the role of activation functions (AF) under a scalable topology. This is carried out by modeling the AFs as adaptive Gaussian Processes (GP), which sit within a novel scalable NN topology, based on the Kolmogorov-Arnold Superposition Theorem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsLocal Interpretable Model-Agnostic Explanations
