Scalable Partial Explainability in Neural Networks via Flexible   Activation Functions

Schyler C. Sun; Chen Li; Zhuangkun Wei; Antonios Tsourdos; Weisi Guo

arXiv:2006.06057·cs.LG·June 12, 2020

Scalable Partial Explainability in Neural Networks via Flexible Activation Functions

Schyler C. Sun, Chen Li, Zhuangkun Wei, Antonios Tsourdos, Weisi Guo

PDF

Open Access

TL;DR

This paper introduces a scalable neural network architecture with adaptive activation functions modeled as Gaussian Processes, enabling partial interpretability by explaining neuron roles within the network structure.

Contribution

It proposes a novel scalable NN topology based on the Kolmogorov-Arnold theorem, where activation functions are tunable via Gaussian Processes during training, enhancing interpretability.

Findings

01

Demonstrated interpretability on a banknote authentication dataset

02

Showed trade-off between model complexity and interpretability

03

Potential to serve as an interpretation layer for deep networks

Abstract

Achieving transparency in black-box deep learning algorithms is still an open challenge. High dimensional features and decisions given by deep neural networks (NN) require new algorithms and methods to expose its mechanisms. Current state-of-the-art NN interpretation methods (e.g. Saliency maps, DeepLIFT, LIME, etc.) focus more on the direct relationship between NN outputs and inputs rather than the NN structure and operations itself. In current deep NN operations, there is uncertainty over the exact role played by neurons with fixed activation functions. In this paper, we achieve partially explainable learning model by symbolically explaining the role of activation functions (AF) under a scalable topology. This is carried out by modeling the AFs as adaptive Gaussian Processes (GP), which sit within a novel scalable NN topology, based on the Kolmogorov-Arnold Superposition Theorem…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsLocal Interpretable Model-Agnostic Explanations