Disentangling deep neural networks with rectified linear units using duality
Chandrashekar Lakshminarayanan, Amit Vikram Singh

TL;DR
This paper introduces deep linearly gated networks (DLGN) as an interpretable alternative to traditional deep neural networks with ReLUs, disentangling computations into interpretable linear components and analyzing their properties.
Contribution
The paper proposes DLGN, a novel interpretable model that disentangles neural network computations into linear components, and analyzes the neural path kernel's invariance properties.
Findings
DLGN recovers over 83.5% of DNN performance on CIFAR datasets.
Convolution with global pooling provides rotational invariance to the neural path kernel.
Skip connections induce ensemble-like structure in the neural path kernel.
Abstract
Despite their success deep neural networks (DNNs) are still largely considered as black boxes. The main issue is that the linear and non-linear operations are entangled in every layer, making it hard to interpret the hidden layer outputs. In this paper, we look at DNNs with rectified linear units (ReLUs), and focus on the gating property (`on/off' states) of the ReLUs. We extend the recently developed dual view in which the computation is broken path-wise to show that learning in the gates is more crucial, and learning the weights given the gates is characterised analytically via the so called neural path kernel (NPK) which depends on inputs and gates. In this paper, we present novel results to show that convolution with global pooling and skip connection provide respectively rotational invariance and ensemble structure to the NPK. To address `black box'-ness, we propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques
MethodsConvolution
