Disentangling deep neural networks with rectified linear units using   duality

Chandrashekar Lakshminarayanan; Amit Vikram Singh

arXiv:2110.03403·cs.LG·October 8, 2021

Disentangling deep neural networks with rectified linear units using duality

Chandrashekar Lakshminarayanan, Amit Vikram Singh

PDF

Open Access

TL;DR

This paper introduces deep linearly gated networks (DLGN) as an interpretable alternative to traditional deep neural networks with ReLUs, disentangling computations into interpretable linear components and analyzing their properties.

Contribution

The paper proposes DLGN, a novel interpretable model that disentangles neural network computations into linear components, and analyzes the neural path kernel's invariance properties.

Findings

01

DLGN recovers over 83.5% of DNN performance on CIFAR datasets.

02

Convolution with global pooling provides rotational invariance to the neural path kernel.

03

Skip connections induce ensemble-like structure in the neural path kernel.

Abstract

Despite their success deep neural networks (DNNs) are still largely considered as black boxes. The main issue is that the linear and non-linear operations are entangled in every layer, making it hard to interpret the hidden layer outputs. In this paper, we look at DNNs with rectified linear units (ReLUs), and focus on the gating property (`on/off' states) of the ReLUs. We extend the recently developed dual view in which the computation is broken path-wise to show that learning in the gates is more crucial, and learning the weights given the gates is characterised analytically via the so called neural path kernel (NPK) which depends on inputs and gates. In this paper, we present novel results to show that convolution with global pooling and skip connection provide respectively rotational invariance and ensemble structure to the NPK. To address `black box'-ness, we propose a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques

MethodsConvolution