Compression and Interpretability of Deep Neural Networks via Tucker   Tensor Layer: From First Principles to Tensor Valued Back-Propagation

Giuseppe G. Calvi; Ahmad Moniri; Mahmoud Mahfouz; Qibin Zhao; Danilo; P. Mandic

arXiv:1903.06133·cs.LG·January 7, 2020·21 cites

Compression and Interpretability of Deep Neural Networks via Tucker Tensor Layer: From First Principles to Tensor Valued Back-Propagation

Giuseppe G. Calvi, Ahmad Moniri, Mahmoud Mahfouz, Qibin Zhao, Danilo, P. Mandic

PDF

Open Access

TL;DR

This paper introduces the Tucker Tensor Layer (TTL), a tensor-based approach to compress deep neural networks and improve their interpretability by leveraging tensor decompositions and tensor-valued back-propagation.

Contribution

It proposes a novel tensor-valued framework for DNN compression and interpretability, extending back-propagation to tensor derivatives and demonstrating significant parameter reduction.

Findings

01

66.63-fold compression on MNIST and Fashion-MNIST

02

10% training speed-up with simplified VGG-16

03

Enhanced interpretability through feature importance analysis

Abstract

This work aims to help resolve the two main stumbling blocks in the application of Deep Neural Networks (DNNs), that is, the exceedingly large number of trainable parameters and their physical interpretability. This is achieved through a tensor valued approach, based on the proposed Tucker Tensor Layer (TTL), as an alternative to the dense weight-matrices of DNNs. This allows us to treat the weight-matrices of general DNNs as a matrix unfolding of a higher order weight-tensor. By virtue of the compression properties of tensor decompositions, this enables us to introduce a novel and efficient framework for exploiting the multi-way nature of the weight-tensor in order to dramatically reduce the number of DNN parameters. We also derive the tensor valued back-propagation algorithm within the TTL framework, by extending the notion of matrix derivatives to tensors. In this way, the physical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTensor decomposition and applications · Model Reduction and Neural Networks · Computational Physics and Python Applications

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · TuckER · Interpretability