Theoretical Properties for Neural Networks with Weight Matrices of Low   Displacement Rank

Liang Zhao; Siyu Liao; Yanzhi Wang; Zhe Li; Jian Tang; Victor Pan and; Bo Yuan

arXiv:1703.00144·cs.LG·September 25, 2017·19 cites

Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank

Liang Zhao, Siyu Liao, Yanzhi Wang, Zhe Li, Jian Tang, Victor Pan and, Bo Yuan

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of neural networks with low displacement rank matrices, proving their approximation capabilities and efficiency, and introduces a training algorithm for such networks.

Contribution

It offers the first formal theoretical foundation for LDR neural networks, including approximation properties and error bounds, along with a training method.

Findings

01

LDR neural networks have universal approximation properties.

02

Error bounds of LDR networks are comparable to general neural networks.

03

A back-propagation based training algorithm for LDR networks is proposed.

Abstract

Recently low displacement rank (LDR) matrices, or so-called structured matrices, have been proposed to compress large-scale neural networks. Empirical results have shown that neural networks with weight matrices of LDR matrices, referred as LDR neural networks, can achieve significant reduction in space and computational complexity while retaining high accuracy. We formally study LDR matrices in deep learning. First, we prove the universal approximation property of LDR neural networks with a mild condition on the displacement operators. We then show that the error bounds of LDR neural networks are as efficient as general neural networks with both single-layer and multiple-layer structure. Finally, we propose back-propagation based training algorithm for general LDR neural networks.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM