Theoretical Properties for Neural Networks with Weight Matrices of Low Displacement Rank
Liang Zhao, Siyu Liao, Yanzhi Wang, Zhe Li, Jian Tang, Victor Pan and, Bo Yuan

TL;DR
This paper provides a theoretical analysis of neural networks with low displacement rank matrices, proving their approximation capabilities and efficiency, and introduces a training algorithm for such networks.
Contribution
It offers the first formal theoretical foundation for LDR neural networks, including approximation properties and error bounds, along with a training method.
Findings
LDR neural networks have universal approximation properties.
Error bounds of LDR networks are comparable to general neural networks.
A back-propagation based training algorithm for LDR networks is proposed.
Abstract
Recently low displacement rank (LDR) matrices, or so-called structured matrices, have been proposed to compress large-scale neural networks. Empirical results have shown that neural networks with weight matrices of LDR matrices, referred as LDR neural networks, can achieve significant reduction in space and computational complexity while retaining high accuracy. We formally study LDR matrices in deep learning. First, we prove the universal approximation property of LDR neural networks with a mild condition on the displacement operators. We then show that the error bounds of LDR neural networks are as efficient as general neural networks with both single-layer and multiple-layer structure. Finally, we propose back-propagation based training algorithm for general LDR neural networks.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Face and Expression Recognition · Machine Learning and ELM
