LU decomposition and Toeplitz decomposition of a neural network
Yucong Liu, Simiao Jiao, and Lek-Heng Lim

TL;DR
This paper proves that neural networks can be approximated by structured matrices like LU and Toeplitz, enabling parameter reduction without losing universal approximation capability, with practical experiments confirming efficiency gains.
Contribution
The paper introduces neural network approximations using LU and Toeplitz matrix decompositions, extending universal approximation theorems to structured matrices and convolutional networks.
Findings
Structured matrices reduce parameters significantly
Universal approximation holds with LU and Toeplitz matrices
Experiments show minimal accuracy loss with structured constraints
Abstract
It is well-known that any matrix has an LU decomposition. Less well-known is the fact that it has a 'Toeplitz decomposition' where 's are Toeplitz matrices. We will prove that any continuous function has an approximation to arbitrary accuracy by a neural network that takes the form , i.e., where the weight matrices alternate between lower and upper triangular matrices, for some bias vector , and the activation may be chosen to be essentially any uniformly continuous nonpolynomial function. The same result also holds with Toeplitz matrices, i.e., to arbitrary accuracy, and likewise for Hankel matrices. A consequence of our Toeplitz result…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Neural Networks and Applications · Blind Source Separation Techniques
MethodsTest
