Low-Rank Tensor Decompositions for the Theory of Neural Networks
Ricardo Borsoi, Konstantin Usevich, Marianne Clausel

TL;DR
This paper reviews how low-rank tensor decompositions are fundamental in understanding deep neural networks, covering their theoretical implications for expressivity, learnability, generalization, and identifiability.
Contribution
It provides a unified overview of tensor methods' role in deep neural network theory, connecting diverse approaches across disciplines.
Findings
Tensor decompositions have strong uniqueness guarantees.
Polynomial time algorithms exist for tensor decomposition.
Tensor methods explain neural networks' expressivity and learnability.
Abstract
The groundbreaking performance of deep neural networks (NNs) promoted a surge of interest in providing a mathematical basis to deep learning theory. Low-rank tensor decompositions are specially befitting for this task due to their close connection to NNs and their rich theoretical results. Different tensor decompositions have strong uniqueness guarantees, which allow for a direct interpretation of their factors, and polynomial time algorithms have been proposed to compute them. Through the connections between tensors and NNs, such results supported many important advances in the theory of NNs. In this review, we show how low-rank tensor methods--which have been a core tool in the signal processing and machine learning communities--play a fundamental role in theoretically explaining different aspects of the performance of deep NNs, including their expressivity, algorithmic learnability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
