Stable Tensor Neural Networks for Rapid Deep Learning
Elizabeth Newman, Lior Horesh, Haim Avron, Misha Kilmer

TL;DR
This paper introduces stable tensor neural networks ($t$-NNs) based on tensor algebra that enable rapid learning and improved generalization by leveraging multidimensional data and a matrix-mimetic algebraic structure.
Contribution
The paper proposes a novel stable $t$-NN framework utilizing tensor algebra and $t$-product, enhancing learning speed, parameter efficiency, and generalization in deep neural networks.
Findings
Demonstrated improved performance on MNIST and CIFAR-10 datasets.
Achieved more compact parameterization with better generalizability.
Introduced a tensor algebraic framework that mimics matrix properties.
Abstract
We propose a tensor neural network (-NN) framework that offers an exciting new paradigm for designing neural networks with multidimensional (tensor) data. Our network architecture is based on the -product (Kilmer and Martin, 2011), an algebraic formulation to multiply tensors via circulant convolution. In this -product algebra, we interpret tensors as -linear operators analogous to matrices as linear operators, and hence our framework inherits mimetic matrix properties. To exemplify the elegant, matrix-mimetic algebraic structure of our -NNs, we expand on recent work (Haber and Ruthotto, 2017) which interprets deep neural networks as discretizations of non-linear differential equations and introduces stable neural networks which promote superior generalization. Motivated by this dynamic framework, we introduce a stable -NN which facilitates more rapid learning because…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Model Reduction and Neural Networks · Computational Physics and Python Applications
