Approximation and learning with compositional tensor trains
Martin Eigel, Charles Miranda, Anthony Nouy, David Sommer

TL;DR
This paper introduces compositional tensor trains (CTTs), a flexible tensor-based model for approximating multivariate functions, combining the expressivity of deep neural networks with efficient tensor algebra for scalable learning.
Contribution
The paper proposes CTTs, a novel compositional tensor train format that unifies various approximation tools and develops new optimization algorithms inspired by natural gradient descent and optimal control.
Findings
CTTs can encode polynomials, DNNs, and tensor networks with similar complexity.
Numerical experiments show CTTs' high expressivity and efficient optimization.
The proposed algorithms outperform traditional methods in regression tasks.
Abstract
We introduce compositional tensor trains (CTTs) for the approximation of multivariate functions, a class of models obtained by composing low-rank functions in the tensor-train format. This format can encode standard approximation tools, such as (sparse) polynomials, deep neural networks (DNNs) with fixed width, or tensor networks with arbitrary permutation of the inputs, or more general affine coordinate transformations, with similar complexities. This format can be viewed as a DNN with width exponential in the input dimension and structured weights matrices. Compared to DNNs, this format enables controlled compression at the layer level using efficient tensor algebra. On the optimization side, we derive a layerwise algorithm inspired by natural gradient descent, allowing to exploit efficient low-rank tensor algebra. This relies on low-rank estimations of Gram matrices, and tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques
