Implicit Regularization for Tubal Tensor Factorizations via Gradient Descent
Santhosh Karnik, Anna Veselovska, Mark Iwen, Felix Krahmer

TL;DR
This paper rigorously analyzes how gradient descent implicitly regularizes overparametrized tubal tensor factorizations, favoring low tubal rank solutions, especially relevant for image data, beyond the lazy training regime.
Contribution
It provides the first theoretical proof of implicit regularization for tensor factorizations under gradient descent, extending beyond gradient flow and lazy training regimes.
Findings
Gradient descent biases solutions towards low tubal rank.
Small random initialization is crucial for the implicit regularization.
Numerical simulations confirm the theoretical predictions.
Abstract
We provide a rigorous analysis of implicit regularization in an overparametrized tensor factorization problem beyond the lazy training regime. For matrix factorization problems, this phenomenon has been studied in a number of works. A particular challenge has been to design universal initialization strategies which provably lead to implicit regularization in gradient-descent methods. At the same time, it has been argued by Cohen et. al. 2016 that more general classes of neural networks can be captured by considering tensor factorizations. However, in the tensor case, implicit regularization has only been rigorously established for gradient flow or in the lazy training regime. In this paper, we prove the first tensor result of its kind for gradient descent rather than gradient flow. We focus on the tubal tensor product and the associated notion of low tubal rank, encouraged by the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsCongenital gastrointestinal and neural anomalies · Wind Energy Research and Development · Congenital heart defects research
MethodsSparse Evolutionary Training · Focus
