Tensor-on-Tensor Regression: Riemannian Optimization, Over-parameterization, Statistical-computational Gap, and Their Interplay
Yuetian Luo, Anru R. Zhang

TL;DR
This paper introduces Riemannian optimization methods for tensor-on-tensor regression, demonstrating their convergence, adaptability to over-parameterization, and revealing a statistical-computational gap that benefits higher-order tensors.
Contribution
It provides the first convergence guarantees for tensor-on-tensor regression with unknown rank and uncovers the 'blessing' of over-parameterization in higher-order tensor models.
Findings
Riemannian gradient descent converges linearly
Riemannian Gauss-Newton converges quadratically
Over-parameterization is cost-free in sample size for tensors of order three or higher
Abstract
We study the tensor-on-tensor regression, where the goal is to connect tensor responses to tensor covariates with a low Tucker rank parameter tensor/matrix without the prior knowledge of its intrinsic rank. We propose the Riemannian gradient descent (RGD) and Riemannian Gauss-Newton (RGN) methods and cope with the challenge of unknown rank by studying the effect of rank over-parameterization. We provide the first convergence guarantee for the general tensor-on-tensor regression by showing that RGD and RGN respectively converge linearly and quadratically to a statistically optimal estimate in both rank correctly-parameterized and over-parameterized settings. Our theory reveals an intriguing phenomenon: Riemannian optimization methods naturally adapt to over-parameterization without modifications to their implementation. We also prove the statistical-computational gap in scalar-on-tensor…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTensor decomposition and applications · Advanced Neuroimaging Techniques and Applications
MethodsTuckER
