Mild Over-Parameterization Benefits Asymmetric Tensor PCA
Shihong Ding, Weicheng Lin, Cong Fang

TL;DR
This paper introduces a novel matrix-parameterized gradient descent algorithm for asymmetric tensor PCA that uses mild over-parameterization to improve sample efficiency and adaptivity under limited memory constraints.
Contribution
It presents the first tractable algorithm for ATPCA with memory costs independent of the tensor dimension, leveraging mild over-parameterization for better performance.
Findings
Achieves near-optimal $d^{ar{k}-2}$ sample complexity with limited memory.
Enhances adaptivity, reducing sample size as vectors become more aligned.
Attains polynomial-time complexity matching the symmetric case in the limit.
Abstract
Asymmetric Tensor PCA (ATPCA) is a prototypical model for studying the trade-offs between sample complexity, computation, and memory. Existing algorithms for this problem typically require at least state memory cost to recover the signal, where is the vector dimension and is the tensor order. We focus on the setting where is even and consider (stochastic) gradient descent-based algorithms under a limited memory budget, which permits only mild over-parameterization of the model. We propose a matrix-parameterized method (in state memory cost) using a novel three-phase alternating-update algorithm to address the problem and demonstrate how mild over-parameterization facilitates learning in two key aspects: (i) it improves sample efficiency, allowing our method to achieve \emph{near-optimal}…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
