Stochastic Proximal Gradient Descent for Nuclear Norm Regularization
Lijun Zhang, Tianbao Yang, Rong Jin, Zhi-Hua Zhou

TL;DR
This paper introduces a stochastic proximal gradient method that significantly reduces space complexity for nuclear norm regularized matrix optimization, achieving efficient convergence rates.
Contribution
It presents a novel stochastic proximal gradient algorithm with $O(m+n)$ space complexity for nuclear norm regularization, improving over previous methods.
Findings
Achieves $O(rac{ ext{log} T}{ ext{sqrt} T})$ convergence for convex functions.
Achieves $O(rac{ ext{log} T}{T})$ convergence for strongly convex functions.
Reduces space complexity from $O(mn)$ to $O(m+n)$.
Abstract
In this paper, we utilize stochastic optimization to reduce the space complexity of convex composite optimization with a nuclear norm regularizer, where the variable is a matrix of size . By constructing a low-rank estimate of the gradient, we propose an iterative algorithm based on stochastic proximal gradient descent (SPGD), and take the last iterate of SPGD as the final solution. The main advantage of the proposed algorithm is that its space complexity is , in contrast, most of previous algorithms have a space complexity. Theoretical analysis shows that it achieves and convergence rates for general convex functions and strongly convex functions, respectively.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Advanced Optimization Algorithms Research
