Stochastic regularized majorization-minimization with weakly convex and multi-convex surrogates
Hanbaek Lyu

TL;DR
This paper extends stochastic majorization-minimization algorithms to weakly convex and multi-convex surrogates, providing convergence rates for non-convex, dependent data settings, and validating with experiments.
Contribution
It introduces a novel extension of SMM allowing weakly convex and multi-convex surrogates with convergence guarantees in non-convex, dependent data scenarios.
Findings
Convergence rate of $O(( ext{log} n)^{1+ ext{epsilon}}/n^{1/2})$ for empirical loss.
Convergence rate of $O(( ext{log} n)^{1+ ext{epsilon}}/n^{1/4})$ for expected loss.
First rate bounds for several optimization methods under dependent data.
Abstract
Stochastic majorization-minimization (SMM) is a class of stochastic optimization algorithms that proceed by sampling new data points and minimizing a recursive average of surrogate functions of an objective function. The surrogates are required to be strongly convex and convergence rate analysis for the general non-convex setting was not available. In this paper, we propose an extension of SMM where surrogates are allowed to be only weakly convex or block multi-convex, and the averaged surrogates are approximately minimized with proximal regularization or block-minimized within diminishing radii, respectively. For the general nonconvex constrained setting with non-i.i.d. data samples, we show that the first-order optimality gap of the proposed algorithm decays at the rate for the empirical loss and for the expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Tensor decomposition and applications
MethodsStochastic Regularized Majorization-Minimization · AdaGrad · AMSGrad · Adam · Stochastic Gradient Descent
