Stochastic regularized majorization-minimization with weakly convex and   multi-convex surrogates

Hanbaek Lyu

arXiv:2201.01652·math.OC·March 22, 2023·1 cites

Stochastic regularized majorization-minimization with weakly convex and multi-convex surrogates

Hanbaek Lyu

PDF

Open Access 1 Repo

TL;DR

This paper extends stochastic majorization-minimization algorithms to weakly convex and multi-convex surrogates, providing convergence rates for non-convex, dependent data settings, and validating with experiments.

Contribution

It introduces a novel extension of SMM allowing weakly convex and multi-convex surrogates with convergence guarantees in non-convex, dependent data scenarios.

Findings

01

Convergence rate of $O(( ext{log} n)^{1+ ext{epsilon}}/n^{1/2})$ for empirical loss.

02

Convergence rate of $O(( ext{log} n)^{1+ ext{epsilon}}/n^{1/4})$ for expected loss.

03

First rate bounds for several optimization methods under dependent data.

Abstract

Stochastic majorization-minimization (SMM) is a class of stochastic optimization algorithms that proceed by sampling new data points and minimizing a recursive average of surrogate functions of an objective function. The surrogates are required to be strongly convex and convergence rate analysis for the general non-convex setting was not available. In this paper, we propose an extension of SMM where surrogates are allowed to be only weakly convex or block multi-convex, and the averaged surrogates are approximately minimized with proximal regularization or block-minimized within diminishing radii, respectively. For the general nonconvex constrained setting with non-i.i.d. data samples, we show that the first-order optimality gap of the proposed algorithm decays at the rate $O ((lo g n)^{1 + ϵ} / n^{1/2})$ for the empirical loss and $O ((lo g n)^{1 + ϵ} / n^{1/4})$ for the expected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

HanbaekLyu/SRMM
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Tensor decomposition and applications

MethodsStochastic Regularized Majorization-Minimization · AdaGrad · AMSGrad · Adam · Stochastic Gradient Descent