Stochastic Gradient Descent for Stochastic Doubly-Nonconvex Composite Optimization
Takayuki Kawashima, Hironori Fujisawa

TL;DR
This paper develops convergence guarantees for stochastic gradient descent applied to doubly-nonconvex composite optimization problems, under weak quasiconvexity assumptions, with improved convergence rates and empirical validation.
Contribution
It introduces the first convergence analysis for stochastic optimization when both composite functions are nonconvex, assuming quasiconvex penalty functions, and achieves optimal convergence rates.
Findings
Convergence properties are established for doubly-nonconvex cases.
Optimal convergence rates are derived with appropriate step size and mini-batch size.
Experimental results show the proposed method outperforms existing algorithms.
Abstract
The stochastic gradient descent has been widely used for solving composite optimization problems in big data analyses. Many algorithms and convergence properties have been developed. The composite functions were convex primarily and gradually nonconvex composite functions have been adopted to obtain more desirable properties. The convergence properties have been investigated, but only when either of composite functions is nonconvex. There is no convergence property when both composite functions are nonconvex, which is named the \textit{doubly-nonconvex} case.To overcome this difficulty, we assume a simple and weak condition that the penalty function is \textit{quasiconvex} and then we obtain convergence properties for the stochastic doubly-nonconvex composite optimization problem.The convergence rate obtained here is of the same order as the existing work.We deeply analyze the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Complexity and Algorithms in Graphs
