Linear Convergence of SVRG in Statistical Estimation

Chao Qu; Yan Li; Huan Xu

arXiv:1611.01957·stat.ML·July 28, 2017·2 cites

Linear Convergence of SVRG in Statistical Estimation

Chao Qu, Yan Li, Huan Xu

PDF

Open Access

TL;DR

This paper proves that SVRG, a popular optimization algorithm, converges linearly for a broad class of statistical estimators, including non-convex models, without requiring strong convexity.

Contribution

It extends the linear convergence guarantee of SVRG to non-strongly convex and non-convex statistical estimation problems using restricted strong convexity.

Findings

01

SVRG converges linearly for a wide class of estimators.

02

Convergence occurs at the statistical precision of the model.

03

Results apply to non-convex models like corrected Lasso and SCAD.

Abstract

SVRG and its variants are among the state of art optimization algorithms for large scale machine learning problems. It is well known that SVRG converges linearly when the objective function is strongly convex. However this setup can be restrictive, and does not include several important formulations such as Lasso, group Lasso, logistic regression, and some non-convex models including corrected Lasso and SCAD. In this paper, we prove that, for a class of statistical M-estimators covering examples mentioned above, SVRG solves the formulation with {\em a linear convergence rate} without strong convexity or even convexity. Our analysis makes use of {\em restricted strong convexity}, under which we show that SVRG converges linearly to the fundamental statistical precision of the model, i.e., the difference between true unknown parameter $θ^{*}$ and the optimal solution $\hat{θ}$ of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Distributed Sensor Networks and Detection Algorithms · Advanced Bandit Algorithms Research