Stochastic Primal-Dual Proximal ExtraGradient Descent for Compositely Regularized Optimization

Tianyi Lin; Linbo Qiao; Teng Zhang; Jiashi Feng; Bofeng Zhang

arXiv:1708.05978·cs.LG·February 2, 2018

TL;DR

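This paper introduces SPDPEG, a stochastic primal-dual proximal extra-gradient method for compositely regularized optimization problems. It converges at $O(1/\sqrt{t})$ for convex objectives and at $O(\log(t)/t)$ and $O(1/t)$ for strongly convex objectives, and it outperforms competing algorithms in fused logistic regression experiments.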

Contribution

The paper develops a novel stochastic primal-dual extra-gradient algorithm with proven convergence rates for both convex and strongly convex objectives, addressing computational challenges in regularized stochastic minimization.

Findings

01

SPDPEG converges at $O(1/\sqrt{t})$ for convex objectives.

02

SPDPEG achieves $O(\log(t)/t)$ and $O(1/t)$ convergence for strongly convex objectives.

03

Experiments show SPDPEG outperforms competing algorithms in fused logistic regression tasks.

Abstract

We consider a wide range of regularized stochastic minimization problems with two regularization terms, one of which is composed with a linear function. This optimization model abstracts a number of important applications in artificial intelligence and machine learning, such as fused Lasso, fused logistic regression, and a class of graph-guided regularized minimization. The computational challenges of this model are twofold. On the one hand, a closed-form solution of the proximal mapping associated with the composed regularization term or the expected objective function is not available. On the other hand, computing the full gradient of the expectation in the objective is very expensive when the number of input data samples is large. To address these issues, we propose a stochastic variant of extra-gradient type methods, namely \textsf{Stochastic Primal-Dual…
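To make the problem class in the abstract concrete, the following is a minimal sketch of one instance it names, fused logistic regression, written in plain Python. It is not the SPDPEG algorithm. It assumes a logistic loss averaged over samples, an l1 penalty on `x`, and a second l1 penalty composed with a chain-structured difference operator (the "linear function"). The function names, the weights `lam1` and `lam2`, and the chain structure are illustrative assumptions, not taken from the paper.

```python
import math


def log1p_exp(z):
    """Numerically stable log(1 + exp(z))."""
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))


def dot(a, x):
    return sum(a_j * x_j for a_j, x_j in zip(a, x))


def fused_logistic_objective(x, A, b, lam1, lam2):
    """Average logistic loss + lam1 * ||x||_1 + lam2 * ||F x||_1.

    F is assumed to be the first-difference operator, so
    (F x)_j = x[j + 1] - x[j]. Labels b[i] are assumed to be +1 or -1.
    """
    n = len(A)
    loss = sum(log1p_exp(-b_i * dot(a, x)) for a, b_i in zip(A, b)) / n
    l1 = sum(abs(v) for v in x)
    fused = sum(abs(x[j + 1] - x[j]) for j in range(len(x) - 1))
    return loss + lam1 * l1 + lam2 * fused


def minibatch_gradient(x, A, b, batch):
    """Gradient of the smooth (loss) part only, averaged over `batch`.

    Sampling a small batch of indices stands in for the full gradient,
    which is costly when the number of samples is large.
    """
    grad = [0.0] * len(x)
    for i in batch:
        margin = b[i] * dot(A[i], x)
        # -b_i * sigmoid(-margin), computed as exp(-log(1 + exp(margin)))
        weight = -b[i] * math.exp(-log1p_exp(margin))
        for j, a_j in enumerate(A[i]):
            grad[j] += weight * a_j / len(batch)
    return grad
```

The composed penalty is the difficult part: the proximal mapping of `lam2 * ||F x||_1` generally has no closed form, which is the first challenge the abstract describes. The second challenge is the cost of the full gradient, which `minibatch_gradient` sidesteps by averaging over a sampled subset of indices.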

Taxonomy

Methods: Logistic Regression