Non asymptotic analysis of Adaptive stochastic gradient algorithms and   applications

Antoine Godichon-Baggioni (LPSM (UMR\_8001)); Pierre Tarrago (LPSM; (UMR\_8001))

arXiv:2303.01370·math.OC·March 3, 2023·Trans. Mach. Learn. Res.·1 cites

Non asymptotic analysis of Adaptive stochastic gradient algorithms and applications

Antoine Godichon-Baggioni (LPSM (UMR\_8001)), Pierre Tarrago (LPSM, (UMR\_8001))

PDF

Open Access

TL;DR

This paper provides a non-asymptotic theoretical analysis of adaptive stochastic gradient algorithms like Adagrad and Stochastic Newton, specifically for strongly convex problems, with applications to linear regression and generalized linear models.

Contribution

It offers the first non-asymptotic analysis of adaptive gradient algorithms for strongly convex objectives, extending theoretical understanding beyond classical methods.

Findings

01

Theoretical bounds for Adagrad and Stochastic Newton algorithms.

02

Applications to linear regression and generalized linear models.

03

Insights into algorithm performance in ill-conditioned problems.

Abstract

In stochastic optimization, a common tool to deal sequentially with large sample is to consider the well-known stochastic gradient algorithm. Nevertheless, since the stepsequence is the same for each direction, this can lead to bad results in practice in case of ill-conditionned problem. To overcome this, adaptive gradient algorithms such that Adagrad or Stochastic Newton algorithms should be prefered. This paper is devoted to the non asymptotic analyis of these adaptive gradient algorithms for strongly convex objective. All the theoretical results will be adapted to linear regression and regularized generalized linear model for both Adagrad and Stochastic Newton algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Risk and Portfolio Optimization

MethodsLinear Regression · AdaGrad · Network On Network