On Acceleration of Gradient-Based Empirical Risk Minimization using   Local Polynomial Regression

Ekaterina Trimbach; Edward Duc Hien Nguyen; and C\'esar A. Uribe

arXiv:2204.07702·math.OC·April 19, 2022

On Acceleration of Gradient-Based Empirical Risk Minimization using Local Polynomial Regression

Ekaterina Trimbach, Edward Duc Hien Nguyen, and C\'esar A. Uribe

PDF

Open Access

TL;DR

This paper introduces accelerated algorithms based on local polynomial interpolation for empirical risk minimization, demonstrating improved theoretical complexity and empirical performance over traditional gradient methods in certain settings.

Contribution

It proposes two accelerated methods for ERM using LPI-GD, achieving better oracle complexity and providing the first empirical evaluation of local polynomial interpolation-based gradient methods.

Findings

01

Accelerated methods reduce oracle complexity to rom or LPI-GD.

02

Empirical results show LPI-GD outperforms GD and SGD in some scenarios.

03

Theoretical analysis confirms acceleration benefits in specific parameter regimes.

Abstract

We study the acceleration of the Local Polynomial Interpolation-based Gradient Descent method (LPI-GD) recently proposed for the approximate solution of empirical risk minimization problems (ERM). We focus on loss functions that are strongly convex and smooth with condition number $σ$ . We additionally assume the loss function is $η$ -H\"older continuous with respect to the data. The oracle complexity of LPI-GD is $\tilde{O} (σ m^{d} lo g (1/ ε))$ for a desired accuracy $ε$ , where $d$ is the dimension of the parameter space, and $m$ is the cardinality of an approximation grid. The factor $m^{d}$ can be shown to scale as $O ((1/ ε)^{d /2 η})$ . LPI-GD has been shown to have better oracle complexity than gradient descent (GD) and stochastic gradient descent (SGD) for certain parameter regimes. We propose two accelerated methods for the ERM…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference

MethodsStochastic Gradient Descent