On Acceleration of Gradient-Based Empirical Risk Minimization using Local Polynomial Regression
Ekaterina Trimbach, Edward Duc Hien Nguyen, and C\'esar A. Uribe

TL;DR
This paper introduces accelerated algorithms based on local polynomial interpolation for empirical risk minimization, demonstrating improved theoretical complexity and empirical performance over traditional gradient methods in certain settings.
Contribution
It proposes two accelerated methods for ERM using LPI-GD, achieving better oracle complexity and providing the first empirical evaluation of local polynomial interpolation-based gradient methods.
Findings
Accelerated methods reduce oracle complexity to rom or LPI-GD.
Empirical results show LPI-GD outperforms GD and SGD in some scenarios.
Theoretical analysis confirms acceleration benefits in specific parameter regimes.
Abstract
We study the acceleration of the Local Polynomial Interpolation-based Gradient Descent method (LPI-GD) recently proposed for the approximate solution of empirical risk minimization problems (ERM). We focus on loss functions that are strongly convex and smooth with condition number . We additionally assume the loss function is -H\"older continuous with respect to the data. The oracle complexity of LPI-GD is for a desired accuracy , where is the dimension of the parameter space, and is the cardinality of an approximation grid. The factor can be shown to scale as . LPI-GD has been shown to have better oracle complexity than gradient descent (GD) and stochastic gradient descent (SGD) for certain parameter regimes. We propose two accelerated methods for the ERM…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference
MethodsStochastic Gradient Descent
