Learning with SGD and Random Features

Luigi Carratino; Alessandro Rudi; Lorenzo Rosasco

arXiv:1807.06343·stat.ML·January 25, 2019·34 cites

Learning with SGD and Random Features

Luigi Carratino, Alessandro Rudi, Lorenzo Rosasco

PDF

Open Access

TL;DR

This paper analyzes the use of stochastic gradient descent combined with random features for large-scale nonparametric learning, deriving finite sample bounds and illustrating their effectiveness through experiments.

Contribution

It provides a theoretical analysis of stochastic gradient methods with random features in nonparametric learning, including finite sample bounds and parameter insights.

Findings

01

Finite sample bounds for the estimator

02

Parameters like features, iterations, and batch size influence learning

03

Numerical experiments validate theoretical results

Abstract

Sketching and stochastic gradient methods are arguably the most common techniques to derive efficient large scale learning algorithms. In this paper, we investigate their application in the context of nonparametric statistical learning. More precisely, we study the estimator defined by stochastic gradient with mini batches and random features. The latter can be seen as form of nonlinear sketching and used to define approximate kernel methods. The considered estimator is not explicitly penalized/constrained and regularization is implicit. Indeed, our study highlights how different parameters, such as number of features, iterations, step-size and mini-batch size control the learning properties of the solutions. We do this by deriving optimal finite sample bounds, under standard assumptions. The obtained results are corroborated and illustrated by numerical experiments.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace and Expression Recognition