Competing with the Empirical Risk Minimizer in a Single Pass

Roy Frostig; Rong Ge; Sham M. Kakade; Aaron Sidford

arXiv:1412.6606·stat.ML·February 26, 2015·30 cites

Competing with the Empirical Risk Minimizer in a Single Pass

Roy Frostig, Rong Ge, Sham M. Kakade, Aaron Sidford

PDF

Open Access

TL;DR

This paper introduces a simple, single-pass streaming algorithm that matches the statistical performance of the empirical risk minimizer while significantly reducing computational resources, making it efficient and scalable for large data problems.

Contribution

The authors present a novel streaming algorithm that achieves ERM-like statistical convergence in linear time and space, with super-polynomial error decay and easy parallelization.

Findings

01

Algorithm runs in linear time with a single data pass.

02

Achieves the same statistical rate as ERM on all problems.

03

Performance improves super-polynomially with initial error.

Abstract

In many estimation problems, e.g. linear and logistic regression, we wish to minimize an unknown objective given only unbiased samples of the objective function. Furthermore, we aim to achieve this using as few samples as possible. In the absence of computational constraints, the minimizer of a sample average of observed data -- commonly referred to as either the empirical risk minimizer (ERM) or the $M$ -estimator -- is widely regarded as the estimation strategy of choice due to its desirable statistical convergence properties. Our goal in this work is to perform as well as the ERM, on every problem, while minimizing the use of computational resources such as running time and space usage. We provide a simple streaming algorithm which, under standard regularity assumptions on the underlying problem, enjoys the following properties: * The algorithm can be implemented in linear time…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Advanced Bandit Algorithms Research