On Scalable Inference with Stochastic Gradient Descent

Yixin Fang; Jinfeng Xu; Lei Yang

arXiv:1707.00192·stat.ML·July 4, 2017·5 cites

On Scalable Inference with Stochastic Gradient Descent

Yixin Fang, Jinfeng Xu, Lei Yang

PDF

Open Access

TL;DR

This paper introduces a scalable inference method for stochastic gradient descent that updates estimates with each new data point and perturbed estimates, enabling practical statistical inference for large datasets.

Contribution

It proposes a novel, scalable inferential procedure for SGD that is easy to implement and applicable to a wide range of models, including GLMs and quantile regression.

Findings

01

Method performs well in simulations.

02

Applicable to large datasets and online updating.

03

Theoretical guarantees established.

Abstract

In many applications involving large dataset or online updating, stochastic gradient descent (SGD) provides a scalable way to compute parameter estimates and has gained increasing popularity due to its numerical convenience and memory efficiency. While the asymptotic properties of SGD-based estimators have been established decades ago, statistical inference such as interval estimation remains much unexplored. The traditional resampling method such as the bootstrap is not computationally feasible since it requires to repeatedly draw independent samples from the entire dataset. The plug-in method is not applicable when there are no explicit formulas for the covariance matrix of the estimator. In this paper, we propose a scalable inferential procedure for stochastic gradient descent, which, upon the arrival of each observation, updates the SGD estimate as well as a large number of randomly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Statistical Methods and Inference

MethodsStochastic Gradient Descent