Statistical Inference for Linear Functionals of Online SGD in High-dimensional Linear Regression
Bhavya Agrawalla, Krishnakumar Balasubramanian, Promit Ghosal

TL;DR
This paper develops a high-dimensional CLT for online SGD in overparametrized linear regression, enabling practical, data-driven uncertainty quantification and confidence interval construction in high-dimensional settings.
Contribution
It introduces the first high-dimensional CLT for linear functionals of online SGD iterates, along with an online variance estimator for uncertainty quantification.
Findings
A bias-corrected CLT holds when iterations grow sub-linearly with dimension.
An online variance estimator with high-probability bounds is developed.
The method enables practical, fully online confidence interval construction in high dimensions.
Abstract
Stochastic gradient descent (SGD) has emerged as the quintessential method in a data scientist's toolbox. Using SGD for high-stakes applications requires, however, careful quantification of the associated uncertainty. Towards that end, in this work, we establish a high-dimensional Central Limit Theorem (CLT) for linear functionals of online SGD iterates for overparametrized least-squares regression with non-isotropic Gaussian inputs. We first show that a bias-corrected CLT holds when the number of iterations of the online SGD, , grows sub-linearly in the dimensionality, . In order to use the developed result in practice, we further develop an online approach for estimating the variance term appearing in the CLT, and establish high-probability bounds for the developed online estimator. Together with the CLT result, this provides a fully online and data-driven way to numerically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Random Matrices and Applications
MethodsStochastic Gradient Descent
