Statistical Inference for Linear Functionals of Online Least-squares SGD when $t \gtrsim d^{1+\delta}$
Bhavya Agrawalla, Krishnakumar Balasubramanian, Promit Ghosal

TL;DR
This paper establishes a Gaussian CLT for linear functionals of online least-squares SGD in high-dimensional regimes with improved computational efficiency, enabling practical online inference with confidence intervals.
Contribution
It introduces a non-asymptotic CLT for SGD in growing dimensions with t d^{1+}, along with an online variance estimator and deviation bounds, extending inference capabilities.
Findings
CLT holds for t d^{1+} in high dimensions.
Proposed online variance estimator is consistent and computationally efficient.
Framework enables fully online, data-driven confidence intervals for SGD iterates.
Abstract
Stochastic Gradient Descent (SGD) has become a cornerstone method in modern data science. However, deploying SGD in high-stakes applications necessitates rigorous quantification of its inherent uncertainty. In this work, we establish \emph{non-asymptotic Berry--Esseen bounds} for linear functionals of online least-squares SGD, thereby providing a Gaussian Central Limit Theorem (CLT) in a \emph{growing-dimensional regime}. Existing approaches to high-dimensional inference for projection parameters, such as~\cite{chang2023inference}, rely on inverting empirical covariance matrices and require at least iterations to achieve finite-sample Berry--Esseen guarantees, rendering them computationally expensive and restrictive in the allowable dimensional scaling. In contrast, we show that a CLT holds for SGD iterates when the number of iterations grows as $t \gtrsim…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods
