Statistical Inference for Linear Functionals of Online Least-squares SGD when $t \gtrsim d^{1+\delta}$

Bhavya Agrawalla; Krishnakumar Balasubramanian; Promit Ghosal

arXiv:2510.19734·cs.LG·October 23, 2025

Statistical Inference for Linear Functionals of Online Least-squares SGD when $t \gtrsim d^{1+\delta}$

Bhavya Agrawalla, Krishnakumar Balasubramanian, Promit Ghosal

PDF

Open Access

TL;DR

This paper establishes a Gaussian CLT for linear functionals of online least-squares SGD in high-dimensional regimes with improved computational efficiency, enabling practical online inference with confidence intervals.

Contribution

It introduces a non-asymptotic CLT for SGD in growing dimensions with t d^{1+}, along with an online variance estimator and deviation bounds, extending inference capabilities.

Findings

01

CLT holds for t d^{1+} in high dimensions.

02

Proposed online variance estimator is consistent and computationally efficient.

03

Framework enables fully online, data-driven confidence intervals for SGD iterates.

Abstract

Stochastic Gradient Descent (SGD) has become a cornerstone method in modern data science. However, deploying SGD in high-stakes applications necessitates rigorous quantification of its inherent uncertainty. In this work, we establish \emph{non-asymptotic Berry--Esseen bounds} for linear functionals of online least-squares SGD, thereby providing a Gaussian Central Limit Theorem (CLT) in a \emph{growing-dimensional regime}. Existing approaches to high-dimensional inference for projection parameters, such as~\cite{chang2023inference}, rely on inverting empirical covariance matrices and require at least $t ≳ d^{3/2}$ iterations to achieve finite-sample Berry--Esseen guarantees, rendering them computationally expensive and restrictive in the allowable dimensional scaling. In contrast, we show that a CLT holds for SGD iterates when the number of iterations grows as $t \gtrsim…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Gaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods