On Constructing Confidence Region for Model Parameters in Stochastic Gradient Descent via Batch Means
Yi Zhu, Jing Dong

TL;DR
This paper introduces a method for constructing valid confidence regions for stochastic gradient descent parameters using batch means, including theoretical guarantees and extensions for various batch sizes.
Contribution
It develops a new algorithm for confidence region construction that avoids covariance estimation and proves a process-level CLT for Polyak-Ruppert averaging SGD estimators.
Findings
Establishes a process-level functional CLT for SGD estimators.
Proposes an extended batch means method for different batch size specifications.
Provides a practical algorithm for confidence region construction in stochastic optimization.
Abstract
In this paper, we study a simple algorithm to construct asymptotically valid confidence regions for model parameters using the batch means method. The main idea is to cancel out the covariance matrix which is hard/costly to estimate. In the process of developing the algorithm, we establish process-level functional central limit theorem for Polyak-Ruppert averaging based stochastic gradient descent estimators. We also extend the batch means method to accommodate more general batch size specifications.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Markov Chains and Monte Carlo Methods · Statistical Methods and Inference
