When Does Dynamic Preconditioning Preserve the Polyak-Ruppert CLT? A Stabilization Threshold
Sunyoung An, Xiaoming Huo

TL;DR
This paper investigates the conditions under which dynamic preconditioning in stochastic approximation preserves the validity of the Polyak-Ruppert CLT, identifying a critical stabilization rate threshold.
Contribution
It provides an exact decomposition to analyze stabilization effects, establishing a sharp threshold for the stabilization rate ensuring the CLT holds in preconditioned stochastic approximation.
Findings
Identifies a stabilization rate threshold > (lpha+1)/2 for CLT validity.
Shows that for > (lpha+1)/2, the dynamic remainder vanishes in L^2.
Demonstrates that common algorithms like SA-AdaGrad, SA-RMSProp, and SA-ONS satisfy the stabilization condition.
Abstract
Polyak-Ruppert averaging yields an asymptotically normal estimator with sandwich covariance , the foundation of online inference. When the gradient step is preconditioned by a data-driven matrix , we ask how fast must stabilize for the central limit theorem (CLT) to remain valid. We resolve this via an exact preconditioner-isolating decomposition of the averaged error that confines to a dynamic remainder , leaving the martingale and Taylor terms preconditioner-free. Let denote the effective inverse drift matrix, with and step-size exponent . We identify a stabilization-rate threshold and prove that, within the class of polynomial rate hypotheses used in our upper bound, it cannot be weakened: the dynamic remainder …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
