When Does Dynamic Preconditioning Preserve the Polyak-Ruppert CLT? A Stabilization Threshold

Sunyoung An; Xiaoming Huo

arXiv:2604.23498·math.ST·April 28, 2026

When Does Dynamic Preconditioning Preserve the Polyak-Ruppert CLT? A Stabilization Threshold

Sunyoung An, Xiaoming Huo

PDF

TL;DR

This paper investigates the conditions under which dynamic preconditioning in stochastic approximation preserves the validity of the Polyak-Ruppert CLT, identifying a critical stabilization rate threshold.

Contribution

It provides an exact decomposition to analyze stabilization effects, establishing a sharp threshold for the stabilization rate ensuring the CLT holds in preconditioned stochastic approximation.

Findings

01

Identifies a stabilization rate threshold > (lpha+1)/2 for CLT validity.

02

Shows that for > (lpha+1)/2, the dynamic remainder vanishes in L^2.

03

Demonstrates that common algorithms like SA-AdaGrad, SA-RMSProp, and SA-ONS satisfy the stabilization condition.

Abstract

Polyak-Ruppert averaging yields an asymptotically normal estimator with sandwich covariance $H^{- 1} S H^{- 1}$ , the foundation of online inference. When the gradient step is preconditioned by a data-driven matrix $P_{t}$ , we ask how fast $P_{t}$ must stabilize for the central limit theorem (CLT) to remain valid. We resolve this via an exact preconditioner-isolating decomposition of the averaged error that confines $P_{t}$ to a dynamic remainder $R_{n}$ , leaving the martingale and Taylor terms preconditioner-free. Let $M_{t} = (P_{t} H)^{- 1}$ denote the effective inverse drift matrix, with $∥ M_{t} - M_{t - 1} ∥_{op} ≲ t^{- β}$ and step-size exponent $α \in (1/2, 1)$ . We identify a stabilization-rate threshold $β > (α + 1) /2$ and prove that, within the class of polynomial rate hypotheses used in our upper bound, it cannot be weakened: the dynamic remainder $n R_{n}$ …

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.