On the Fundamental Limit of the Stochastic Gradient Identification Algorithm Under Non-Persistent Excitation

Senhan Yao; Longxu Zhang

arXiv:2511.19981·math.OC·May 8, 2026

On the Fundamental Limit of the Stochastic Gradient Identification Algorithm Under Non-Persistent Excitation

Senhan Yao, Longxu Zhang

PDF

TL;DR

This paper proves that stochastic gradient algorithms achieve strong consistency under non-persistent excitation for a wider range of conditions than previously known, nearly resolving a long-standing conjecture.

Contribution

It establishes strong consistency of the stochastic gradient method for all \\alpha in [0, 1), improving upon prior results limited to \\alpha \\le 1/3 and nearly resolving a four-decade-old conjecture.

Findings

01

Strong consistency holds for 0 \\le \\alpha < 1.

02

New algebraic framework yields sharper matrix norm bounds.

03

Nearly resolves the Chen--Guo conjecture for the entire range 1/3 < \\alpha < 1.

Abstract

Stochastic gradient (SG) methods are fundamental to system identification and machine learning, enabling online parameter estimation in large-scale and streaming-data settings. As a classical identification method, the SG algorithm has been extensively studied for decades. Under non-persistent excitation, the strongest currently available convergence result assumes that the condition number of the Fisher information matrix is \(O((\log r_n)^\alpha)\), where \(r_n = 1 + \sum_{i=1}^n \|\varphi_i\|^2\). Existing theory establishes strong consistency when \(\alpha \le 1/3\), whereas the same condition with \(\alpha > 1\) is insufficient to guarantee strong consistency. We prove that strong consistency holds throughout the range \(0 \le \alpha < 1\). The proof is based on a new algebraic framework that yields substantially sharper matrix norm bounds. This result nearly resolves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.