Covariance Estimators for the ROOT-SGD Algorithm in Online Learning
Yiling Luo, Xiaoming Huo, Yajun Mei

TL;DR
This paper develops two covariance estimators for ROOT-SGD in online learning, enabling statistical inference by providing reliable uncertainty quantification without requiring Hessian information.
Contribution
It introduces a plug-in covariance estimator and a Hessian-free estimator for ROOT-SGD, addressing the challenge of unknown asymptotic covariance in online learning.
Findings
Plug-in estimator converges at rate O(1/√t)
Hessian-free estimator is asymptotically consistent
Both estimators facilitate statistical inference in ROOT-SGD
Abstract
Online learning naturally arises in many statistical and machine learning problems. The most widely used methods in online learning are stochastic first-order algorithms. Among this family of algorithms, there is a recently developed algorithm, Recursive One-Over-T SGD (ROOT-SGD). ROOT-SGD is advantageous in that it converges at a non-asymptotically fast rate, and its estimator further converges to a normal distribution. However, this normal distribution has unknown asymptotic covariance; thus cannot be directly applied to measure the uncertainty. To fill this gap, we develop two estimators for the asymptotic covariance of ROOT-SGD. Our covariance estimators are useful for statistical inference in ROOT-SGD. Our first estimator adopts the idea of plug-in. For each unknown component in the formula of the asymptotic covariance, we substitute it with its empirical counterpart. The plug-in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Data Stream Mining Techniques
MethodsStochastic Gradient Descent
