Covariance Estimators for the ROOT-SGD Algorithm in Online Learning

Yiling Luo; Xiaoming Huo; Yajun Mei

arXiv:2212.01259·stat.ML·December 5, 2022

Covariance Estimators for the ROOT-SGD Algorithm in Online Learning

Yiling Luo, Xiaoming Huo, Yajun Mei

PDF

Open Access

TL;DR

This paper develops two covariance estimators for ROOT-SGD in online learning, enabling statistical inference by providing reliable uncertainty quantification without requiring Hessian information.

Contribution

It introduces a plug-in covariance estimator and a Hessian-free estimator for ROOT-SGD, addressing the challenge of unknown asymptotic covariance in online learning.

Findings

01

Plug-in estimator converges at rate O(1/√t)

02

Hessian-free estimator is asymptotically consistent

03

Both estimators facilitate statistical inference in ROOT-SGD

Abstract

Online learning naturally arises in many statistical and machine learning problems. The most widely used methods in online learning are stochastic first-order algorithms. Among this family of algorithms, there is a recently developed algorithm, Recursive One-Over-T SGD (ROOT-SGD). ROOT-SGD is advantageous in that it converges at a non-asymptotically fast rate, and its estimator further converges to a normal distribution. However, this normal distribution has unknown asymptotic covariance; thus cannot be directly applied to measure the uncertainty. To fill this gap, we develop two estimators for the asymptotic covariance of ROOT-SGD. Our covariance estimators are useful for statistical inference in ROOT-SGD. Our first estimator adopts the idea of plug-in. For each unknown component in the formula of the asymptotic covariance, we substitute it with its empirical counterpart. The plug-in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Advanced Bandit Algorithms Research · Data Stream Mining Techniques

MethodsStochastic Gradient Descent