Adaptive debiased SGD in high-dimensional GLMs with streaming data
Ruijian Han, Lan Luo, Yuanhang Luo, Yuanyuan Lin, Jian Huang

TL;DR
This paper presents an adaptive online inference method for high-dimensional generalized linear models that updates estimates in real-time with low computational and storage costs, using a novel adaptive stochastic gradient descent and debiasing technique.
Contribution
It introduces an adaptive stochastic gradient descent algorithm with an online debiasing procedure for high-dimensional GLMs, enabling efficient real-time inference in streaming data.
Findings
Establishes asymptotic normality of the ADL estimator.
Demonstrates computational efficiency through simulations.
Validates the method with a real spam email classification dataset.
Abstract
Online statistical inference facilitates real-time analysis of sequentially collected data, making it different from traditional methods that rely on static datasets. This paper introduces a novel approach to online inference in high-dimensional generalized linear models, where we update regression coefficient estimates and their standard errors upon each new data arrival. In contrast to existing methods that either require full dataset access or large-dimensional summary statistics storage, our method operates in a single-pass mode, significantly reducing both time and space complexity. The core of our methodological innovation lies in an adaptive stochastic gradient descent algorithm tailored for dynamic objective functions, coupled with a novel online debiasing procedure. This allows us to maintain low-dimensional summary statistics while effectively controlling the optimization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · 3D Shape Modeling and Analysis · Advanced Data Storage Technologies
