Factor Augmented High-Dimensional SGD

Shubo Li; Yuefeng Han; Xiufan Yu

arXiv:2605.19291·stat.ML·May 20, 2026

Factor Augmented High-Dimensional SGD

Shubo Li, Yuefeng Han, Xiufan Yu

PDF

TL;DR

This paper introduces Factor-Augmented SGD, a scalable streaming optimization method that incorporates latent factor estimation, with theoretical guarantees for high-dimensional machine learning tasks.

Contribution

It proposes a novel factor-augmented SGD algorithm that operates on streaming data and provides the first theoretical analysis including latent factor estimation error.

Findings

01

Operates purely on streaming data, scalable to large high-dimensional problems.

02

Provides moment convergence analysis under decaying step sizes and mini-batch updates.

03

Establishes a new theoretical framework incorporating latent factor estimation error.

Abstract

Stochastic gradient descent (SGD) is a fundamental optimization algorithm widely used in modern machine learning. In this paper, we propose Factor-Augmented SGD (FSGD), a new optimization method that leverages latent factor representations in high-dimensional learning tasks. Unlike standard two-stage dimension reduction approaches that rely on offline representation learning and full data storage, a key novelty of FSGD is that it operates purely on streaming data, making it scalable to large-scale and high-dimensional problems. Furthermore, we establish the first theoretical framework that explicitly incorporates latent factor estimation error into the analysis of SGD, and provide moment convergence in $ℓ^{s}$ norm under decaying step sizes and mini-batch updates. Our results provide a new foundation for employing SGD reliably and scalably in high-dimensional machine learning systems.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.