Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms
Guy Bresler, Prateek Jain, Dheeraj Nagaraj, Praneeth Netrapalli and, Xian Wu

TL;DR
This paper investigates the fundamental limits and algorithms for least squares regression with data sampled from a Markov chain, revealing that such data makes optimization inherently more challenging than independent data and proposing an experience replay-based algorithm that outperforms standard methods.
Contribution
The paper establishes sharp minimax lower bounds for Markovian data regression and introduces an experience replay algorithm that surpasses traditional SGD in this setting.
Findings
Optimization with Markovian data is strictly harder than with independent data.
A trivial algorithm (SGD-DD) is minimax optimal for Markovian data.
Experience replay-based algorithm achieves better error rates on Markov chains.
Abstract
We study the problem of least squares linear regression where the data-points are dependent and are sampled from a Markov chain. We establish sharp information theoretic minimax lower bounds for this problem in terms of , the mixing time of the underlying Markov chain, under different noise settings. Our results establish that in general, optimization with Markovian data is strictly harder than optimization with independent data and a trivial algorithm (SGD-DD) that works with only one in every samples, which are approximately independent, is minimax optimal. In fact, it is strictly better than the popular Stochastic Gradient Descent (SGD) method with constant step-size which is otherwise minimax optimal in the regression with independent data setting. Beyond a worst case analysis, we investigate whether structured datasets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Markov Chains and Monte Carlo Methods · Advanced Bandit Algorithms Research
MethodsExperience Replay · Linear Regression
