Self-Normalized Martingales and Uniform Regret Bounds for Linear Regression
Fan Chen, Jian Qian, Alexander Rakhlin, Nikita Zhivotovskiy

TL;DR
This paper investigates the limitations and possibilities of scale-invariant bounds for self-normalized martingales in linear regression, revealing a dimension-dependent dichotomy and providing new regret bounds.
Contribution
It characterizes when scale-invariant bounds are possible, resolves an open problem on uniform regret bounds, and introduces a natural concentration inequality for vector martingales.
Findings
Scale-invariant bounds exist only in dimension 1 without assumptions.
In dimension 1, an $O( ext{log } T)$ doubly-uniform regret algorithm is provided.
For higher dimensions, sublinear regret with scale-invariance is impossible without additional assumptions.
Abstract
Self-normalized martingale inequalities lie at the heart of confidence ellipsoids for online least squares and, more broadly, many bandit and reinforcement-learning results. Yet existing vector and scalar results typically rely on bounded covariates and an explicit regularization matrix, producing bounds that are \emph{not scale-invariant}: although the self-normalized quantity is scale-invariant by definition, its standard upper bounds are not. We characterize when scale-invariant upper bounds on self-normalized martingales are possible. Without further assumptions, we prove that nontrivial scale-invariant bounds exist only in dimension ; moreover, in we obtain scale-invariant self-normalized bounds without any assumptions on the covariates. In contrast, for we show that no nontrivial scale-invariant bound can hold in full generality. We then connect this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
