The Power of Second Order Methods for Sequence Preconditioning
Annie Marsden, Elad Hazan

TL;DR
This paper demonstrates that combining universal sequence preconditioning with the Vovk-Azoury-Warmuth algorithm achieves near-optimal regret bounds for long-memory dynamical systems, even with complex spectra.
Contribution
It introduces a novel combination of USP and VAW that achieves polylogarithmic regret for marginally-stable linear systems, extending applicability beyond bounded-spectrum cases.
Findings
USP compresses sequences, reducing memory requirements for prediction.
VAW remains robust despite exponential diameter growth.
Combined USP and VAW achieve polylogarithmic regret in complex systems.
Abstract
Sequence prediction methods for dynamical systems with long memory, i.e. marginally stable systems, typically achieve regret that grows polynomially with the hidden dimension of the underlying generative model. Universal Sequence Preconditioning (USP) is a method that compresses any sequence which comes from a linear dynamical system into a "preconditioned" sequence which requires exponentially shorter memory for accurate prediction. However, the preconditioned sequence yields exponentially larger diameters and gradients, hindering USP from unlocking optimal regret bounds. Inspired by the minimum description length principle, we show that the Vovk-Azoury-Warmuth (VAW) algorithm is naturally matched to the USP regime. Indeed, it takes advantage of the memory compression while remaining robust to the exponential explosion of the diameter. We prove that combining USP with VAW achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
