No-Regret Prediction in Marginally Stable Systems

Udaya Ghai; Holden Lee; Karan Singh; Cyril Zhang; Yi Zhang

arXiv:2002.02064·cs.LG·November 24, 2020·5 cites

No-Regret Prediction in Marginally Stable Systems

Udaya Ghai, Holden Lee, Karan Singh, Cyril Zhang, Yi Zhang

PDF

Open Access

TL;DR

This paper develops a novel analysis for online prediction in marginally stable linear systems, achieving sublinear regret bounds despite unidentifiability and polynomial state growth, with applications to autoregressive filtering.

Contribution

It introduces a refined regret analysis for marginally stable systems, enabling sublinear regret bounds for online least-squares and autoregressive filtering under challenging conditions.

Findings

01

Online least-squares achieves sublinear regret in marginally stable systems.

02

Logarithmic regret is achievable in partially observed autoregressive models.

03

The analysis handles polynomial state growth despite unidentifiability.

Abstract

We consider the problem of online prediction in a marginally stable linear dynamical system subject to bounded adversarial or (non-isotropic) stochastic perturbations. This poses two challenges. Firstly, the system is in general unidentifiable, so recent and classical results on parameter recovery do not apply. Secondly, because we allow the system to be marginally stable, the state can grow polynomially with time; this causes standard regret bounds in online convex optimization to be vacuous. In spite of these challenges, we show that the online least-squares algorithm achieves sublinear regret (improvable to polylogarithmic in the stochastic setting), with polynomial dependence on the system's parameters. This requires a refined regret analysis, including a structural lemma showing the current state of the system to be a small linear combination of past states, even if the state grows…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Model Reduction and Neural Networks · Control Systems and Identification