Regret Analysis of Multi-task Representation Learning for Linear-Quadratic Adaptive Control
Bruce D. Lee, Leonardo F. Toso, Thomas T. Zhang, James Anderson,, Nikolai Matni

TL;DR
This paper analyzes the regret of multi-task representation learning in linear-quadratic control, showing benefits of multiple agents and sharing representations in dynamic environments with changing goals.
Contribution
It introduces a novel regret analysis for multi-task representation learning in linear-quadratic control, addressing challenges of misspecification and dynamic environments.
Findings
Regret scales as O((\u221a{T/H})) for benign exploration.
Regret scales as O((_u d_ heta) ( T) + T^{3/4}/H^{1/5}) in difficult exploration.
Sharing representations reduces effective task-specific parameters, improving regret bounds.
Abstract
Representation learning is a powerful tool that enables learning over large multitudes of agents or domains by enforcing that all agents operate on a shared set of learned features. However, many robotics or controls applications that would benefit from collaboration operate in settings with changing environments and goals, whereas most guarantees for representation learning are stated for static settings. Toward rigorously establishing the benefit of representation learning in dynamic settings, we analyze the regret of multi-task representation learning for linear-quadratic control. This setting introduces unique challenges. Firstly, we must account for and balance the introduced by an approximate representation. Secondly, we cannot rely on the parameter update schemes of single-task online LQR, for which least-squares often suffices, and must devise a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Control Systems Optimization · Iterative Learning Control Systems · Control Systems and Identification
MethodsSparse Evolutionary Training
