Suboptimal coverings for continuous spaces of control tasks

James A. Preiss; Gaurav S. Sukhatme

arXiv:2104.11865·math.OC·April 27, 2021

Suboptimal coverings for continuous spaces of control tasks

James A. Preiss, Gaurav S. Sukhatme

PDF

Open Access 1 Repo

TL;DR

This paper introduces the {}-suboptimal covering number to quantify the complexity of multi-task control problems with infinite task sets, aiding in understanding the function class expressiveness needed for learning-based control.

Contribution

It defines a new measure called the {}-suboptimal covering number for continuous control spaces and analyzes its properties for linear quadratic regulator (LQR) problems.

Findings

01

Logarithmic dependence on the space 'breadth' for scalar cases.

02

Constructive covers can efficiently approximate multi-task LQR problems.

03

Experiments visualize lower bounds and system behaviors for the proposed covering measure.

Abstract

We propose the {\alpha}-suboptimal covering number to characterize multi-task control problems where the set of dynamical systems and/or cost functions is infinite, analogous to the cardinality of finite task sets. This notion may help quantify the function class expressiveness needed to represent a good multi-task policy, which is important for learning-based control methods that use parameterized function approximation. We study suboptimal covering numbers for linear dynamical systems with quadratic cost (LQR problems) and construct a class of multi-task LQR problems amenable to analysis. For the scalar case, we show logarithmic dependence on the "breadth" of the space. For the matrix case, we present experiments 1) measuring the efficiency of a particular constructive cover, and 2) visualizing the behavior of two candidate systems for the lower bound.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jpreiss/suboptimal_coverings
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research