Online greedy identification of linear dynamical systems

Matthieu Blanke; Marc Lelarge

arXiv:2204.06375·stat.ML·April 27, 2023

Online greedy identification of linear dynamical systems

Matthieu Blanke, Marc Lelarge

PDF

Open Access 1 Repo

TL;DR

This paper introduces an online greedy control policy for linear dynamical systems that maximizes information gain per step, offering a low-complexity alternative with competitive performance in limited-trial settings.

Contribution

It proposes a novel online greedy approach for exploration in linear dynamical systems, emphasizing low complexity and effectiveness with few experiments.

Findings

01

Low computational complexity compared to gradient-based methods

02

Experimentally competitive performance in limited trials

03

Effective information maximization per control step

Abstract

This work addresses the problem of exploration in an unknown environment. For linear dynamical systems, we use an experimental design framework and introduce an online greedy policy where the control maximizes the information of the next step. In a setting with a limited number of experimental trials, our algorithm has low complexity and shows experimentally competitive performances compared to more elaborate gradient-based methods.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mb-29/greedy-identification
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Machine Learning and Algorithms · Optimization and Search Problems