Online Actuator Selection and Controller Design for Linear Quadratic   Regulation with Unknown System Model

Lintao Ye; Ming Chi; Zhi-Wei Liu; Vijay Gupta

arXiv:2201.10197·math.OC·September 16, 2024·1 cites

Online Actuator Selection and Controller Design for Linear Quadratic Regulation with Unknown System Model

Lintao Ye, Ming Chi, Zhi-Wei Liu, Vijay Gupta

PDF

Open Access

TL;DR

This paper introduces online algorithms for actuator selection and controller design in linear quadratic regulation with unknown models, achieving sublinear regret bounds in both episodic and non-episodic settings.

Contribution

It presents a novel combined approach using multiarmed bandits and certainty equivalence for simultaneous actuator selection and control in unknown LQR systems.

Findings

01

Achieves -regret in episodic setting

02

Achieves T^{2/3}-regret in non-episodic setting

03

Demonstrates scalability with number of actuators

Abstract

We study the simultaneous actuator selection and controller design problem for linear quadratic regulation with Gaussian noise over a finite horizon of length $T$ and unknown system model. We consider both episodic and non-episodic settings of the problem and propose online algorithms that specify both the sets of actuators to be utilized under a cardinality constraint and the controls corresponding to the sets of selected actuators. In the episodic setting, the interaction with the system breaks into $N$ episodes, each of which restarts from a given initial condition and has length $T$ . In the non-episodic setting, the interaction goes on continuously. Our online algorithms leverage a multiarmed bandit algorithm to select the sets of actuators and a certainty equivalence approach to design the corresponding controls. We show that our online algorithms yield $N$ -regret for the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Smart Grid Energy Management