Stability Enforced Bandit Algorithms for Channel Selection in Remote   State Estimation of Gauss-Markov Processes

Alex S. Leong; Daniel E. Quevedo; Wanchun Liu

arXiv:2205.09923·eess.SY·August 7, 2023

Stability Enforced Bandit Algorithms for Channel Selection in Remote State Estimation of Gauss-Markov Processes

Alex S. Leong, Daniel E. Quevedo, Wanchun Liu

PDF

Open Access

TL;DR

This paper develops bandit-based algorithms for remote state estimation of Gauss-Markov processes, ensuring stability and providing regret bounds despite unknown channel statistics.

Contribution

It introduces novel bandit algorithms tailored for channel selection in remote estimation, guaranteeing stability and analyzing regret in the presence of unknown channel parameters.

Findings

01

Algorithms achieve stable state estimation under unknown channels.

02

Regret bounds scale sublinearly with time, indicating learning efficiency.

03

Proposed methods outperform baseline approaches in simulations.

Abstract

In this paper we consider the problem of remote state estimation of a Gauss-Markov process, where a sensor can, at each discrete time instant, transmit on one out of M different communication channels. A key difficulty of the situation at hand is that the channel statistics are unknown. We study the case where both learning of the channel reception probabilities and state estimation is carried out simultaneously. Methods for choosing the channels based on techniques for multi-armed bandits are presented, and shown to provide stability. Furthermore, we define the performance notion of estimation regret, and derive bounds on how it scales with time for the considered algorithms.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Gaussian Processes and Bayesian Inference · Distributed Sensor Networks and Detection Algorithms