Self-Triggered Markov Decision Processes

Yunhan Huang; Quanyan Zhu

arXiv:2102.08571·eess.SY·February 18, 2021

Self-Triggered Markov Decision Processes

Yunhan Huang, Quanyan Zhu

PDF

Open Access

TL;DR

This paper extends self-triggered control strategies to general Markov Decision Processes, proposing novel policies that optimize resource use and system performance through dynamic programming and theoretical analysis.

Contribution

It introduces a new framework for self-triggered policies in MDPs, including a DP equation with lookahead and methods for balancing resource consumption and performance.

Findings

01

Effective reduction in communication resources demonstrated.

02

Trade-offs between resource use and system performance analyzed.

03

Theoretical foundations for policy computation established.

Abstract

In this paper, we study Markov Decision Processes (MDPs) with self-triggered strategies, where the idea of self-triggered control is extended to more generic MDP models. This extension broadens the application of self-triggering policies to a broader range of systems. We study the co-design problems of the control policy and the triggering policy to optimize two pre-specified cost criteria. The first cost criterion is introduced by incorporating a pre-specified update penalty into the traditional MDP cost criteria to reduce the use of communication resources. Under this criteria, a novel dynamic programming (DP) equation called DP equation with optimized lookahead to proposed to solve for the self-triggering policy under this criteria. The second self-triggering policy is to maximize the triggering time while still guaranteeing a pre-specified level of sub-optimality. Theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Reinforcement Learning in Robotics · Smart Grid Energy Management