A useful technique for piecewise deterministic Markov decision processes

Xin Guo; Yi Zhang

arXiv:2006.06827·math.OC·June 15, 2020·Oper. Res. Lett.

A useful technique for piecewise deterministic Markov decision processes

Xin Guo, Yi Zhang

PDF

Open Access

TL;DR

This paper introduces a technique that transforms piecewise deterministic Markov decision processes into auxiliary forms with desirable properties, facilitating analysis and application to risk-sensitive total cost criteria.

Contribution

The paper proposes a novel method to construct auxiliary PDMDPs that preserve performance measures, enabling better analysis of policies with unbounded transition intensities.

Findings

01

Auxiliary PDMDPs have properties not present in original models.

02

Performance measures are preserved between original and auxiliary PDMDPs.

03

Technique applied successfully to risk-sensitive PDMDPs with total cost criteria.

Abstract

This paper presents with justifications a technique that is useful for the study of piecewise deterministic Markov decision processes (PDMDPs) with general policies and unbounded transition intensities. This technique produces an auxiliary PDMDP from the original one. As to be discussed and claified, the auxiliary PDMDP possesses certain desired properties, which may not be possessed by the original PDMDP. Moreover, the performance measure of any policy in the original PDMDP can be replicated by the auxiliary PDMDP for a large class of performance criteria. As an application, we apply this technique to risk-sensitive PDMDPs with total cost criteria.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Formal Methods in Verification · Simulation Techniques and Applications