Continuous-time Markov decision processes under the risk-sensitive   average cost criterion

Qingda Wei; Xian Chen

arXiv:1512.06641·math.OC·December 22, 2015·Oper. Res. Lett.

Continuous-time Markov decision processes under the risk-sensitive average cost criterion

Qingda Wei, Xian Chen

PDF

Open Access

TL;DR

This paper investigates continuous-time Markov decision processes with a focus on risk-sensitive average costs, establishing conditions for optimal policies and solving the associated optimality equation.

Contribution

It introduces a new approach to prove the existence of solutions and optimal policies for risk-sensitive average cost criteria in continuous-time MDPs with finite states.

Findings

01

Existence of solutions to the risk-sensitive average cost optimality equation.

02

Existence of optimal deterministic stationary policies.

03

Applicable under mild conditions for bounded costs and transition rates.

Abstract

This paper studies continuous-time Markov decision processes under the risk-sensitive average cost criterion. The state space is a finite set, the action space is a Borel space, the cost and transition rates are bounded, and the risk-sensitivity coefficient can take arbitrary positive real numbers. Under the mild conditions, we develop a new approach to establish the existence of a solution to the risk-sensitive average cost optimality equation and obtain the existence of an optimal deterministic stationary policy.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Risk and Portfolio Optimization · Advanced Control Systems Optimization