Blackwell optimality in risk-sensitive stochastic control

Marcin Pitera; {\L}ukasz Stettner

arXiv:2601.13136·math.OC·January 21, 2026·CoDIT

Blackwell optimality in risk-sensitive stochastic control

Marcin Pitera, {\L}ukasz Stettner

PDF

Open Access

TL;DR

This paper explores Blackwell optimality in risk-sensitive Markov Decision Processes, analyzing its properties, differences from risk-neutral cases, and its relation to stationarity, supported by illustrative examples.

Contribution

It introduces the concept of Blackwell optimality in risk-sensitive MDPs, examines its properties, and clarifies its relation to stationarity with illustrative examples.

Findings

01

Blackwell optimality can be characterized in risk-sensitive MDPs.

02

Risk-sensitive entropy affects the optimality and stationarity properties.

03

An example illustrates the structural differences between risk-sensitive and risk-neutral cases.

Abstract

In this paper, we consider a discrete-time Markov Decision Process (MDP) on a finite state-action space with a long-run risk-sensitive criterion used as the objective function. We discuss the concept of Blackwell optimality and comment on intricacies which arise when the risk-neutral expectation is replaced by the risk-sensitive entropy. Also, we show the relation between the Blackwell optimality and ultimate stationarity and provide an illustrative example that helps to better understand the structural difference between these two concepts.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Advanced Control Systems Optimization · Stability and Control of Uncertain Systems