Existence and Finiteness Conditions for Risk-Sensitive Planning: Results and Conjectures
Yaxin Liu, Sven Koenig

TL;DR
This paper establishes conditions under which risk-sensitive planning in Markov decision processes guarantees finite and well-defined optimal expected utilities, addressing a gap in AI and operations research.
Contribution
It derives new conditions ensuring the existence and finiteness of optimal utilities in risk-sensitive MDPs with non-linear utilities, extending understanding in this complex area.
Findings
Conditions for existence of finite optimal utilities
Results primarily apply to stationary policies in mixed reward settings
Conjecture for extension to non-stationary policies
Abstract
Decision-theoretic planning with risk-sensitive planning objectives is important for building autonomous agents or decision-support systems for real-world applications. However, this line of research has been largely ignored in the artificial intelligence and operations research communities since planning with risk-sensitive planning objectives is more complicated than planning with risk-neutral planning objectives. To remedy this situation, we derive conditions that guarantee that the optimal expected utilities of the total plan-execution reward exist and are finite for fully observable Markov decision process models with non-linear utility functions. In case of Markov decision process models with both positive and negative rewards, most of our results hold for stationary policies only, but we conjecture that they can be generalized to non stationary policies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI-based Problem Solving and Planning · Logic, Reasoning, and Knowledge · Multi-Agent Systems and Negotiation
