Risk-averse formulations of Stochastic Optimal Control and Markov Decision Processes

Alexander Shapiro; Yan Li

arXiv:2505.16651·math.OC·May 23, 2025

Risk-averse formulations of Stochastic Optimal Control and Markov Decision Processes

Alexander Shapiro, Yan Li

PDF

Open Access

TL;DR

This paper explores risk-averse and distributionally robust approaches to stochastic optimal control and MDPs, focusing on risk functionals like VaR, and provides conditions for optimal policies and sample complexity analysis.

Contribution

It introduces a framework for risk-averse modeling in SOC and MDPs, including conditions for optimal policies and analysis of sample complexity with VaR.

Findings

01

Derived necessary and sufficient conditions for non-randomized optimal policies.

02

Analyzed sample complexity of VaR-based optimization problems.

03

Discussed construction of nested risk functionals for risk modeling.

Abstract

The aim of this paper is to investigate risk-averse and distributionally robust modeling of Stochastic Optimal Control (SOC) and Markov Decision Process (MDP). We discuss construction of conditional nested risk functionals, a particular attention is given to the Value-at-Risk measure. Necessary and sufficient conditions for existence of non-randomized optimal policies in the framework of robust SOC and MDP are derived. We also investigate sample complexity of optimization problems involving the Value-at-Risk measure.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Reinforcement Learning in Robotics · Stochastic processes and financial applications