Convex duality for stochastic shortest path problems in known and unknown environments
Kelli Francis-Staite

TL;DR
This paper explores convex duality in stochastic shortest path problems, extending value iteration operators to unknown environments and analyzing their duals and convergence properties.
Contribution
It introduces new convex optimization formulations for SSPs with unknown parameters and examines their duals, revealing complex convergence behaviors and open research questions.
Findings
Extended value iteration operators relate to convex programs and their duals.
Finite horizon bounds can be applied but may lead to non-monotone operators.
Oscillating behavior observed in certain special cases.
Abstract
This paper studies Stochastic Shortest Path (SSP) problems in known and unknown environments from the perspective of convex optimisation. It first recalls results in the known parameter case, and develops understanding through different proofs. It then focuses on the unknown parameter case, where it studies extended value iteration (EVI) operators. This includes the existing operators used in Rosenberg et al. [26] and Tarbouriech et al. [31] based on the l-1 norm and supremum norm, as well as defining EVI operators corresponding to other norms and divergences, such as the KL-divergence. This paper shows in general how the EVI operators relate to convex programs, and the form of their dual, where strong duality is exhibited. This paper then focuses on whether the bounds from finite horizon research of Neu and Pike-Burke [21] can be applied to these extended value iteration operators in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Water resources management and optimization · Stochastic Gradient Optimization Techniques
