Convergent Reinforcement Learning Algorithms for Stochastic Shortest Path Problem

Soumyajit Guin; Shalabh Bhatnagar

arXiv:2508.13963·cs.LG·December 3, 2025

Convergent Reinforcement Learning Algorithms for Stochastic Shortest Path Problem

Soumyajit Guin, Shalabh Bhatnagar

PDF

Open Access

TL;DR

This paper introduces new convergent reinforcement learning algorithms for the stochastic shortest path problem, demonstrating their effectiveness in both tabular and function approximation settings with superior and reliable performance.

Contribution

It presents two novel algorithms for SSP in tabular settings and one for function approximation, all with proven convergence and improved performance.

Findings

01

Tabular algorithms outperform existing RL algorithms in SSP.

02

Function approximation algorithm shows reliable performance.

03

All algorithms demonstrate asymptotic almost-sure convergence.

Abstract

In this paper we propose two algorithms in the tabular setting and an algorithm for the function approximation setting for the Stochastic Shortest Path (SSP) problem. SSP problems form an important class of problems in Reinforcement Learning (RL), as other types of cost-criteria in RL can be formulated in the setting of SSP. We show asymptotic almost-sure convergence for all our algorithms. We observe superior performance of our tabular algorithms compared to other well-known convergent RL algorithms. We further observe reliable performance of our function approximation algorithm compared to other algorithms in the function approximation setting.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSmart Parking Systems Research · Machine Learning and ELM · Optimization and Search Problems