Stochastic Shortest Path with Energy Constraints in POMDPs
Tom\'a\v{s} Br\'azdil, Krishnendu Chatterjee, Martin Chmel\'ik, Anchit, Gupta, Petr Novotn\'y

TL;DR
This paper introduces a novel algorithm for solving energy-constrained POMDPs, extending traditional stochastic shortest path problems to include energy constraints and producing human-readable policies.
Contribution
It presents a new RTDP-based algorithm for energy-aware POMDPs and an automated machine learning method to extract understandable policies.
Findings
Algorithm performs well on benchmark instances
Produces succinct, human-readable policies
Extends POMDP framework to energy constraints
Abstract
We consider partially observable Markov decision processes (POMDPs) with a set of target states and positive integer costs associated with every transition. The traditional optimization objective (stochastic shortest path) asks to minimize the expected total cost until the target set is reached. We extend the traditional framework of POMDPs to model energy consumption, which represents a hard constraint. The energy levels may increase and decrease with transitions, and the hard constraint requires that the energy level must remain positive in all steps till the target is reached. First, we present a novel algorithm for solving POMDPs with energy levels, developing on existing POMDP solvers and using RTDP as its main method. Our second contribution is related to policy representation. For larger POMDP instances the policies computed by existing solvers are too large to be understandable.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optical Network Technologies · Energy Efficient Wireless Sensor Networks · Smart Parking Systems Research
