ConstrainedZero: Chance-Constrained POMDP Planning using Learned Probabilistic Failure Surrogates and Adaptive Safety Constraints
Robert J. Moss, Arec Jamgochian, Johannes Fischer, Anthony Corso, and, Mykel J. Kochenderfer

TL;DR
This paper presents ConstrainedZero, a novel planning algorithm for CC-POMDPs that learns neural approximations of value, policy, and failure probability, enabling safe decision-making in uncertain environments with adaptive safety constraints.
Contribution
It introduces ConstrainedZero, combining neural network approximations with adaptive conformal inference for safe POMDP planning, improving safety without reward-cost trade-offs.
Findings
Achieves target safety levels in benchmark tasks.
Effectively balances safety and utility in uncertain environments.
Outperforms heuristic methods in safety-critical scenarios.
Abstract
To plan safely in uncertain environments, agents must balance utility with safety constraints. Safe planning problems can be modeled as a chance-constrained partially observable Markov decision process (CC-POMDP) and solutions often use expensive rollouts or heuristics to estimate the optimal value and action-selection policy. This work introduces the ConstrainedZero policy iteration algorithm that solves CC-POMDPs in belief space by learning neural network approximations of the optimal value and policy with an additional network head that estimates the failure probability given a belief. This failure probability guides safe action selection during online Monte Carlo tree search (MCTS). To avoid overemphasizing search based on the failure estimates, we introduce -MCTS, which uses adaptive conformal inference to update the failure threshold during planning. The approach is tested…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Reliability and Analysis Research · Formal Methods in Verification · Bayesian Modeling and Causal Inference
