TL;DR
ESCORT is a novel particle-based framework that improves the accuracy of belief distribution representations in POMDPs by modeling complex, multi-modal, high-dimensional uncertainties with correlation-aware and temporally consistent updates.
Contribution
It introduces correlation-aware projections and temporal consistency constraints to extend SVGD for better belief approximation in complex POMDP environments.
Findings
ESCORT outperforms existing methods in belief accuracy.
It enhances decision quality in POMDPs.
Demonstrates robustness across synthetic and real domains.
Abstract
In Partially Observable Markov Decision Processes (POMDPs), maintaining and updating belief distributions over possible underlying states provides a principled way to summarize action-observation history for effective decision-making under uncertainty. As environments grow more realistic, belief distributions develop complexity that standard mathematical models cannot accurately capture, creating a fundamental challenge in maintaining representational accuracy. Despite advances in deep learning and probabilistic modeling, existing POMDP belief approximation methods fail to accurately represent complex uncertainty structures such as high-dimensional, multi-modal belief distributions, resulting in estimation errors that lead to suboptimal agent behaviors. To address this challenge, we present ESCORT (Efficient Stein-variational and sliced Consistency-Optimized Representation for Temporal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
