Autonomous sPOMDP Environment Modeling With Partial Model Exploitation
Andrew Wilhelm, Aaron Wilhelm, Garrett Fosdick

TL;DR
This paper introduces an enhanced sPOMDP-based environment modeling algorithm that improves exploration efficiency and scalability, enabling autonomous robots to generate environment representations more effectively in complex settings.
Contribution
It extends the original surprise-based sPOMDP method, significantly boosting training speed and robustness in environment exploration tasks.
Findings
Increased training speed by 31-63%
Enhanced robustness in less deterministic environments
Demonstrated effectiveness in various simulated environments
Abstract
A state space representation of an environment is a classic and yet powerful tool used by many autonomous robotic systems for efficient and often optimal solution planning. However, designing these representations with high performance is laborious and costly, necessitating an effective and versatile tool for autonomous generation of state spaces for autonomous robots. We present a novel state space exploration algorithm by extending the original surprise-based partially-observable Markov Decision Processes (sPOMDP), and demonstrate its effective long-term exploration planning performance in various environments. Through extensive simulation experiments, we show the proposed model significantly increases efficiency and scalability of the original sPOMDP learning techniques with a range of 31-63% gain in training speed while improving robustness in environments with less deterministic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Bayesian Modeling and Causal Inference
