Autonomous sPOMDP Environment Modeling With Partial Model Exploitation

Andrew Wilhelm; Aaron Wilhelm; Garrett Fosdick

arXiv:2012.12203·cs.LG·December 23, 2020

Autonomous sPOMDP Environment Modeling With Partial Model Exploitation

Andrew Wilhelm, Aaron Wilhelm, Garrett Fosdick

PDF

Open Access 1 Repo

TL;DR

This paper introduces an enhanced sPOMDP-based environment modeling algorithm that improves exploration efficiency and scalability, enabling autonomous robots to generate environment representations more effectively in complex settings.

Contribution

It extends the original surprise-based sPOMDP method, significantly boosting training speed and robustness in environment exploration tasks.

Findings

01

Increased training speed by 31-63%

02

Enhanced robustness in less deterministic environments

03

Demonstrated effectiveness in various simulated environments

Abstract

A state space representation of an environment is a classic and yet powerful tool used by many autonomous robotic systems for efficient and often optimal solution planning. However, designing these representations with high performance is laborious and costly, necessitating an effective and versatile tool for autonomous generation of state spaces for autonomous robots. We present a novel state space exploration algorithm by extending the original surprise-based partially-observable Markov Decision Processes (sPOMDP), and demonstrate its effective long-term exploration planning performance in various environments. Through extensive simulation experiments, we show the proposed model significantly increases efficiency and scalability of the original sPOMDP learning techniques with a range of 31-63% gain in training speed while improving robustness in environments with less deterministic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EpiSci/SBL
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Algorithms · Bayesian Modeling and Causal Inference