Predictive World Models from Real-World Partial Observations
Robin Karlsson, Alexander Carballo, Keisuke Fujii, Kento Ohtani,, Kazuya Takeda

TL;DR
This paper introduces a hierarchical VAE framework for learning probabilistic predictive world models from partial observations in real-world road environments, enabling better spatial understanding for mobile robots.
Contribution
It presents a novel sequential training method for HVAEs to learn from partial states, advancing predictive modeling in complex real-world settings.
Findings
Achieved 96.21 IoU in spatial structure prediction
Reduced gap to perfect prediction by 62% for stochastic regions
Demonstrated effective continual learning of spatial predictions
Abstract
Cognitive scientists believe adaptable intelligent agents like humans perform reasoning through learned causal mental simulations of agents and environments. The problem of learning such simulations is called predictive world modeling. Recently, reinforcement learning (RL) agents leveraging world models have achieved SOTA performance in game environments. However, understanding how to apply the world modeling approach in complex real-world environments relevant to mobile robots remains an open question. In this paper, we present a framework for learning a probabilistic predictive world model for real-world road environments. We implement the model using a hierarchical VAE (HVAE) capable of predicting a diverse set of fully observed plausible worlds from accumulated sensor observations. While prior HVAE methods require complete states as ground truth for learning, we present a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Reinforcement Learning in Robotics · Topic Modeling
MethodsHierarchical Variational Autoencoder
