Learning Discrete State Abstractions With Deep Variational Inference

Ondrej Biza; Robert Platt; Jan-Willem van de Meent; Lawson L. S.; Wong

arXiv:2003.04300·cs.LG·January 12, 2021·6 cites

Learning Discrete State Abstractions With Deep Variational Inference

Ondrej Biza, Robert Platt, Jan-Willem van de Meent, Lawson L. S., Wong

PDF

Open Access 1 Repo

TL;DR

This paper introduces a deep variational inference approach to learn discrete state abstractions for large state spaces, enabling efficient planning in high-dimensional environments with image states.

Contribution

It proposes a novel end-to-end method combining neural encoders and hidden Markov models to learn discrete state abstractions from high-dimensional data.

Findings

01

Effective in robotic manipulation domains with image states

02

Outperforms previous bisimulation methods in grid-world environments

03

Enables planning for unseen goals using learned abstractions

Abstract

Abstraction is crucial for effective sequential decision making in domains with large state spaces. In this work, we propose an information bottleneck method for learning approximate bisimulations, a type of state abstraction. We use a deep neural encoder to map states onto continuous embeddings. We map these embeddings onto a discrete representation using an action-conditioned hidden Markov model, which is trained end-to-end with the neural network. Our method is suited for environments with high-dimensional states and learns from a stream of experience collected by an agent acting in a Markov decision process. Through this learned discrete abstract model, we can efficiently plan for unseen goals in a multi-goal Reinforcement Learning setting. We test our method in simplified robotic manipulation domains with image states. We also compare it against previous model-based approaches to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ondrejba/discrete_abstractions
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Neural Networks and Applications · Machine Learning and Algorithms