Learning Plannable Representations with Causal InfoGAN
Thanard Kurutach, Aviv Tamar, Ge Yang, Stuart Russell, Pieter Abbeel

TL;DR
This paper introduces a method to learn low-dimensional, causal representations of high-dimensional visual data to generate goal-directed visual plans, enabling plausible sequences of observations for control tasks.
Contribution
It proposes a novel framework combining representation learning and planning by maximizing mutual information, suitable for high-dimensional observations like images.
Findings
Successfully generates plausible visual plans for rope manipulation.
Demonstrates effective low-dimensional causal representations for planning.
Integrates generative models with planning algorithms for goal-directed sequences.
Abstract
In recent years, deep generative models have been shown to 'imagine' convincing high-dimensional observations such as images, audio, and even video, learning directly from raw data. In this work, we ask how to imagine goal-directed visual plans -- a plausible sequence of observations that transition a dynamical system from its current configuration to a desired goal state, which can later be used as a reference trajectory for control. We focus on systems with high-dimensional observations, such as images, and propose an approach that naturally combines representation learning and planning. Our framework learns a generative model of sequential observations, where the generative process is induced by a transition in a low-dimensional planning model, and an additional noise. By maximizing the mutual information between the generated observations and the transition in the planning model, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Advanced Vision and Imaging
