Plan2Vec: Unsupervised Representation Learning by Latent Plans

Ge Yang; Amy Zhang; Ari S. Morcos; Joelle Pineau; Pieter Abbeel; Roberto Calandra

arXiv:2005.03648·cs.LG·May 8, 2020·6 cites

TL;DR

Plan2vec is an unsupervised learning method that constructs a graph from image data to learn global embeddings, enabling efficient goal-conditioned control and planning over long horizons.

Contribution

Plan2vec introduces a graph-based approach to unsupervised representation learning, inspired by reinforcement learning, that improves long-horizon planning and control.

Findings

01 Effective on one simulated and two real-world image datasets

02 Enables reactive planning that is linear in memory and computation

03 Successfully amortizes the planning cost
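
To make the "reactive planning" finding concrete, here is a minimal sketch of a greedy planner. It assumes a globally consistent distance is already available; the names `greedy_plan`, `neighbors`, `embed`, and `distance` are hypothetical, and the hand-written 1-D embedding stands in for a learned one. Each step only scores the current node's neighbors against the goal, so no search over the full state space is needed.

```python
def greedy_plan(start, goal, neighbors, distance, max_steps=50):
    """Step to whichever neighbor looks closest to the goal under `distance`."""
    path = [start]
    current = start
    for _ in range(max_steps):
        if current == goal:
            break
        # Cost per step is linear in the number of neighbors of `current`.
        current = min(neighbors[current], key=lambda n: distance(n, goal))
        path.append(current)
    return path


# Toy chain of five states; adjacency is an assumption for illustration.
neighbors = {0: [1], 1: [0, 2], 2: [1, 3], 3: [2, 4], 4: [3]}

# Stand-in for a learned embedding: position along the chain (hypothetical).
embed = {0: 0.0, 1: 1.0, 2: 2.0, 3: 3.0, 4: 4.0}


def distance(a, b):
    """Distance in the stand-in embedding space."""
    return abs(embed[a] - embed[b])


path = greedy_plan(0, 4, neighbors, distance)
```

If the embedding were only locally accurate, the greedy step could stall in a dead end, which is why a global metric matters for this style of planning.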

Abstract

In this paper we introduce plan2vec, an unsupervised representation learning approach inspired by reinforcement learning. Plan2vec constructs a weighted graph on an image dataset using near-neighbor distances, and then extrapolates this local metric to a global embedding by distilling the path integral over planned paths. When applied to control, plan2vec offers a compute- and sample-efficient way to learn goal-conditioned value estimates that are accurate over long horizons. We demonstrate the effectiveness of plan2vec on one simulated and two challenging real-world image datasets. Experimental results show that plan2vec successfully amortizes the planning cost, enabling reactive planning that is linear in memory and computation complexity rather than exhaustive over the entire state space.
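
The two-stage idea in the abstract (local near-neighbor graph, then path lengths as a global metric) can be illustrated with a small sketch. This is not the authors' implementation: 2-D points stand in for images, Euclidean distance stands in for the local metric, and Dijkstra's algorithm stands in for the planner. The function names and the `radius` parameter are assumptions. The resulting path lengths are the kind of targets an embedding could be regressed onto; the regression step itself is omitted.

```python
import heapq
import math


def build_neighbor_graph(points, radius):
    """Connect every pair of points within `radius`; edge weight = local distance."""
    graph = {i: [] for i in range(len(points))}
    for i in range(len(points)):
        for j in range(i + 1, len(points)):
            d = math.dist(points[i], points[j])
            if d <= radius:
                graph[i].append((j, d))
                graph[j].append((i, d))
    return graph


def shortest_path_lengths(graph, source):
    """Dijkstra: accumulate local edge weights along the best path from `source`."""
    dist = {source: 0.0}
    frontier = [(0.0, source)]
    while frontier:
        d, node = heapq.heappop(frontier)
        if d > dist.get(node, math.inf):
            continue  # stale queue entry
        for nxt, w in graph[node]:
            nd = d + w
            if nd < dist.get(nxt, math.inf):
                dist[nxt] = nd
                heapq.heappush(frontier, (nd, nxt))
    return dist


# Toy data: five points along an L-shaped corridor (assumed for illustration).
points = [(0.0, 0.0), (1.0, 0.0), (2.0, 0.0), (2.0, 1.0), (2.0, 2.0)]
graph = build_neighbor_graph(points, radius=1.0)
targets = shortest_path_lengths(graph, source=0)
```

In this toy layout the path length from the first point to the last is larger than their straight-line distance, because the graph only allows travel along the corridor. That gap is what a purely local metric misses and a path-based global metric captures.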

Code & Models

Repositories

geyang/plan2vec (PyTorch)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Reinforcement Learning in Robotics