Palm up: Playing in the Latent Manifold for Unsupervised Pretraining

Hao Liu; Tom Zahavy; Volodymyr Mnih; Satinder Singh

arXiv:2210.10913·cs.LG·October 25, 2022·1 cites

Palm up: Playing in the Latent Manifold for Unsupervised Pretraining

Hao Liu, Tom Zahavy, Volodymyr Mnih, Satinder Singh

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel unsupervised pretraining method that combines static dataset models with dynamic latent space exploration, enabling effective representation learning transferable to vision and reinforcement learning tasks.

Contribution

It proposes a new algorithm that integrates deep generative models with dynamic latent space exploration for unsupervised pretraining and reinforcement learning.

Findings

01

Learned representations transfer well to downstream tasks

02

The method enables exploration in static datasets using latent dynamics

03

Temporal pairing provides natural supervision for representation learning

Abstract

Large and diverse datasets have been the cornerstones of many impressive advancements in artificial intelligence. Intelligent creatures, however, learn by interacting with the environment, which changes the input sensory signals and the state of the environment. In this work, we aim to bring the best of both worlds and propose an algorithm that exhibits an exploratory behavior whilst it utilizes large diverse datasets. Our key idea is to leverage deep generative models that are pretrained on static datasets and introduce a dynamic model in the latent space. The transition dynamics simply mixes an action and a random sampled latent. It then applies an exponential moving average for temporal persistency, the resulting latent is decoded to image using pretrained generator. We then employ an unsupervised reinforcement learning algorithm to explore in this environment and perform…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Palm up: Playing in the Latent Manifold for Unsupervised Pretraining· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition · Domain Adaptation and Few-Shot Learning