Integrating Contrastive Learning with Dynamic Models for Reinforcement Learning from Images

Bang You; Oleg Arenz; Youping Chen; Jan Peters

arXiv:2203.01810 · cs.LG · March 4, 2022

TL;DR

This paper introduces a novel self-supervised learning approach that combines contrastive learning with dynamic models to improve sample efficiency and generalization in reinforcement learning from images.

Contribution

It proposes a method integrating contrastive learning with nonlinear transition models to enhance Markovianity and invariance in learned embeddings.

Findings

01 Achieves higher sample efficiency on the DeepMind Control Suite

02 Demonstrates improved generalization over state-of-the-art contrastive methods

03 Effectively combines contrastive learning with dynamic modeling

Abstract

Recent methods for reinforcement learning from images use auxiliary tasks to learn image features that are used by the agent's policy or Q-function. In particular, methods based on contrastive learning that induce linearity of the latent dynamics or invariance to data augmentation have been shown to greatly improve the sample efficiency of the reinforcement learning algorithm and the generalizability of the learned embedding. We further argue that explicitly improving the Markovianity of the learned embedding is desirable, and we propose a self-supervised representation learning method that integrates contrastive learning with dynamic models to synergistically combine these three objectives: (1) We maximize the InfoNCE bound on the mutual information between the state- and action-embedding and the embedding of the next state to induce a linearly predictive embedding without explicitly…
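As a rough illustration of objective (1), the sketch below computes a generic InfoNCE loss and the corresponding lower bound on mutual information, log N minus the loss, for a batch of N pairs. It is not taken from the paper or its repository: the function names, the dot-product critic, and the use of plain Python lists are assumptions made for illustration. Here `queries` stands in for embeddings derived from the state and action, and `keys` for embeddings of the next state, with `keys[i]` being the positive sample for `queries[i]` and the remaining keys in the batch serving as negatives.

```python
import math


def info_nce_loss(queries, keys):
    """Mean InfoNCE loss over a batch; keys[i] is the positive for queries[i].

    Hypothetical helper: a dot-product critic is assumed here, which may
    differ from the critic used in the paper.
    """
    total = 0.0
    for i, q in enumerate(queries):
        # Critic scores of query i against every key in the batch.
        scores = [sum(a * b for a, b in zip(q, k)) for k in keys]
        # Numerically stable log-sum-exp over all candidates.
        m = max(scores)
        log_norm = m + math.log(sum(math.exp(s - m) for s in scores))
        # Cross-entropy of picking the positive key among all keys.
        total += log_norm - scores[i]
    return total / len(queries)


def mi_lower_bound(queries, keys):
    """InfoNCE lower bound on mutual information: log(N) - loss."""
    return math.log(len(queries)) - info_nce_loss(queries, keys)


if __name__ == "__main__":
    # Toy batch: each query is well aligned with its own key only.
    q = [[5.0, 0.0], [0.0, 5.0]]
    k = [[5.0, 0.0], [0.0, 5.0]]
    print("loss:", info_nce_loss(q, k))
    print("MI lower bound:", mi_lower_bound(q, k))
```

Minimizing the loss tightens the bound, which can never exceed log N for a batch of size N. When every key scores the same for a query, the loss equals log N and the bound is zero.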

Code & Models

Repositories

bangyou01/pytorch-cody
PyTorch · Official

Taxonomy

Methods: Contrastive Learning · InfoNCE