TL;DR
This paper introduces a novel self-supervised learning approach that combines contrastive learning with dynamic models to improve sample efficiency and generalization in reinforcement learning from images.
Contribution
It proposes a method integrating contrastive learning with nonlinear transition models to enhance Markovianity and invariance in learned embeddings.
Findings
Achieves higher sample efficiency on Deepmind control suite
Demonstrates improved generalization over state-of-the-art contrastive methods
Effectively combines contrastive learning with dynamic modeling
Abstract
Recent methods for reinforcement learning from images use auxiliary tasks to learn image features that are used by the agent's policy or Q-function. In particular, methods based on contrastive learning that induce linearity of the latent dynamics or invariance to data augmentation have been shown to greatly improve the sample efficiency of the reinforcement learning algorithm and the generalizability of the learned embedding. We further argue, that explicitly improving Markovianity of the learned embedding is desirable and propose a self-supervised representation learning method which integrates contrastive learning with dynamic models to synergistically combine these three objectives: (1) We maximize the InfoNCE bound on the mutual information between the state- and action-embedding and the embedding of the next state to induce a linearly predictive embedding without explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsContrastive Learning · InfoNCE
