Sample-efficient Reinforcement Learning Representation Learning with Curiosity Contrastive Forward Dynamics Model

Thanh Nguyen; Tung M. Luu; Thang Vu; Chang D. Yoo

arXiv:2103.08255 · cs.LG · January 13, 2023

TL;DR

This paper introduces CCFDM, a sample-efficient reinforcement learning framework that uses contrastive learning and intrinsic rewards from a forward dynamics model to improve exploration, generalization, and performance on pixel-based tasks.

Contribution

The paper proposes CCFDM, a novel framework combining contrastive learning with a forward dynamics model to enhance sample efficiency and exploration in pixel-based RL.

Findings

1. CCFDM outperforms prior pixel-based RL methods on DeepMind Control Suite.
2. Intrinsic rewards from FDM prediction error improve exploration.
3. Contrastive learning enhances generalization in RL agents.
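The second finding ties exploration to the prediction error of the forward dynamics model (FDM). A minimal sketch of that idea follows, assuming the error is a squared L2 distance between predicted and observed next-state features and that the bonus is simply added to the environment reward. The function names, the `scale` coefficient, and the choice of distance are illustrative assumptions, not the paper's exact formulation.

```python
def intrinsic_reward(predicted_next, actual_next, scale=1.0):
    """Curiosity bonus: scaled squared L2 error of a forward-model prediction.

    predicted_next: features the forward model predicted for the next state
    actual_next:    features encoded from the observed next state
    scale:          hypothetical weighting coefficient (an assumption)
    """
    if len(predicted_next) != len(actual_next):
        raise ValueError("feature vectors must have the same length")
    error = sum((p - a) ** 2 for p, a in zip(predicted_next, actual_next))
    return scale * error


def total_reward(extrinsic, predicted_next, actual_next, scale=1.0):
    """Environment reward plus the curiosity bonus (assumed additive)."""
    return extrinsic + intrinsic_reward(predicted_next, actual_next, scale)
```

Under these assumptions a transition the model already predicts perfectly earns no bonus, while a poorly predicted one earns a large bonus, which is what pushes the agent toward unfamiliar states.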

Abstract

Developing a reinforcement learning (RL) agent capable of performing complex control tasks directly from high-dimensional observations such as raw pixels remains a challenge, and efforts continue towards improving sample efficiency and generalization. This paper considers a learning framework, the Curiosity Contrastive Forward Dynamics Model (CCFDM), for achieving more sample-efficient RL based directly on raw pixels. CCFDM incorporates a forward dynamics model (FDM) and performs contrastive learning to train its deep convolutional neural network-based image encoder (IE) to extract the spatial and temporal information conducive to more sample-efficient RL. In addition, during training, CCFDM provides intrinsic rewards, produced from the FDM prediction error, that encourage the curiosity of the RL agent and improve exploration. The diverse and less-repetitive observations…
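The contrastive objective mentioned in the abstract can be illustrated with an InfoNCE-style loss. The sketch below assumes that each query is an FDM-predicted next-state feature vector, that the matching key is the encoded actual next observation at the same batch index, and that the other keys in the batch act as negatives. The dot-product similarity, the `temperature` parameter, and the function names are assumptions made for illustration and may differ from the similarity measure used in the paper.

```python
import math


def dot(u, v):
    """Dot product of two equal-length feature vectors."""
    return sum(a * b for a, b in zip(u, v))


def info_nce_loss(queries, keys, temperature=1.0):
    """Mean InfoNCE loss over a batch.

    queries[i] is treated as the positive match for keys[i];
    every other key in the batch serves as a negative.
    """
    if not queries or len(queries) != len(keys):
        raise ValueError("queries and keys must be non-empty and equal in length")
    total = 0.0
    for i, q in enumerate(queries):
        logits = [dot(q, k) / temperature for k in keys]
        # Numerically stable log-sum-exp over all keys.
        m = max(logits)
        log_sum_exp = m + math.log(sum(math.exp(l - m) for l in logits))
        # Cross-entropy with the positive key at index i.
        total += log_sum_exp - logits[i]
    return total / len(queries)
```

In this sketch the loss is small when each predicted feature is most similar to its own next-state key, and large when it is more similar to another sample's key, so minimizing it trains the encoder and the forward model to agree on temporally consistent features.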

Code & Models

Repositories

thanhkaist/CCFDM1
pytorch

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Fetal and Pediatric Neurological Disorders

Methods: Contrastive Learning