MOOSS: Mask-Enhanced Temporal Contrastive Learning for Smooth State Evolution in Visual Reinforcement Learning

Jiarui Sun; M. Ugur Akcal; Wei Zhang; Girish Chowdhary

arXiv:2409.02714 · cs.CV · September 5, 2024

TL;DR

MOOSS introduces a novel temporal contrastive learning framework with graph-based spatial-temporal masking to improve state representation and sample efficiency in visual reinforcement learning.

Contribution

The paper presents a self-supervised, two-component approach that combines graph construction over pixel observations with multi-level contrastive learning to model state evolution.

Findings

01. Outperforms previous methods on multiple control benchmarks.

02. Enhances sample efficiency in visual RL tasks.

03. Effectively models state dynamics through spatial-temporal masking.

Abstract

In visual Reinforcement Learning (RL), learning from pixel-based observations poses significant challenges on sample efficiency, primarily due to the complexity of extracting informative state representations from high-dimensional data. Previous methods such as contrastive-based approaches have made strides in improving sample efficiency but fall short in modeling the nuanced evolution of states. To address this, we introduce MOOSS, a novel framework that leverages a temporal contrastive objective with the help of graph-based spatial-temporal masking to explicitly model state evolution in visual RL. Specifically, we propose a self-supervised dual-component strategy that integrates (1) a graph construction of pixel-based observations for spatial-temporal masking, coupled with (2) a multi-level contrastive learning mechanism that enriches state representations by emphasizing temporal…
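
To make the two ingredients named in the abstract more concrete, here is a minimal sketch, not the authors' implementation. It assumes plain random block masking over time and space as a stand-in for the paper's graph-based masking, a random linear projection as a stand-in encoder, and a standard InfoNCE loss in which step t of the masked view should match step t of the unmasked view while the other steps serve as negatives. The names random_st_mask, toy_encode, and temporal_info_nce are hypothetical and do not come from the paper or its repository.

```python
import numpy as np


def random_st_mask(frames, patch=4, ratio=0.5, rng=None):
    """Zero out random (time, patch-row, patch-col) cells of a (T, H, W) clip.

    Assumption: uniform random masking, NOT the graph-based scheme of the paper.
    """
    rng = np.random.default_rng() if rng is None else rng
    T, H, W = frames.shape
    assert H % patch == 0 and W % patch == 0, "H and W must be multiples of patch"
    # One keep/drop decision per patch and per time step.
    keep = rng.random((T, H // patch, W // patch)) >= ratio
    # Expand the patch-level decisions to pixel resolution.
    keep_px = np.repeat(np.repeat(keep, patch, axis=1), patch, axis=2)
    return frames * keep_px


def toy_encode(frames, proj):
    """Hypothetical stand-in encoder: flatten each frame and project it."""
    return frames.reshape(frames.shape[0], -1) @ proj


def temporal_info_nce(queries, keys, temperature=0.1):
    """InfoNCE over two (T, D) embedding sequences.

    Query t is pulled toward key t; keys at other time steps are negatives.
    """
    q = queries / (np.linalg.norm(queries, axis=1, keepdims=True) + 1e-8)
    k = keys / (np.linalg.norm(keys, axis=1, keepdims=True) + 1e-8)
    logits = q @ k.T / temperature                    # (T, T) similarity matrix
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return float(-np.mean(np.diag(log_prob)))


rng = np.random.default_rng(0)
clip = rng.random((5, 8, 8))             # toy clip: 5 frames of 8x8 pixels
proj = rng.standard_normal((64, 16))     # hypothetical encoder weights
queries = toy_encode(random_st_mask(clip, rng=rng), proj)  # masked view
keys = toy_encode(clip, proj)                              # unmasked view
print(temporal_info_nce(queries, keys))
```

In a real pipeline the encoder would be a learned network and the keys would typically come from a separate target encoder; both choices are left out here to keep the sketch short.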

Code & Models

Repositories

jsun57/mooss (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics

Methods: Contrastive Learning