Mask-based Latent Reconstruction for Reinforcement Learning

Tao Yu; Zhizheng Zhang; Cuiling Lan; Yan Lu; Zhibo Chen

arXiv:2201.12096·cs.LG·October 11, 2022·21 cites

Mask-based Latent Reconstruction for Reinforcement Learning

Tao Yu, Zhizheng Zhang, Cuiling Lan, Yan Lu, Zhibo Chen

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces Mask-based Latent Reconstruction (MLR), a self-supervised method that improves state representation learning in deep reinforcement learning from pixels, leading to enhanced sample efficiency and better performance on control benchmarks.

Contribution

The paper proposes a novel mask-based reconstruction approach in the latent space to improve state representations in RL, demonstrating significant sample efficiency gains.

Findings

01

MLR outperforms state-of-the-art RL methods on multiple benchmarks.

02

MLR improves sample efficiency in both continuous and discrete control tasks.

03

Mask-based latent reconstruction enhances the use of context in state representation learning.

Abstract

For deep reinforcement learning (RL) from pixels, learning effective state representations is crucial for achieving high performance. However, in practice, limited experience and high-dimensional inputs prevent effective representation learning. To address this, motivated by the success of mask-based modeling in other research fields, we introduce mask-based reconstruction to promote state representation learning in RL. Specifically, we propose a simple yet effective self-supervised method, Mask-based Latent Reconstruction (MLR), to predict complete state representations in the latent space from the observations with spatially and temporally masked pixels. MLR enables better use of context information when learning state representations to make them more informative, which facilitates the training of RL agents. Extensive experiments show that our MLR significantly improves the sample…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

microsoft/Mask-based-Latent-Reconstruction
pytorchOfficial

Videos

Mask-based Latent Reconstruction for Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Context-Aware Activity Recognition Systems · Explainable Artificial Intelligence (XAI)