Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning

Tao Huang; Jiachen Wang; Xiao Chen

arXiv:2201.07016 · cs.LG · January 19, 2022 · 1 citation

TL;DR

This paper introduces View-Consistent Dynamics (VCD), a novel method that accelerates representation learning in deep RL by enforcing view-consistency on dynamics, leading to state-of-the-art data efficiency in visual control tasks.

Contribution

The paper proposes a Multi-view Markov Decision Process (MMDP) formalism and a view-consistent dynamics model, trained in the latent space, to improve data efficiency in visual RL tasks.

Findings

1. VCD achieves state-of-the-art data efficiency on DeepMind Control Suite.
2. VCD outperforms existing methods on Atari-100k.
3. Enforcing view-consistency improves representation learning in RL.

Abstract

Learning informative representations from image-based observations is of fundamental concern in deep Reinforcement Learning (RL). However, data inefficiency remains a significant barrier to this objective. To overcome this obstacle, we propose to accelerate state representation learning by enforcing view-consistency on the dynamics. First, we introduce a formalism of Multi-view Markov Decision Process (MMDP) that incorporates multiple views of the state. Following the structure of MMDP, our method, View-Consistent Dynamics (VCD), learns state representations by training a view-consistent dynamics model in the latent space, where views are generated by applying data augmentation to states. Empirical evaluation on DeepMind Control Suite and Atari-100k shows that VCD achieves state-of-the-art data efficiency on visual control tasks.
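
The abstract describes the mechanism only at a high level, so the following is a minimal sketch of the idea, not the authors' implementation. It assumes a random-shift augmentation to generate views, a toy linear encoder, a toy linear latent dynamics model, and a loss that sums a prediction term and a cross-view consistency term. All function names (`random_shift`, `encode`, `predict_next`, `view_consistency_loss`) and the exact form of the loss are hypothetical and chosen for illustration.

```python
import random


def random_shift(image, pad=1, rng=random):
    """Return a randomly shifted copy of a 2D image (list of rows).

    Pixels shifted in from outside the frame repeat the nearest edge pixel.
    This stands in for "data augmentation"; the augmentation choice is an assumption.
    """
    h, w = len(image), len(image[0])
    dy = rng.randint(-pad, pad)
    dx = rng.randint(-pad, pad)

    def clamp(v, hi):
        return max(0, min(hi, v))

    return [[image[clamp(y + dy, h - 1)][clamp(x + dx, w - 1)]
             for x in range(w)] for y in range(h)]


def encode(image, enc_w):
    """Toy linear encoder: flatten the image and apply a weight matrix."""
    flat = [p for row in image for p in row]
    return [sum(w * p for w, p in zip(row, flat)) for row in enc_w]


def predict_next(latent, action, dyn_w):
    """Toy linear latent dynamics: next latent from [latent, action]."""
    inp = latent + [action]
    return [sum(w * v for w, v in zip(row, inp)) for row in dyn_w]


def mse(a, b):
    """Mean squared error between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def view_consistency_loss(state, action, next_state, enc_w, dyn_w,
                          pad=1, rng=random):
    """Hypothetical loss: latent prediction error plus cross-view agreement."""
    # Two views of the same state, produced by independent augmentations.
    view_a = random_shift(state, pad, rng)
    view_b = random_shift(state, pad, rng)
    # Predict the next latent from each view with the shared dynamics model.
    pred_a = predict_next(encode(view_a, enc_w), action, dyn_w)
    pred_b = predict_next(encode(view_b, enc_w), action, dyn_w)
    # Target: encoding of an augmented view of the actual next state.
    target = encode(random_shift(next_state, pad, rng), enc_w)
    # Prediction term + consistency term (predictions should agree across views).
    return mse(pred_a, target) + mse(pred_a, pred_b)
```

In a real agent the encoder and dynamics model would be neural networks trained by gradient descent on a loss of this kind alongside the RL objective; the sketch only shows how the loss could be assembled from two augmented views of one transition.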

Taxonomy

Topics: Reinforcement Learning in Robotics · Neural dynamics and brain function