Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning

Xin Liu, Yaran Chen, and Dongbin Zhao

arXiv:2405.11740·cs.LG·May 21, 2024

TL;DR

This paper introduces LFS, a novel self-supervised reinforcement learning method that synthesizes future observations to improve sample efficiency and visual representation learning without relying on rewards or actions.

Contribution

LFS introduces a training-free method for generating synthetic observations, combined with a data selection step, to enrich the auxiliary training data for RL. This helps the agent better understand future states and broadens the method's scope of application.

Findings

1. LFS achieves state-of-the-art sample efficiency in continuous control tasks.
2. LFS improves visual pre-training from action-free video demonstrations.
3. Synthetic observations help the agent anticipate future states effectively.

Abstract

In visual Reinforcement Learning (RL), upstream representation learning largely determines the effectiveness of downstream policy learning. Employing auxiliary tasks allows the agent to enhance its visual representation in a targeted manner, thereby improving the sample efficiency and performance of downstream RL. Prior advanced auxiliary tasks all focus on extracting as much information as possible from limited experience (observations, actions, and rewards) through their different auxiliary objectives. In this article, we instead start from another perspective: auxiliary training data. We aim to improve auxiliary representation learning for RL by enriching the auxiliary training data, proposing Learning Future representation with Synthetic observations (LFS), a novel self-supervised RL approach. Specifically, we propose a training-free method…

Taxonomy

Topics: Gaussian Processes and Bayesian Inference

Methods: Focus