Learning Future Representation with Synthetic Observations for Sample-efficient Reinforcement Learning
Xin Liu, Yaran Chen, and Dongbin Zhao

TL;DR
This paper introduces LFS, a novel self-supervised reinforcement learning method that synthesizes future observations to improve sample efficiency and visual representation learning without relying on rewards or actions.
Contribution
LFS proposes a training-free synthetic observation generation and data selection approach to enhance auxiliary data for RL, enabling better future state understanding and wider application scope.
Findings
LFS achieves state-of-the-art sample efficiency in continuous control tasks.
LFS improves visual pre-training from action-free video demonstrations.
Synthetic observations help the agent anticipate future states effectively.
Abstract
In visual Reinforcement Learning (RL), upstream representation learning largely determines the effect of downstream policy learning. Employing auxiliary tasks allows the agent to enhance visual representation in a targeted manner, thereby improving the sample efficiency and performance of downstream RL. Prior advanced auxiliary tasks all focus on how to extract as much information as possible from limited experience (including observations, actions, and rewards) through their different auxiliary objectives, whereas in this article, we first start from another perspective: auxiliary training data. We try to improve auxiliary representation learning for RL by enriching auxiliary training data, proposing \textbf{L}earning \textbf{F}uture representation with \textbf{S}ynthetic observations \textbf{(LFS)}, a novel self-supervised RL approach. Specifically, we propose a training-free method…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference
MethodsFocus
