Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning

Xiaobo Hu; Youfang Lin; Yue Liu; Jinwen Wang; Shuo Wang; Hehe Fan; Kai Lv

arXiv:2312.01915·cs.CV·September 11, 2025·1 cites

Learning Robust Representations via Bidirectional Transition for Visual Reinforcement Learning

Xiaobo Hu, Youfang Lin, Yue Liu, Jinwen Wang, Shuo Wang, Hehe Fan, Kai Lv

PDF

Open Access

TL;DR

This paper introduces a Bidirectional Transition (BiT) model that enhances visual reinforcement learning by predicting environmental transitions both forward and backward, leading to more reliable and generalizable representations across various control tasks.

Contribution

The paper proposes a novel BiT model that leverages bidirectional transition prediction to improve representation reliability and generalization in visual reinforcement learning.

Findings

01

Competitive performance on DeepMind Control suite

02

Effective in robotic manipulation tasks

03

Demonstrates wide applicability in simulation environments

Abstract

Visual reinforcement learning has proven effective in solving control tasks with high-dimensional observations. However, extracting reliable and generalizable representations from vision-based observations remains a central challenge. Inspired by the human thought process, when the representation extracted from the observation can predict the future and trace history, the representation is reliable and accurate in comprehending the environment. Based on this concept, we introduce a Bidirectional Transition (BiT) model, which leverages the ability to bidirectionally predict environmental transitions both forward and backward to extract reliable representations. Our model demonstrates competitive generalization performance and sample efficiency on two settings of the DeepMind Control suite. Additionally, we utilize robotic manipulation and CARLA simulators to demonstrate the wide…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning

MethodsEntropy Regularization · Proximal Policy Optimization · CARLA: An Open Urban Driving Simulator