Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning

Jingbo Sun; Qichao Zhang; Songjun Tu; Xing Fang; Yupeng Zheng; Haoran Li; Ke Chen; Dongbin Zhao

arXiv:2604.05931·cs.CV·April 8, 2026

Saliency-Guided Representation with Consistency Policy Learning for Visual Unsupervised Reinforcement Learning

Jingbo Sun, Qichao Zhang, Songjun Tu, Xing Fang, Yupeng Zheng, Haoran Li, Ke Chen, Dongbin Zhao

PDF

TL;DR

This paper introduces SRCP, a framework that enhances zero-shot visual unsupervised reinforcement learning by improving successor representations through saliency guidance and consistency policy learning.

Contribution

It proposes a novel decoupled representation learning approach with saliency guidance and a consistency policy for better generalization in visual URL.

Findings

01

SRCP achieves state-of-the-art zero-shot generalization on 16 tasks.

02

The framework improves successor measure accuracy and skill controllability.

03

Compatible with various successor representation methods.

Abstract

Zero-shot unsupervised reinforcement learning (URL) offers a promising direction for building generalist agents capable of generalizing to unseen tasks without additional supervision. Among existing approaches, successor representations (SR) have emerged as a prominent paradigm due to their effectiveness in structured, low-dimensional settings. However, SR methods struggle to scale to high-dimensional visual environments. Through empirical analysis, we identify two key limitations of SR in visual URL: (1) SR objectives often lead to suboptimal representations that attend to dynamics-irrelevant regions, resulting in inaccurate successor measures and degraded task generalization; and (2) these flawed representations hinder SR policies from modeling multi-modal skill-conditioned action distributions and ensuring skill controllability. To address these limitations, we propose…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.