Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning

Kyungsoo Kim; Jeongsoo Ha; Yusung Kim

arXiv:2506.05418·cs.CV·June 9, 2025

Self-Predictive Dynamics for Generalization of Vision-based Reinforcement Learning

Kyungsoo Kim, Jeongsoo Ha, Yusung Kim

PDF

Open Access

TL;DR

This paper introduces Self-Predictive Dynamics (SPD), a novel method that enhances the generalization of vision-based reinforcement learning by extracting task-relevant features through predictive representations, especially under unseen and distracting visual conditions.

Contribution

SPD employs parallel augmentations and transition prediction to improve feature extraction and generalization in vision-based reinforcement learning tasks.

Findings

01

SPD outperforms previous methods on MuJoCo visual control tasks.

02

SPD significantly improves generalization in unseen observations.

03

SPD effectively handles distracting elements like shadows and clouds.

Abstract

Vision-based reinforcement learning requires efficient and robust representations of image-based observations, especially when the images contain distracting (task-irrelevant) elements such as shadows, clouds, and light. It becomes more important if those distractions are not exposed during training. We design a Self-Predictive Dynamics (SPD) method to extract task-relevant features efficiently, even in unseen observations after training. SPD uses weak and strong augmentations in parallel, and learns representations by predicting inverse and forward transitions across the two-way augmented versions. In a set of MuJoCo visual control tasks and an autonomous driving task (CARLA), SPD outperforms previous studies in complex observations, and significantly improves the generalization performance for unseen observations. Our code is available at https://github.com/unigary/SPD.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Domain Adaptation and Few-Shot Learning · Robot Manipulation and Learning