Zero-Shot Generalization of Vision-Based RL Without Data Augmentation

Sumeet Batra; Gaurav S. Sukhatme

arXiv:2410.07441 · cs.LG · August 13, 2025

TL;DR

This paper introduces ALDA (Associative Latent DisentAnglement), a neuroscience-inspired model that combines latent disentanglement with associative memory to enable zero-shot generalization in vision-based RL without data augmentation.

Contribution

The paper proposes ALDA, a novel approach combining latent disentanglement and associative memory to achieve zero-shot generalization in RL without data augmentation.

Findings

01

ALDA achieves zero-shot generalization on challenging task variations.

02

Data augmentation is shown to be a form of weak disentanglement.

03

By not relying on data augmentation, the approach avoids computational and data collection costs that grow with the number of task variations.

Abstract

Generalizing vision-based reinforcement learning (RL) agents to novel environments remains a difficult and open challenge. Current trends are to collect large-scale datasets or use data augmentation techniques to prevent overfitting and improve downstream generalization. However, the computational and data collection costs increase exponentially with the number of task variations and can destabilize the already difficult task of training RL agents. In this work, we take inspiration from recent advances in computational neuroscience and propose a model, Associative Latent DisentAnglement (ALDA), that builds on standard off-policy RL towards zero-shot generalization. Specifically, we revisit the role of latent disentanglement in RL and show how combining it with a model of associative memory achieves zero-shot generalization on difficult task variations without relying on data…
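The abstract refers to "a model of associative memory" without giving details in this excerpt. As a rough illustration of the general idea only, the sketch below shows a generic softmax-based associative recall (in the style of modern Hopfield networks): a corrupted query is mapped back toward the closest stored pattern. This is not ALDA's architecture; the function name `retrieve`, the `beta` sharpness parameter, and the toy patterns are all assumptions made for the example.

```python
import math

def retrieve(query, memories, beta=4.0):
    """Generic associative recall (hypothetical helper, not from the paper).

    Scores each stored pattern by its dot product with the query, turns the
    scores into softmax weights, and returns the weighted blend of patterns.
    """
    scores = [beta * sum(q * m for q, m in zip(query, mem)) for mem in memories]
    top = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    weights = [w / total for w in weights]
    dim = len(query)
    return [sum(w * mem[i] for w, mem in zip(weights, memories)) for i in range(dim)]

# Toy stored patterns (illustrative values only).
memories = [
    [1.0, 1.0, -1.0, -1.0],
    [1.0, -1.0, 1.0, -1.0],
    [-1.0, -1.0, -1.0, 1.0],
]

# A corrupted copy of memories[1], standing in for an out-of-distribution input.
noisy_query = [0.9, -0.8, 1.1, -0.7]

recalled = retrieve(noisy_query, memories)

# Index of the stored pattern most similar to the recalled vector.
nearest = max(
    range(len(memories)),
    key=lambda i: sum(r * m for r, m in zip(recalled, memories[i])),
)
```

In this toy setting the recalled vector lands very close to the stored pattern that the noisy query was derived from, which is the sense in which an associative memory can map unfamiliar inputs back to familiar ones. How ALDA applies this to disentangled latents is described in the paper itself, not here.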

Taxonomy

Topics: Image Processing Techniques and Applications · Advanced Vision and Imaging · Advanced Image Processing Techniques