Sliding Puzzles Gym: A Scalable Benchmark for State Representation in Visual Reinforcement Learning
Bryan L. M. de Oliveira, Luana G. B. Martins, Bruno Brand\~ao, Murilo L. da Luz, Telma W. de L. Soares, Luckeciano C. Melo

TL;DR
The paper introduces the Sliding Puzzles Gym (SPGym), a scalable benchmark for evaluating visual representation learning in reinforcement learning, enabling systematic assessment of how well algorithms handle visual diversity independently of other factors.
Contribution
It presents a novel benchmark that isolates and scales visual representation challenges in RL, facilitating focused evaluation of representation learning methods.
Findings
Current RL algorithms struggle with increased visual diversity.
Sophisticated representation methods often underperform simpler data augmentation.
Performance degrades with larger image pools, indicating limitations in current approaches.
Abstract
Effective visual representation learning is crucial for reinforcement learning (RL) agents to extract task-relevant information from raw sensory inputs and generalize across diverse environments. However, existing RL benchmarks lack the ability to systematically evaluate representation learning capabilities in isolation from other learning challenges. To address this gap, we introduce the Sliding Puzzles Gym (SPGym), a novel benchmark that transforms the classic 8-tile puzzle into a visual RL task with images drawn from arbitrarily large datasets. SPGym's key innovation lies in its ability to precisely control representation learning complexity through adjustable grid sizes and image pools, while maintaining fixed environment dynamics, observation, and action spaces. This design enables researchers to isolate and scale the visual representation challenge independently of other learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural dynamics and brain function · Advanced Memory and Neural Computing
