Just Cluster It: An Approach for Exploration in High-Dimensions using Clustering and Pre-Trained Representations
Stefan Sylvius Wagner, Stefan Harmeling

TL;DR
This paper introduces a clustering-based exploration method in reinforcement learning that leverages pre-trained representations, demonstrating effectiveness in 3-D environments and surpassing existing methods in VizDoom and Habitat.
Contribution
It proposes a novel clustering approach using pre-trained and random representations for state counting, highlighting the benefits of pre-trained biases in exploration.
Findings
Random features can effectively cluster to count states in 3-D environments.
Pre-trained DINO representations outperform random features in complex visual environments.
The method surpasses other exploration techniques in VizDoom and Habitat environments.
Abstract
In this paper we adopt a representation-centric perspective on exploration in reinforcement learning, viewing exploration fundamentally as a density estimation problem. We investigate the effectiveness of clustering representations for exploration in 3-D environments, based on the observation that the importance of pixel changes between transitions is less pronounced in 3-D environments compared to 2-D environments, where pixel changes between transitions are typically distinct and significant. We propose a method that performs episodic and global clustering on random representations and on pre-trained DINO representations to count states, i.e, estimate pseudo-counts. Surprisingly, even random features can be clustered effectively to count states in 3-D environments, however when these become visually more complex, pre-trained DINO representations are more effective thanks to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Clustering Algorithms Research
MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention · Residual Connection · Layer Normalization · Dense Connections · Vision Transformer · self-DIstillation with NO labels
