Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards

Faisal Mohamed; Catherine Ji; Benjamin Eysenbach; Glen Berseth

arXiv:2603.02008 · cs.LG · April 21, 2026

TL;DR

This paper introduces a reinforcement learning exploration method that uses temporal contrastive representations to steer the agent toward states with unpredictable future outcomes, enabling complex behaviors without extrinsic rewards.

Contribution

The paper proposes an exploration approach built on temporal similarities, without explicit distance learning or episodic memory mechanisms.

Findings

01

Enables learning complex behaviors in locomotion, manipulation, and embodied-AI tasks.

02

Learns these behaviors without extrinsic rewards, which such behaviors traditionally require.

03

Builds on temporal similarities rather than explicit distance learning or episodic memory, which simplifies the exploration strategy.

Abstract

Effective exploration in reinforcement learning requires not only tracking where an agent has been, but also understanding how the agent perceives and represents the world. To learn powerful representations, an agent should actively explore states that contribute to its knowledge of the environment. Temporal representations can capture the information necessary to solve a wide range of potential tasks while avoiding the computational cost associated with full state reconstruction. In this paper, we propose an exploration method that leverages temporal contrastive representations to guide exploration, prioritizing states with unpredictable future outcomes. We demonstrate that such representations can enable the learning of complex exploratory behavior in locomotion, manipulation, and embodied-AI tasks, revealing capabilities and behaviors that traditionally require extrinsic rewards. Unlike…
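The abstract's core idea, a temporal contrastive representation whose prediction failures drive exploration, can be illustrated with a minimal NumPy sketch. It is not the paper's implementation. It assumes a standard InfoNCE objective over (state, future state) embedding pairs, and it assumes a reward equal to the negative log-probability of the observed future. The function names and the `phi` and `psi` arrays, which stand in for encoder outputs, are hypothetical.

```python
import numpy as np


def log_softmax(x, axis):
    # Numerically stable log-softmax along the given axis.
    shifted = x - x.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def pairwise_log_probs(phi, psi):
    # phi: (B, d) embeddings of current states (hypothetical encoder output).
    # psi: (B, d) embeddings of future states from the same trajectories.
    # Entry i is the log-probability of selecting future i for state i,
    # with the other futures in the batch acting as negatives.
    logits = phi @ psi.T
    return np.diag(log_softmax(logits, axis=1))


def info_nce_loss(phi, psi):
    # Standard InfoNCE objective: pull each state toward its own future.
    return -pairwise_log_probs(phi, psi).mean()


def intrinsic_reward(phi, psi):
    # ASSUMPTION, not the paper's definition: the bonus is high when the
    # observed future was poorly predicted by the current representation.
    return -pairwise_log_probs(phi, psi)
```

In this sketch the representation is trained by minimizing `info_nce_loss`, while the policy receives `intrinsic_reward` in place of an extrinsic reward, so transitions whose futures the representation cannot yet identify earn a larger bonus.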

Videos

Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards · SlidesLive