Reward-Aware Proto-Representations in Reinforcement Learning

Hon Tik Tse; Siddarth Chandrasekar; Marlos C. Machado

arXiv:2505.16217·cs.LG·February 3, 2026

Reward-Aware Proto-Representations in Reinforcement Learning

Hon Tik Tse, Siddarth Chandrasekar, Marlos C. Machado

PDF

Open Access 1 Video

TL;DR

This paper introduces the default representation (DR), a reward-aware alternative to the successor representation (SR) in reinforcement learning, providing theoretical foundations and empirical benefits in exploration, transfer learning, and reward shaping.

Contribution

It develops the theoretical basis for the DR, extending it to function approximation, and demonstrates its advantages over SR in various RL tasks.

Findings

01

DR offers reward-aware behavior unlike SR.

02

DR outperforms SR in transfer learning and exploration.

03

Theoretical foundations for DR are established.

Abstract

In recent years, the successor representation (SR) has attracted increasing attention in reinforcement learning (RL), and it has been used to address some of its key challenges, such as exploration, credit assignment, and generalization. The SR can be seen as representing the underlying credit assignment structure of the environment by implicitly encoding its induced transition dynamics. However, the SR is reward-agnostic. In this paper, we discuss a similar representation that also takes into account the reward dynamics of the problem. We study the default representation (DR), a recently proposed representation with limited theoretical (and empirical) analysis. Here, we lay some of the theoretical foundation underlying the DR in the tabular case by (1) deriving dynamic programming and (2) temporal-difference methods to learn the DR, (3) characterizing the basis for the vector space of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Reward-Aware Proto-Representations in Reinforcement Learning· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Explainable Artificial Intelligence (XAI)

MethodsSoftmax · Attention Is All You Need