R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning

Sanghyeob Song; Donghyeok Lee; Jinsik Kim; Sungroh Yoon

arXiv:2605.14026·cs.LG·May 15, 2026

R2R2: Robust Representation for Intensive Experience Reuse via Redundancy Reduction in Self-Predictive Learning

Sanghyeob Song, Donghyeok Lee, Jinsik Kim, Sungroh Yoon

PDF

1 Repo

TL;DR

R2R2 introduces a regularization technique for Self-Predictive Learning in reinforcement learning, reducing overfitting and improving performance in data-scarce, high UTD regimes, validated across multiple control tasks.

Contribution

The paper proposes R2R2, a novel regularization method that addresses spectral conflicts in SPL, and demonstrates its effectiveness and orthogonality through integration with SimbaV2, setting new state-of-the-art results.

Findings

01

R2R2 improves TD7 performance by approximately 22% at UTD ratio of 20.

02

R2R2 reduces overfitting in SPL-based reinforcement learning algorithms.

03

Integrating SPL into SimbaV2 enhances its performance, establishing new benchmarks.

Abstract

For reinforcement learning in data-scarce domains like real-world robotics, intensive data reuse enhances efficiency but induces overfitting. While prior works focus on critic bias, representation-level instability in Self-Predictive Learning (SPL) under high Update-to-Data (UTD) regimes remains underexplored. To bridge this gap, we propose Robust Representation via Redundancy Reduction (R2R2), a regularization method within SPL. We theoretically identify that standard zero-centering conflicts with SPL's spectral properties and design a non-centered objective accordingly. We verify R2R2 on SPL-native algorithms like TD7. Furthermore, to demonstrate its orthogonality to prior advancements, we extend the state-of-the-art SimbaV2, which originally lacks SPL, by integrating a tailored SPL module, termed SimbaV2-SPL. Experiments across 11 continuous control tasks confirm that R2R2…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

songsang7/R2R2
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.