Maximum Total Correlation Reinforcement Learning

Bang You; Puze Liu; Huaping Liu; Jan Peters; Oleg Arenz

arXiv:2505.16734·cs.LG·May 23, 2025

Maximum Total Correlation Reinforcement Learning

Bang You, Puze Liu, Huaping Liu, Jan Peters, Oleg Arenz

PDF

Open Access 1 Repo

TL;DR

This paper introduces a reinforcement learning method that maximizes total correlation in trajectories to promote simplicity, resulting in more robust and efficient policies in simulated robot tasks.

Contribution

It proposes a novel RL algorithm that maximizes total correlation within trajectories, enhancing policy simplicity and robustness beyond existing regularization techniques.

Findings

01

Policies induce periodic and compressible trajectories

02

Method improves robustness to noise and dynamic changes

03

Enhances performance in original tasks

Abstract

Simplicity is a powerful inductive bias. In reinforcement learning, regularization is used for simpler policies, data augmentation for simpler representations, and sparse reward functions for simpler objectives, all that, with the underlying motivation to increase generalizability and robustness by focusing on the essentials. Supplementary to these techniques, we investigate how to promote simple behavior throughout the episode. To that end, we introduce a modification of the reinforcement learning problem that additionally maximizes the total correlation within the induced trajectories. We propose a practical algorithm that optimizes all models, including policy and state representation, based on a lower-bound approximation. In simulated robot environments, our method naturally generates policies that induce periodic and compressible trajectories, and that exhibit superior robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bangyou01/mtc
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Advanced Bandit Algorithms Research · Robot Manipulation and Learning