Adaptive Regularization of Representation Rank as an Implicit Constraint   of Bellman Equation

Qiang He; Tianyi Zhou; Meng Fang; Setareh Maghsudi

arXiv:2404.12754·cs.LG·April 22, 2024

Adaptive Regularization of Representation Rank as an Implicit Constraint of Bellman Equation

Qiang He, Tianyi Zhou, Meng Fang, Setareh Maghsudi

PDF

Open Access 1 Repo

TL;DR

This paper introduces BEER, a novel regularizer based on the Bellman equation, which adaptively controls the representation rank in deep reinforcement learning, leading to improved performance and better Q-value approximation in complex tasks.

Contribution

The paper proposes BEER, a new regularizer that adaptively manages representation rank using the Bellman equation, enhancing DRL performance and stability.

Findings

01

BEER outperforms baselines on 12 DeepMind control tasks.

02

Adaptive regularization improves Q-value approximation.

03

The method scales effectively to complex continuous control tasks.

Abstract

Representation rank is an important concept for understanding the role of Neural Networks (NNs) in Deep Reinforcement learning (DRL), which measures the expressive capacity of value networks. Existing studies focus on unboundedly maximizing this rank; nevertheless, that approach would introduce overly complex models in the learning, thus undermining performance. Hence, fine-tuning representation rank presents a challenging and crucial optimization problem. To address this issue, we find a guiding principle for adaptive control of the representation rank. We employ the Bellman equation as a theoretical foundation and derive an upper bound on the cosine similarity of consecutive state-action pairs representations of value networks. We then leverage this upper bound to propose a novel regularizer, namely BEllman Equation-based automatic rank Regularizer (BEER). This regularizer adaptively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sweetice/beer-iclr2024
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNumerical methods in inverse problems

MethodsFocus