The Role of Deep Learning Regularizations on Actors in Offline RL

Denis Tarasov; Anja Surina; Caglar Gulcehre

arXiv:2409.07606·cs.LG·November 22, 2024

The Role of Deep Learning Regularizations on Actors in Offline RL

Denis Tarasov, Anja Surina, Caglar Gulcehre

PDF

Open Access 1 Repo

TL;DR

This paper investigates how deep learning regularizations like dropout and weight decay affect actor networks in offline reinforcement learning, showing they improve performance by around 6% on average.

Contribution

It provides the first empirical analysis of regularization effects on actor networks in offline RL, demonstrating consistent performance gains.

Findings

01

Regularization improves offline RL actor performance by 6%.

02

Standard techniques like dropout and weight decay are effective.

03

Enhances generalization in offline RL actor networks.

Abstract

Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial neural networks, often resulting in more robust training processes and improved generalization capabilities. However, in the domain of Reinforcement Learning (RL), the application of these techniques has been limited, usually applied to value function estimators (Hiraoka et al., 2021; Smith et al., 2022), and may result in detrimental effects. This issue is even more pronounced in offline RL settings, which bear greater similarity to supervised learning but have received less attention. Recent work in continuous offline RL (Park et al., 2024) has demonstrated that while we can build sufficiently powerful critic networks, the generalization of actor networks remains a bottleneck. In this study, we empirically show that applying…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dt6a/actoreg
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Intelligent Tutoring Systems and Adaptive Learning · Multi-Agent Systems and Negotiation