The Role of Deep Learning Regularizations on Actors in Offline RL
Denis Tarasov, Anja Surina, Caglar Gulcehre

TL;DR
This paper investigates how deep learning regularizations like dropout and weight decay affect actor networks in offline reinforcement learning, showing they improve performance by around 6% on average.
Contribution
It provides the first empirical analysis of regularization effects on actor networks in offline RL, demonstrating consistent performance gains.
Findings
Regularization improves offline RL actor performance by 6%.
Standard techniques like dropout and weight decay are effective.
Enhances generalization in offline RL actor networks.
Abstract
Deep learning regularization techniques, such as dropout, layer normalization, or weight decay, are widely adopted in the construction of modern artificial neural networks, often resulting in more robust training processes and improved generalization capabilities. However, in the domain of Reinforcement Learning (RL), the application of these techniques has been limited, usually applied to value function estimators (Hiraoka et al., 2021; Smith et al., 2022), and may result in detrimental effects. This issue is even more pronounced in offline RL settings, which bear greater similarity to supervised learning but have received less attention. Recent work in continuous offline RL (Park et al., 2024) has demonstrated that while we can build sufficiently powerful critic networks, the generalization of actor networks remains a bottleneck. In this study, we empirically show that applying…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Intelligent Tutoring Systems and Adaptive Learning · Multi-Agent Systems and Negotiation
