Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning

Burcu K\"u\c{c}\"uko\u{g}lu; Sander Dalm; Marcel van Gerven

arXiv:2501.19133·cs.LG·February 3, 2025

Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning

Burcu K\"u\c{c}\"uko\u{g}lu, Sander Dalm, Marcel van Gerven

PDF

Open Access

TL;DR

This paper introduces a novel online decorrelation method integrated into deep RL training, enhancing sample efficiency and training speed by improving credit assignment, demonstrated through the DSAC approach on Atari benchmarks.

Contribution

The paper proposes a new online decorrelation technique for deep RL that is seamlessly integrated into the training process, improving efficiency and performance.

Findings

01

Faster training in 5 out of 7 Atari games

02

Approximately 50% reduction in wall-clock time

03

Maintained performance levels across tested games

Abstract

The effectiveness of credit assignment in reinforcement learning (RL) when dealing with high-dimensional data is influenced by the success of representation learning via deep neural networks, and has implications for the sample efficiency of deep RL algorithms. Input decorrelation has been previously introduced as a method to speed up optimization in neural networks, and has proven impactful in both efficient deep learning and as a method for effective representation learning for deep RL algorithms. We propose a novel approach to online decorrelation in deep RL based on the decorrelated backpropagation algorithm that seamlessly integrates the decorrelation process into the RL training pipeline. Decorrelation matrices are added to each layer, which are updated using a separate decorrelation learning rule that minimizes the total decorrelation loss across all layers, in parallel to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics

MethodsDilated Convolution · Convolution · 1x1 Convolution · Average Pooling · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Global Average Pooling · Switchable Atrous Convolution