Decorrelated Soft Actor-Critic for Efficient Deep Reinforcement Learning
Burcu K\"u\c{c}\"uko\u{g}lu, Sander Dalm, Marcel van Gerven

TL;DR
This paper introduces a novel online decorrelation method integrated into deep RL training, enhancing sample efficiency and training speed by improving credit assignment, demonstrated through the DSAC approach on Atari benchmarks.
Contribution
The paper proposes a new online decorrelation technique for deep RL that is seamlessly integrated into the training process, improving efficiency and performance.
Findings
Faster training in 5 out of 7 Atari games
Approximately 50% reduction in wall-clock time
Maintained performance levels across tested games
Abstract
The effectiveness of credit assignment in reinforcement learning (RL) when dealing with high-dimensional data is influenced by the success of representation learning via deep neural networks, and has implications for the sample efficiency of deep RL algorithms. Input decorrelation has been previously introduced as a method to speed up optimization in neural networks, and has proven impactful in both efficient deep learning and as a method for effective representation learning for deep RL algorithms. We propose a novel approach to online decorrelation in deep RL based on the decorrelated backpropagation algorithm that seamlessly integrates the decorrelation process into the RL training pipeline. Decorrelation matrices are added to each layer, which are updated using a separate decorrelation learning rule that minimizes the total decorrelation loss across all layers, in parallel to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Evolutionary Algorithms and Applications · Reinforcement Learning in Robotics
MethodsDilated Convolution · Convolution · 1x1 Convolution · Average Pooling · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Global Average Pooling · Switchable Atrous Convolution
