AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning
Hongjie Zhang

TL;DR
AcceRL is a novel framework that accelerates deep reinforcement learning by integrating neural network compression techniques, significantly reducing training time while maintaining policy quality.
Contribution
This paper introduces AcceRL, the first reinforcement learning framework combining multiple neural network compression methods for policy acceleration.
Findings
Reduces actor time cost by 2.0x to 4.13x.
Decreases total training time by 29.8% to 40.3%.
Maintains policy quality comparable to traditional methods.
Abstract
Deep reinforcement learning has achieved great success in various fields with its super decision-making ability. However, the policy learning process requires a large amount of training time, causing energy consumption. Inspired by the redundancy of neural networks, we propose a lightweight parallel training framework based on neural network compression, AcceRL, to accelerate the policy learning while ensuring policy quality. Specifically, AcceRL speeds up the experience collection by flexibly combining various neural network compression methods. Overall, the AcceRL consists of five components, namely Actor, Learner, Compressor, Corrector, and Monitor. The Actor uses the Compressor to compress the Learner's policy network to interact with the environment. And the generated experiences are transformed by the Corrector with Off-Policy methods, such as V-trace, Retrace and so on. Then the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety
MethodsV-trace · Retrace
