AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

Hongjie Zhang

arXiv:2211.15023·cs.LG·November 29, 2022

AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning

Hongjie Zhang

PDF

Open Access

TL;DR

AcceRL is a novel framework that accelerates deep reinforcement learning by integrating neural network compression techniques, significantly reducing training time while maintaining policy quality.

Contribution

This paper introduces AcceRL, the first reinforcement learning framework combining multiple neural network compression methods for policy acceleration.

Findings

01

Reduces actor time cost by 2.0x to 4.13x.

02

Decreases total training time by 29.8% to 40.3%.

03

Maintains policy quality comparable to traditional methods.

Abstract

Deep reinforcement learning has achieved great success in various fields with its super decision-making ability. However, the policy learning process requires a large amount of training time, causing energy consumption. Inspired by the redundancy of neural networks, we propose a lightweight parallel training framework based on neural network compression, AcceRL, to accelerate the policy learning while ensuring policy quality. Specifically, AcceRL speeds up the experience collection by flexibly combining various neural network compression methods. Overall, the AcceRL consists of five components, namely Actor, Learner, Compressor, Corrector, and Monitor. The Actor uses the Compressor to compress the Learner's policy network to interact with the environment. And the generated experiences are transformed by the Corrector with Off-Policy methods, such as V-trace, Retrace and so on. Then the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Autonomous Vehicle Technology and Safety

MethodsV-trace · Retrace