The Impact of Quantization and Pruning on Deep Reinforcement Learning   Models

Heng Lu; Mehdi Alemi; and Reza Rawassizadeh

arXiv:2407.04803·cs.LG·July 9, 2024

The Impact of Quantization and Pruning on Deep Reinforcement Learning Models

Heng Lu, Mehdi Alemi, and Reza Rawassizadeh

PDF

Open Access

TL;DR

This paper investigates how quantization and pruning affect deep reinforcement learning models, focusing on performance trade-offs, energy efficiency, and deployment in resource-limited environments.

Contribution

It provides a comprehensive analysis of the effects of quantization and pruning on DRL models' performance, energy use, and resource requirements, offering deployment guidelines.

Findings

01

Model size decreases with compression techniques.

02

Energy efficiency does not significantly improve despite size reduction.

03

Performance trade-offs depend on the specific compression method and environment.

Abstract

Deep reinforcement learning (DRL) has achieved remarkable success across various domains, such as video games, robotics, and, recently, large language models. However, the computational costs and memory requirements of DRL models often limit their deployment in resource-constrained environments. The challenge underscores the urgent need to explore neural network compression methods to make RDL models more practical and broadly applicable. Our study investigates the impact of two prominent compression methods, quantization and pruning on DRL models. We examine how these techniques influence four performance factors: average return, memory, inference time, and battery utilization across various DRL algorithms and environments. Despite the decrease in model size, we identify that these compression techniques generally do not improve the energy efficiency of DRL models, but the model size…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsPruning