EDCompress: Energy-Aware Model Compression for Dataflows
Zhehui Wang, Tao Luo, Joey Tianyi Zhou, Rick Siow Mong Goh

TL;DR
EDCompress is a reinforcement learning-based method that optimizes CNN model compression for various dataflows, significantly reducing energy consumption on edge devices while maintaining accuracy.
Contribution
It introduces a novel energy-aware compression approach tailored for different dataflow types, addressing a gap in existing work.
Findings
Achieves up to 37X energy efficiency improvements.
Effectively finds optimal dataflow types for neural networks.
Maintains negligible accuracy loss during compression.
Abstract
Edge devices demand low energy consumption, cost and small form factor. To efficiently deploy convolutional neural network (CNN) models on edge device, energy-aware model compression becomes extremely important. However, existing work did not study this problem well because the lack of considering the diversity of dataflow types in hardware architectures. In this paper, we propose EDCompress, an Energy-aware model compression method for various Dataflows. It can effectively reduce the energy consumption of various edge devices, with different dataflow types. Considering the very nature of model compression procedures, we recast the optimization process to a multi-step problem, and solve it by reinforcement learning algorithms. Experiments show that EDCompress could improve 20X, 17X, 37X energy efficiency in VGG-16, MobileNet, LeNet-5 networks, respectively, with negligible loss of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Data and IoT Technologies · IoT and Edge/Fog Computing
