Extended Bit-Plane Compression for Convolutional Neural Network   Accelerators

Lukas Cavigelli; Luca Benini

arXiv:1810.03979·cs.CV·March 5, 2019

Extended Bit-Plane Compression for Convolutional Neural Network Accelerators

Lukas Cavigelli, Luca Benini

PDF

1 Repo

TL;DR

This paper presents a new hardware-friendly compression scheme for CNN feature maps that significantly reduces data size, enabling more efficient deployment on power-constrained devices and data centers.

Contribution

The authors introduce an extended bit-plane compression method that achieves higher compression ratios with minimal hardware overhead for CNN accelerators.

Findings

01

Average compression ratio of 4.4x for ResNet-34

02

60% gain over existing compression methods

03

Requires less than 300 bits of hardware logic

Abstract

After the tremendous success of convolutional neural networks in image classification, object detection, speech recognition, etc., there is now rising demand for deployment of these compute-intensive ML models on tightly power constrained embedded and mobile systems at low cost as well as for pushing the throughput in data centers. This has triggered a wave of research towards specialized hardware accelerators. Their performance is often constrained by I/O bandwidth and the energy consumption is dominated by I/O transfers to off-chip memory. We introduce and evaluate a novel, hardware-friendly compression scheme for the feature maps present within convolutional neural networks. We show that an average compression ratio of 4.4x relative to uncompressed data and a gain of 60% over existing method can be achieved for ResNet-34 with a compression block requiring <300 bit of sequential cells…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lukasc-ch/ExtendedBitPlaneCompression
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.