GrateTile: Efficient Sparse Tensor Tiling for CNN Processing

Yu-Sheng Lin; Hung Chang Lu; Yang-Bin Tsao; Yi-Min Chih; Wei-Chao; Chen; Shao-Yi Chien

arXiv:2009.08685·cs.LG·September 21, 2020

GrateTile: Efficient Sparse Tensor Tiling for CNN Processing

Yu-Sheng Lin, Hung Chang Lu, Yang-Bin Tsao, Yi-Min Chih, Wei-Chao, Chen, Shao-Yi Chien

PDF

Open Access

TL;DR

GrateTile introduces a hardware-friendly sparse tensor tiling scheme that significantly reduces DRAM bandwidth in CNN accelerators by efficiently storing and accessing sparse feature maps with minimal indexing overhead.

Contribution

It presents a novel sparse tensor tiling method that enables efficient on-the-fly decompression and access, reducing bandwidth and storage overhead in CNN processing.

Findings

01

55% average DRAM bandwidth reduction

02

Only 0.6% feature map size used for indexing

03

Compatible with modern CNN accelerators

Abstract

We propose GrateTile, an efficient, hardwarefriendly data storage scheme for sparse CNN feature maps (activations). It divides data into uneven-sized subtensors and, with small indexing overhead, stores them in a compressed yet randomly accessible format. This design enables modern CNN accelerators to fetch and decompressed sub-tensors on-the-fly in a tiled processing manner. GrateTile is suitable for architectures that favor aligned, coalesced data access, and only requires minimal changes to the overall architectural design. We simulate GrateTile with state-of-the-art CNNs and show an average of 55% DRAM bandwidth reduction while using only 0.6% of feature map size for indexing storage.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Parallel Computing and Optimization Techniques · Tensor decomposition and applications