VSCNN: Convolution Neural Network Accelerator With Vector Sparsity

Kuo-Wei Chang; and Tian-Sheuan Chang

arXiv:2205.02271·cs.AR·May 6, 2022

VSCNN: Convolution Neural Network Accelerator With Vector Sparsity

Kuo-Wei Chang, and Tian-Sheuan Chang

PDF

TL;DR

This paper introduces VSCNN, a hardware accelerator that efficiently supports both dense and vector sparse CNNs using a unified design, achieving significant speedup over traditional dense CNN accelerators.

Contribution

The paper presents a novel CNN accelerator supporting vector sparsity with low overhead, enabling flexible and efficient processing of both dense and sparse networks.

Findings

01

Achieves 1.93X speedup over dense CNN accelerators

02

Supports both dense and vector sparse CNNs with the same hardware

03

Reduces control complexity compared to fine-grained sparse support

Abstract

Hardware accelerator for convolution neural network (CNNs) enables real time applications of artificial intelligence technology. However, most of the accelerators only support dense CNN computations or suffers complex control to support fine grained sparse networks. To solve above problem, this paper presents an efficient CNN accelerator with 1-D vector broadcasted input to support both dense network as well as vector sparse network with the same hardware and low overhead. The presented design achieves 1.93X speedup over the dense CNN computations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.