Efficient Residue Number System Based Winograd Convolution

Zhi-Gang Liu; Matthew Mattina

arXiv:2007.12216·cs.LG·July 27, 2020

Efficient Residue Number System Based Winograd Convolution

Zhi-Gang Liu, Matthew Mattina

PDF

Open Access

TL;DR

This paper extends the Winograd convolution algorithm to Residue Number System (RNS), enabling efficient low-precision CNN inference with reduced arithmetic complexity and significant performance gains without accuracy loss.

Contribution

It introduces an RNS-based Winograd convolution method that maintains accuracy while significantly reducing computational complexity for low-precision neural network inference.

Findings

01

Arithmetic complexity reduced by up to 7.03x

02

Performance improved by 2.30x to 4.69x

03

No degradation in network prediction accuracy

Abstract

Prior research has shown that Winograd algorithm can reduce the computational complexity of convolutional neural networks (CNN) with weights and activations represented in floating point. However it is difficult to apply the scheme to the inference of low-precision quantized (e.g. INT8) networks. Our work extends the Winograd algorithm to Residue Number System (RNS). The minimal complexity convolution is computed precisely over large transformation tile (e.g. 10 x 10 to 16 x 16) of filters and activation patches using the Winograd transformation and low cost (e.g. 8-bit) arithmetic without degrading the prediction accuracy of the networks during inference. The arithmetic complexity reduction is up to 7.03x while the performance improvement is up to 2.30x to 4.69x for 3 x 3 and 5 x 5 filters respectively.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Numerical Methods and Algorithms · Computational Physics and Python Applications

MethodsConvolution