Efficient Winograd Convolution via Integer Arithmetic

Lingchuan Meng; John Brothers

arXiv:1901.01965·cs.NE·January 9, 2019·27 cites

Efficient Winograd Convolution via Integer Arithmetic

Lingchuan Meng, John Brothers

PDF

Open Access

TL;DR

This paper introduces a novel class of Winograd convolution algorithms extended to complex fields, optimizing integer arithmetic to accelerate neural network inference with reduced bit widths and improved efficiency.

Contribution

It presents a new complex-field Winograd algorithm with fewer multiplications and an integer-based filter scaling scheme that reduces bit width without accuracy loss.

Findings

01

Arithmetic complexity reduced by 3.13x compared to direct method

02

Efficiency gain of up to 17.37% over rational algorithms

03

Filter bit width reduced by 30.77% without accuracy loss

Abstract

Convolution is the core operation for many deep neural networks. The Winograd convolution algorithms have been shown to accelerate the widely-used small convolution sizes. Quantized neural networks can effectively reduce model sizes and improve inference speed, which leads to a wide variety of kernels and hardware accelerators that work with integer data. The state-of-the-art Winograd algorithms pose challenges for efficient implementation and execution by the integer kernels and accelerators. We introduce a new class of Winograd algorithms by extending the construction to the field of complex and propose optimizations that reduce the number of general multiplications. The new algorithm achieves an arithmetic complexity reduction of $3.13$ x over the direct method and an efficiency gain up to $17.37%$ over the rational algorithms. Furthermore, we design and implement an integer-based…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications