Error Analysis and Improving the Accuracy of Winograd Convolution for   Deep Neural Networks

Barbara Barabasz; Andrew Anderson; Kirk M. Soodhalter; David Gregg

arXiv:1803.10986·cs.NA·May 3, 2019

Error Analysis and Improving the Accuracy of Winograd Convolution for Deep Neural Networks

Barbara Barabasz, Andrew Anderson, Kirk M. Soodhalter, David Gregg

PDF

TL;DR

This paper analyzes the floating point error in Winograd convolution algorithms used in deep neural networks, proves error bounds, and proposes methods like Huffman coding and mixed-precision to reduce errors and improve accuracy.

Contribution

It provides a detailed error analysis of Winograd algorithms, introduces new techniques to reduce FP errors, and demonstrates improved accuracy enabling larger block sizes.

Findings

01

Error bounds grow exponentially with convolution size

02

Modified algorithms have smaller error bounds

03

Proposed methods significantly reduce FP error

Abstract

Popular deep neural networks (DNNs) spend the majority of their execution time computing convolutions. The Winograd family of algorithms can greatly reduce the number of arithmetic operations required and is present in many DNN software frameworks. However, the performance gain is at the expense of a reduction in floating point (FP) numerical accuracy. In this paper, we analyse the worst case FP error and prove the estimation of norm and conditioning of the algorithm. We show that the bound grows exponentially with the size of the convolution, but the error bound of the \textit{modified} algorithm is smaller than the original one. We propose several methods for reducing FP error. We propose a canonical evaluation ordering based on Huffman coding that reduces summation error. We study the selection of sampling "points" experimentally and find empirically good points for the most…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.