Control Variate Approximation for DNN Accelerators

Georgios Zervakis; Ourania Spantidi; Iraklis Anagnostopoulos; Hussam; Amrouch; J\"org Henkel

arXiv:2102.09642·cs.LG·March 16, 2022

Control Variate Approximation for DNN Accelerators

Georgios Zervakis, Ourania Spantidi, Iraklis Anagnostopoulos, Hussam, Amrouch, J\"org Henkel

PDF

TL;DR

This paper presents a control variate technique to reduce error in low-precision DNN accelerators, enabling power savings with minimal accuracy loss without extensive retraining.

Contribution

It introduces a novel control variate method for variance reduction in approximate DNN accelerators, improving power efficiency while maintaining accuracy.

Findings

01

Achieves 24% power reduction in DNN accelerators.

02

Maintains same performance with only 0.16% accuracy loss.

03

Applicable to multiple DNN models on CIFAR datasets.

Abstract

In this work, we introduce a control variate approximation technique for low error approximate Deep Neural Network (DNN) accelerators. The control variate technique is used in Monte Carlo methods to achieve variance reduction. Our approach significantly decreases the induced error due to approximate multiplications in DNN inference, without requiring time-exhaustive retraining compared to state-of-the-art. Leveraging our control variate method, we use highly approximated multipliers to generate power-optimized DNN accelerators. Our experimental evaluation on six DNNs, for Cifar-10 and Cifar-100 datasets, demonstrates that, compared to the accurate design, our control variate approximation achieves same performance and 24% power reduction for a merely 0.16% accuracy loss.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.