Adaptive Block Floating-Point for Analog Deep Learning Hardware
Ayon Basumallik, Darius Bunandar, Nicholas Dronen, Nicholas Harris,, Ludmila Levkova, Calvin McCarter, Lakshmi Nair, David Walter, David Widemann

TL;DR
This paper introduces an adaptive block floating-point representation and a finetuning method for analog deep learning hardware, significantly improving accuracy and efficiency in DNN inference on AMS devices.
Contribution
It proposes a novel AMS-compatible adaptive block floating-point format and a noise-aware finetuning method to enhance DNN accuracy on analog hardware.
Findings
Achieves less than 1% accuracy loss on MLPerf benchmarks.
Demonstrates improved precision without increasing bit width.
Introduces a faster finetuning method for AMS devices.
Abstract
Analog mixed-signal (AMS) devices promise faster, more energy-efficient deep neural network (DNN) inference than their digital counterparts. However, recent studies show that DNNs on AMS devices with fixed-point numbers can incur an accuracy penalty because of precision loss. To mitigate this penalty, we present a novel AMS-compatible adaptive block floating-point (ABFP) number representation. We also introduce amplification (or gain) as a method for increasing the accuracy of the number representation without increasing the bit precision of the output. We evaluate the effectiveness of ABFP on the DNNs in the MLPerf datacenter inference benchmark -- realizing less than loss in accuracy compared to FLOAT32. We also propose a novel method of finetuning for AMS devices, Differential Noise Finetuning (DNF), which samples device noise to speed up finetuning compared to conventional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnalog and Mixed-Signal Circuit Design · Numerical Methods and Algorithms · Advancements in Semiconductor Devices and Circuit Design
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
