Variability-Aware Training and Self-Tuning of Highly Quantized DNNs for   Analog PIM

Zihao Deng; Michael Orshansky

arXiv:2111.06457·cs.LG·November 15, 2021

Variability-Aware Training and Self-Tuning of Highly Quantized DNNs for Analog PIM

Zihao Deng, Michael Orshansky

PDF

TL;DR

This paper introduces a variability-aware training algorithm for highly quantized DNNs on analog PIM architectures, significantly improving accuracy under fabrication variability, and proposes a self-tuning inference method to further reduce accuracy loss.

Contribution

It presents a novel joint variability- and quantization-aware training method and a self-tuning inference architecture for robust DNN deployment on analog PIM hardware.

Findings

01

Outperforms prior models on multiple datasets and models.

02

Achieves up to 35.7% accuracy gain in low-bitwidth models with high variability.

03

Self-tuning reduces accuracy loss to below 10%.

Abstract

DNNs deployed on analog processing in memory (PIM) architectures are subject to fabrication-time variability. We developed a new joint variability- and quantization-aware DNN training algorithm for highly quantized analog PIM-based models that is significantly more effective than prior work. It outperforms variability-oblivious and post-training quantized models on multiple computer vision datasets/models. For low-bitwidth models and high variation, the gain in accuracy is up to 35.7% for ResNet-18 over the best alternative. We demonstrate that, under a realistic pattern of within- and between-chip components of variability, training alone is unable to prevent large DNN accuracy loss (of up to 54% on CIFAR-100/ResNet-18). We introduce a self-tuning DNN architecture that dynamically adjusts layer-wise activations during inference and is effective in reducing accuracy loss to below 10%.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.