ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training   and Inference

Jing Gong; Hassaan Saadat; Hasindu Gamaarachchi; Haris Javaid; Xiaobo; Sharon Hu; Sri Parameswaran

arXiv:2209.04161·cs.AR·September 26, 2022·1 cites

ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference

Jing Gong, Hassaan Saadat, Hasindu Gamaarachchi, Haris Javaid, Xiaobo, Sharon Hu, Sri Parameswaran

PDF

Open Access 2 Repos

TL;DR

ApproxTrain is an open-source GPU-accelerated framework that enables rapid simulation of approximate multipliers in DNN training and inference, facilitating resource-efficient hardware design without significant accuracy loss.

Contribution

It introduces a fast, user-friendly simulation framework for evaluating approximate multipliers in DNN training, integrating a novel LUT-based GPU simulator into TensorFlow.

Findings

01

GPU simulation is over 2500x faster than CPU-based methods.

02

Training with approximate multipliers shows similar convergence to standard multipliers.

03

Approximate multipliers cause negligible accuracy loss on large datasets like ImageNet.

Abstract

Edge training of Deep Neural Networks (DNNs) is a desirable goal for continuous learning; however, it is hindered by the enormous computational power required by training. Hardware approximate multipliers have shown their effectiveness for gaining resource-efficiency in DNN inference accelerators; however, training with approximate multipliers is largely unexplored. To build resource efficient accelerators with approximate multipliers supporting DNN training, a thorough evaluation of training convergence and accuracy for different DNN architectures and different approximate multipliers is needed. This paper presents ApproxTrain, an open-source framework that allows fast evaluation of DNN training and inference using simulated approximate multipliers. ApproxTrain is as user-friendly as TensorFlow (TF) and requires only a high-level description of a DNN architecture along with C/C++…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and Data Classification

MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings