ApproxTrain: Fast Simulation of Approximate Multipliers for DNN Training and Inference
Jing Gong, Hassaan Saadat, Hasindu Gamaarachchi, Haris Javaid, Xiaobo, Sharon Hu, Sri Parameswaran

TL;DR
ApproxTrain is an open-source GPU-accelerated framework that enables rapid simulation of approximate multipliers in DNN training and inference, facilitating resource-efficient hardware design without significant accuracy loss.
Contribution
It introduces a fast, user-friendly simulation framework for evaluating approximate multipliers in DNN training, integrating a novel LUT-based GPU simulator into TensorFlow.
Findings
GPU simulation is over 2500x faster than CPU-based methods.
Training with approximate multipliers shows similar convergence to standard multipliers.
Approximate multipliers cause negligible accuracy loss on large datasets like ImageNet.
Abstract
Edge training of Deep Neural Networks (DNNs) is a desirable goal for continuous learning; however, it is hindered by the enormous computational power required by training. Hardware approximate multipliers have shown their effectiveness for gaining resource-efficiency in DNN inference accelerators; however, training with approximate multipliers is largely unexplored. To build resource efficient accelerators with approximate multipliers supporting DNN training, a thorough evaluation of training convergence and accuracy for different DNN architectures and different approximate multipliers is needed. This paper presents ApproxTrain, an open-source framework that allows fast evaluation of DNN training and inference using simulated approximate multipliers. ApproxTrain is as user-friendly as TensorFlow (TF) and requires only a high-level description of a DNN architecture along with C/C++…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Stochastic Gradient Optimization Techniques · Machine Learning and Data Classification
MethodsTest · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
