Training Neural Networks for Execution on Approximate Hardware

Tianmu Li; Shurui Li; Puneet Gupta

arXiv:2304.04125·cs.LG·April 11, 2023·1 cites

Training Neural Networks for Execution on Approximate Hardware

Tianmu Li, Shurui Li, Puneet Gupta

PDF

Open Access

TL;DR

This paper explores specialized training methods for neural networks on approximate hardware, demonstrating significant speedups and addressing the gap in training approaches for energy-efficient deep learning inference.

Contribution

It introduces training techniques tailored for approximate hardware and proposes methods to accelerate training by up to 18 times.

Findings

01

Training methods need to be adapted for approximate hardware

02

Proposed techniques achieve up to 18X faster training

03

Enhances the practicality of approximate computing in deep learning

Abstract

Approximate computing methods have shown great potential for deep learning. Due to the reduced hardware costs, these methods are especially suitable for inference tasks on battery-operated devices that are constrained by their power budget. However, approximate computing hasn't reached its full potential due to the lack of work on training methods. In this work, we discuss training methods for approximate hardware. We demonstrate how training needs to be specialized for approximate hardware, and propose methods to speed up the training process by up to 18X.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices · Advancements in Semiconductor Devices and Circuit Design

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings