Accelerating Neural Network Training: A Brief Review

Sahil Nokhwal; Priyanka Chilakalapudi; Preeti Donekal; Suman Nokhwal,; Saurabh Pahune; Ankit Chaudhary

arXiv:2312.10024·cs.LG·December 29, 2023·6 cites

Accelerating Neural Network Training: A Brief Review

Sahil Nokhwal, Priyanka Chilakalapudi, Preeti Donekal, Suman Nokhwal,, Saurabh Pahune, Ankit Chaudhary

PDF

Open Access 1 Repo

TL;DR

This paper reviews methods to accelerate deep neural network training, focusing on Gradient Accumulation, Automatic Mixed Precision, and Pin Memory, demonstrating their combined effectiveness in reducing training time for models like ResNet50, ViT, and EfficientNet.

Contribution

It provides an empirical evaluation of three advanced techniques to speed up DNN training, highlighting their synergistic benefits and practical implications.

Findings

01

Gradient Accumulation reduces training duration significantly.

02

Automatic Mixed Precision maintains accuracy while increasing speed.

03

Pin Memory improves data transfer efficiency between CPU and GPU.

Abstract

The process of training a deep neural network is characterized by significant time requirements and associated costs. Although researchers have made considerable progress in this area, further work is still required due to resource constraints. This study examines innovative approaches to expedite the training process of deep neural networks (DNN), with specific emphasis on three state-of-the-art models such as ResNet50, Vision Transformer (ViT), and EfficientNet. The research utilizes sophisticated methodologies, including Gradient Accumulation (GA), Automatic Mixed Precision (AMP), and Pin Memory (PM), in order to optimize performance and accelerate the training procedure. The study examines the effects of these methodologies on the DNN models discussed earlier, assessing their efficacy with regard to training rate and computational efficacy. The study showcases the efficacy of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

sahilnokhwal/annt
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Currency Recognition and Detection · Industrial Vision Systems and Defect Detection

Methods*Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Attention Is All You Need · Depthwise Convolution · Pointwise Convolution · Batch Normalization · Depthwise Separable Convolution · Convolution · Sigmoid Activation · 1x1 Convolution