Accelerating Neural Network Training: A Brief Review
Sahil Nokhwal, Priyanka Chilakalapudi, Preeti Donekal, Suman Nokhwal,, Saurabh Pahune, Ankit Chaudhary

TL;DR
This paper reviews methods to accelerate deep neural network training, focusing on Gradient Accumulation, Automatic Mixed Precision, and Pin Memory, demonstrating their combined effectiveness in reducing training time for models like ResNet50, ViT, and EfficientNet.
Contribution
It provides an empirical evaluation of three advanced techniques to speed up DNN training, highlighting their synergistic benefits and practical implications.
Findings
Gradient Accumulation reduces training duration significantly.
Automatic Mixed Precision maintains accuracy while increasing speed.
Pin Memory improves data transfer efficiency between CPU and GPU.
Abstract
The process of training a deep neural network is characterized by significant time requirements and associated costs. Although researchers have made considerable progress in this area, further work is still required due to resource constraints. This study examines innovative approaches to expedite the training process of deep neural networks (DNN), with specific emphasis on three state-of-the-art models such as ResNet50, Vision Transformer (ViT), and EfficientNet. The research utilizes sophisticated methodologies, including Gradient Accumulation (GA), Automatic Mixed Precision (AMP), and Pin Memory (PM), in order to optimize performance and accelerate the training procedure. The study examines the effects of these methodologies on the DNN models discussed earlier, assessing their efficacy with regard to training rate and computational efficacy. The study showcases the efficacy of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Currency Recognition and Detection · Industrial Vision Systems and Defect Detection
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Multi-Head Attention · Attention Is All You Need · Depthwise Convolution · Pointwise Convolution · Batch Normalization · Depthwise Separable Convolution · Convolution · Sigmoid Activation · 1x1 Convolution
