Performance Evaluation of Swin Vision Transformer Model using Gradient   Accumulation Optimization Technique

Sanad Aburass; Osama Dorgham

arXiv:2308.00197·cs.CV·August 2, 2023·2 cites

Performance Evaluation of Swin Vision Transformer Model using Gradient Accumulation Optimization Technique

Sanad Aburass, Osama Dorgham

PDF

Open Access

TL;DR

This paper evaluates the impact of gradient accumulation optimization on Swin Vision Transformer models, finding that it decreases accuracy and increases training time, thus questioning its suitability for such models.

Contribution

The study provides the first detailed analysis of how gradient accumulation optimization affects Swin ViT performance, highlighting potential drawbacks.

Findings

01

GAO decreases Swin ViT accuracy

02

GAO increases training time for Swin ViT

03

Applying GAO may not be suitable for transformer models

Abstract

Vision Transformers (ViTs) have emerged as a promising approach for visual recognition tasks, revolutionizing the field by leveraging the power of transformer-based architectures. Among the various ViT models, Swin Transformers have gained considerable attention due to their hierarchical design and ability to capture both local and global visual features effectively. This paper evaluates the performance of Swin ViT model using gradient accumulation optimization (GAO) technique. We investigate the impact of gradient accumulation optimization technique on the model's accuracy and training time. Our experiments show that applying the GAO technique leads to a significant decrease in the accuracy of the Swin ViT model, compared to the standard Swin Transformer model. Moreover, we detect a significant increase in the training time of the Swin ViT model when GAO model is applied. These…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · CCD and CMOS Imaging Sensors · Infrared Target Detection Methodologies

MethodsMulti-Head Attention · Attention Is All You Need · Position-Wise Feed-Forward Layer · Layer Normalization · Softmax · Stochastic Depth · Linear Layer · Adam · Dense Connections · Label Smoothing