Fair Comparison between Efficient Attentions

Jiuk Hong; Chaehyeon Lee; Soyoun Bang; Heechul Jung

arXiv:2206.00244·cs.CV·June 2, 2022

Fair Comparison between Efficient Attentions

Jiuk Hong, Chaehyeon Lee, Soyoun Bang, Heechul Jung

PDF

Open Access 1 Repo

TL;DR

This paper systematically compares various efficient attention mechanisms in transformers for image classification, isolating their performance differences by standardizing model configurations and training schemes.

Contribution

It provides a direct, fair comparison of efficient attention methods on ImageNet1K, highlighting which approaches perform best under identical conditions.

Findings

01

Certain efficient attention methods outperform others in classification accuracy.

02

Trade-offs between computational efficiency and accuracy are identified.

03

Standardized evaluation framework for future attention model comparisons.

Abstract

Transformers have been successfully used in various fields and are becoming the standard tools in computer vision. However, self-attention, a core component of transformers, has a quadratic complexity problem, which limits the use of transformers in various vision tasks that require dense prediction. Many studies aiming at solving this problem have been reported proposed. However, no comparative study of these methods using the same scale has been reported due to different model configurations, training schemes, and new methods. In our paper, we validate these efficient attention models on the ImageNet1K classification task by changing only the attention operation and examining which efficient attention is better.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

CreamNuts/Fair-Comparison-between-Efficient-Attentions
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Neural Networks and Applications · Industrial Vision Systems and Defect Detection