Efficient Low-rank Backpropagation for Vision Transformer Adaptation
Yuedong Yang, Hung-Yueh Chiang, Guihong Li, Diana Marculescu, Radu, Marculescu

TL;DR
This paper introduces LBP-WHT, a low-rank backpropagation method using Walsh-Hadamard transformation, significantly reducing computation in ViT fine-tuning while improving accuracy on datasets like CIFAR100.
Contribution
We propose the first low-rank backpropagation technique for accelerating vision transformer adaptation using Walsh-Hadamard transformation.
Findings
Achieves 10.4% higher accuracy on CIFAR100 with less computation.
Reduces backpropagation computation by 9 MFLOPs.
Effective across different ViT and hybrid models.
Abstract
The increasing scale of vision transformers (ViT) has made the efficient fine-tuning of these large models for specific needs a significant challenge in various applications. This issue originates from the computationally demanding matrix multiplications required during the backpropagation process through linear layers in ViT. In this paper, we tackle this problem by proposing a new Low-rank BackPropagation via Walsh-Hadamard Transformation (LBP-WHT) method. Intuitively, LBP-WHT projects the gradient into a low-rank space and carries out backpropagation. This approach substantially reduces the computation needed for adapting ViT, as matrix multiplication in the low-rank space is far less resource-intensive. We conduct extensive experiments with different models (ViT, hybrid convolution-ViT model) on multiple datasets to demonstrate the effectiveness of our method. For instance, when…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Fusion Techniques · Sparse and Compressive Sensing Techniques · Advanced Vision and Imaging
