HAViT: Historical Attention Vision Transformer
Swarnendu Banik, Manish Das, Shiv Ram Dubey, Satish Kumar Singh

TL;DR
HAViT introduces a cross-layer attention propagation method for Vision Transformers, leveraging historical attention matrices to improve feature learning and accuracy with minimal architectural changes.
Contribution
The paper proposes a novel cross-layer attention propagation technique that refines inter-layer information flow in Vision Transformers, enhancing performance across multiple datasets and architectures.
Findings
Consistent accuracy improvements on CIFAR-100 and TinyImageNet.
Optimal blending hyperparameter identified at alpha=0.45.
Random initialization outperforms zero initialization for attention matrices.
Abstract
Vision Transformers have excelled in computer vision but their attention mechanisms operate independently across layers, limiting information flow and feature learning. We propose an effective cross-layer attention propagation method that preserves and integrates historical attention matrices across encoder layers, offering a principled refinement of inter-layer information flow in Vision Transformers. This approach enables progressive refinement of attention patterns throughout the transformer hierarchy, enhancing feature acquisition and optimization dynamics. The method requires minimal architectural changes, adding only attention matrix storage and blending operations. Comprehensive experiments on CIFAR-100 and TinyImageNet demonstrate consistent accuracy improvements, with ViT performance increasing from 75.74% to 77.07% on CIFAR-100 (+1.33%) and from 57.82% to 59.07% on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · CCD and CMOS Imaging Sensors · Big Data and Digital Economy
