AdaptViG: Adaptive Vision GNN with Exponential Decay Gating
Mustafa Munir, Md Mostafijur Rahman, Radu Marculescu

TL;DR
AdaptViG introduces an efficient hybrid Vision GNN with adaptive graph construction and exponential decay gating, achieving state-of-the-art accuracy-efficiency trade-offs in vision tasks.
Contribution
It proposes a novel adaptive graph construction mechanism with exponential decay gating and a hybrid strategy for improved efficiency and accuracy in Vision GNNs.
Findings
Achieves 82.6% top-1 accuracy with fewer parameters and GMACs.
Outperforms larger models on downstream tasks.
Provides a new state-of-the-art efficiency-accuracy balance.
Abstract
Vision Graph Neural Networks (ViGs) offer a new direction for advancements in vision architectures. While powerful, ViGs often face substantial computational challenges stemming from their graph construction phase, which can hinder their efficiency. To address this issue we propose AdaptViG, an efficient and powerful hybrid Vision GNN that introduces a novel graph construction mechanism called Adaptive Graph Convolution. This mechanism builds upon a highly efficient static axial scaffold and a dynamic, content-aware gating strategy called Exponential Decay Gating. This gating mechanism selectively weighs long-range connections based on feature similarity. Furthermore, AdaptViG employs a hybrid strategy, utilizing our efficient gating mechanism in the early stages and a full Global Attention block in the final stage for maximum feature aggregation. Our method achieves a new…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Multimodal Machine Learning Applications · Advanced Neural Network Applications
