GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs
Mustafa Munir, William Avery, Md Mostafijur Rahman, and Radu, Marculescu

TL;DR
This paper introduces GreedyViG, a hybrid CNN-GNN architecture with a novel, efficient graph construction method called DAGC, which outperforms existing models in accuracy and efficiency across multiple vision tasks.
Contribution
The paper proposes DAGC for efficient graph construction and a new hybrid CNN-GNN architecture, GreedyViG, achieving superior performance and efficiency in vision tasks.
Findings
GreedyViG surpasses existing ViG, CNN, and ViT models in accuracy and efficiency.
GreedyViG-S achieves 81.1% top-1 accuracy on ImageNet-1K, outperforming related models.
GreedyViG-B reduces parameters and GMACs significantly while maintaining or improving accuracy.
Abstract
Vision graph neural networks (ViG) offer a new avenue for exploration in computer vision. A major bottleneck in ViGs is the inefficient k-nearest neighbor (KNN) operation used for graph construction. To solve this issue, we propose a new method for designing ViGs, Dynamic Axial Graph Construction (DAGC), which is more efficient than KNN as it limits the number of considered graph connections made within an image. Additionally, we propose a novel CNN-GNN architecture, GreedyViG, which uses DAGC. Extensive experiments show that GreedyViG beats existing ViG, CNN, and ViT architectures in terms of accuracy, GMACs, and parameters on image classification, object detection, instance segmentation, and semantic segmentation tasks. Our smallest model, GreedyViG-S, achieves 81.1% top-1 accuracy on ImageNet-1K, 2.9% higher than Vision GNN and 2.2% higher than Vision HyperGraph Neural Network…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications
