GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs

Mustafa Munir; William Avery; Md Mostafijur Rahman; and Radu; Marculescu

arXiv:2405.06849·cs.CV·May 14, 2024

GreedyViG: Dynamic Axial Graph Construction for Efficient Vision GNNs

Mustafa Munir, William Avery, Md Mostafijur Rahman, and Radu, Marculescu

PDF

Open Access 1 Repo 1 Models

TL;DR

This paper introduces GreedyViG, a hybrid CNN-GNN architecture with a novel, efficient graph construction method called DAGC, which outperforms existing models in accuracy and efficiency across multiple vision tasks.

Contribution

The paper proposes DAGC for efficient graph construction and a new hybrid CNN-GNN architecture, GreedyViG, achieving superior performance and efficiency in vision tasks.

Findings

01

GreedyViG surpasses existing ViG, CNN, and ViT models in accuracy and efficiency.

02

GreedyViG-S achieves 81.1% top-1 accuracy on ImageNet-1K, outperforming related models.

03

GreedyViG-B reduces parameters and GMACs significantly while maintaining or improving accuracy.

Abstract

Vision graph neural networks (ViG) offer a new avenue for exploration in computer vision. A major bottleneck in ViGs is the inefficient k-nearest neighbor (KNN) operation used for graph construction. To solve this issue, we propose a new method for designing ViGs, Dynamic Axial Graph Construction (DAGC), which is more efficient than KNN as it limits the number of considered graph connections made within an image. Additionally, we propose a novel CNN-GNN architecture, GreedyViG, which uses DAGC. Extensive experiments show that GreedyViG beats existing ViG, CNN, and ViT architectures in terms of accuracy, GMACs, and parameters on image classification, object detection, instance segmentation, and semantic segmentation tasks. Our smallest model, GreedyViG-S, achieves 81.1% top-1 accuracy on ImageNet-1K, 2.9% higher than Vision GNN and 2.2% higher than Vision HyperGraph Neural Network…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

SLDGroup/GreedyViG
pytorchOfficial

Models

🤗
SLDGroup/GreedyViG
model· ♡ 1
♡ 1

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVisual Attention and Saliency Detection · Advanced Image and Video Retrieval Techniques · Advanced Neural Network Applications