WiGNet: Windowed Vision Graph Neural Network

Gabriele Spadaro; Marco Grangetto; Attilio Fiandrotti; Enzo; Tartaglione; Jhony H. Giraldo

arXiv:2410.00807·cs.CV·October 2, 2024

WiGNet: Windowed Vision Graph Neural Network

Gabriele Spadaro, Marco Grangetto, Attilio Fiandrotti, Enzo, Tartaglione, Jhony H. Giraldo

PDF

Open Access 1 Repo

TL;DR

WiGNet introduces a windowed graph neural network approach for efficient image processing, reducing computational complexity while maintaining competitive accuracy on large-scale vision benchmarks.

Contribution

The paper proposes a novel windowed graph neural network architecture that partitions images into windows, enabling scalable and efficient vision GNNs for large images.

Findings

01

Achieves competitive accuracy on ImageNet-1k.

02

Reduces computational and memory complexity.

03

Demonstrates effectiveness on high-resolution CelebA-HQ images.

Abstract

In recent years, Graph Neural Networks (GNNs) have demonstrated strong adaptability to various real-world challenges, with architectures such as Vision GNN (ViG) achieving state-of-the-art performance in several computer vision tasks. However, their practical applicability is hindered by the computational complexity of constructing the graph, which scales quadratically with the image size. In this paper, we introduce a novel Windowed vision Graph neural Network (WiGNet) model for efficient image processing. WiGNet explores a different strategy from previous works by partitioning the image into windows and constructing a graph within each window. Therefore, our model uses graph convolutions instead of the typical 2D convolution or self-attention mechanism. WiGNet effectively manages computational and memory complexity for large image sizes. We evaluate our method in the ImageNet-1k…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

eidoslab/wignet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVideo Surveillance and Tracking Methods

MethodsConvolution · Graph Neural Network