GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition
Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian,, Ping Luo, Ji Wu

TL;DR
GKGNet is a novel graph convolutional network that models label-image relationships for multi-label image recognition, achieving state-of-the-art results with lower computational costs by dynamically constructing graphs.
Contribution
This work introduces the first fully graph-based model for MLIR, utilizing a Group KGCN module for flexible, multi-perspective graph construction and message passing.
Findings
Achieves state-of-the-art performance on MS-COCO and VOC2007 datasets.
Reduces computational costs compared to existing methods.
Effectively models complex label-image relationships.
Abstract
Multi-Label Image Recognition (MLIR) is a challenging task that aims to predict multiple object labels in a single image while modeling the complex relationships between labels and image regions. Although convolutional neural networks and vision transformers have succeeded in processing images as regular grids of pixels or patches, these representations are sub-optimal for capturing irregular and discontinuous regions of interest. In this work, we present the first fully graph convolutional model, Group K-nearest neighbor based Graph convolutional Network (GKGNet), which models the connections between semantic label embeddings and image patches in a flexible and unified graph structure. To address the scale variance of different objects and to capture information from multiple perspectives, we propose the Group KGCN module for dynamic graph construction and message passing. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Image Retrieval and Classification Techniques · Machine Learning in Bioinformatics
MethodsGraph Convolutional Network
