Multi-Label Image Recognition with Graph Convolutional Networks
Zhao-Min Chen, Xiu-Shen Wei, Peng Wang, Yanwen Guo

TL;DR
This paper introduces a graph convolutional network-based model for multi-label image recognition that captures label dependencies to improve accuracy, demonstrating superior performance and meaningful semantic topology in classifiers.
Contribution
The paper proposes a novel GCN-based approach with a re-weighted label correlation scheme for end-to-end multi-label image recognition.
Findings
Outperforms existing state-of-the-art methods on benchmark datasets.
Effectively models label dependencies through GCN.
Maintains meaningful semantic topology in learned classifiers.
Abstract
The task of multi-label image recognition is to predict a set of object labels that present in an image. As objects normally co-occur in an image, it is desirable to model the label dependencies to improve the recognition performance. To capture and explore such important dependencies, we propose a multi-label classification model based on Graph Convolutional Network (GCN). The model builds a directed graph over the object labels, where each node (label) is represented by word embeddings of a label, and GCN is learned to map this label graph into a set of inter-dependent object classifiers. These classifiers are applied to the image descriptors extracted by another sub-net, enabling the whole network to be end-to-end trainable. Furthermore, we propose a novel re-weighted scheme to create an effective label correlation matrix to guide information propagation among the nodes in GCN.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Advanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques
MethodsGraph Convolutional Network
