Semantic-Aware Representation Learning via Conditional Transport for Multi-Label Image Classification
Ren-Dong Xie, Zhi-Fen He, Bo Li, Bin Liu, Jin-Yan Hu

TL;DR
This paper introduces SCT, a novel framework that enhances multi-label image classification by learning semantic-aware features and achieving fine-grained visual-semantic alignment, leading to improved performance.
Contribution
It proposes a unified approach combining semantic-related feature learning and conditional transport-based alignment for better multi-label classification.
Findings
SCT outperforms existing methods on VOC2007 and MS-COCO datasets.
The semantic-related feature module improves discriminative label-specific features.
Conditional transport enables precise visual-semantic alignment.
Abstract
Multi-label image classification is a critical task in machine learning that aims to accurately assign multiple labels to a single image. While existing methods often utilize attention mechanisms or graph convolutional networks to model visual representations, their performance is still constrained by two critical limitations: the inability to learn discriminative semantic-aware features, and the lack of fine-grained alignment between visual representations and label embeddings. To tackle these issues in a unified framework, this paper proposes a novel approach named Semantic-aware representation learning via Conditional Transport for Multi-Label Image Classification (SCT). The proposed method introduces a semantic-related feature learning module that extracts discriminative label-specific features by emphasizing semantic relevance and interaction, along with a conditional…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification
