Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels
Tao Pu, Tianshui Chen, Hefeng Wu, Liang Lin

TL;DR
This paper introduces a semantic-aware representation blending framework for multi-label image recognition with partial labels, effectively transferring label information across images without relying on extensive annotations.
Contribution
The proposed SARB framework uniquely blends category-specific representations at instance and prototype levels, eliminating the need for pre-training models and improving performance with limited labels.
Findings
Achieves higher mAP on MS-COCO, Visual Genome, Pascal VOC datasets.
Outperforms existing methods especially with low known label proportions.
Demonstrates robustness across different datasets and label settings.
Abstract
Training the multi-label image recognition models with partial labels, in which merely some labels are known while others are unknown for each image, is a considerably challenging and practical task. To address this task, current algorithms mainly depend on pre-training classification or similarity models to generate pseudo labels for the unknown labels. However, these algorithms depend on sufficient multi-label annotations to train the models, leading to poor performance especially with low known label proportion. In this work, we propose to blend category-specific representation across different images to transfer information of known labels to complement unknown labels, which can get rid of pre-training models and thus does not depend on sufficient annotations. To this end, we design a unified semantic-aware representation blending (SARB) framework that exploits instance-level and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsText and Document Classification Technologies · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
