Dual-Perspective Semantic-Aware Representation Blending for Multi-Label   Image Recognition with Partial Labels

Tao Pu; Tianshui Chen; Hefeng Wu; Yukai Shi; Zhijing Yang; Liang Lin

arXiv:2205.13092·cs.CV·January 30, 2024

Dual-Perspective Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels

Tao Pu, Tianshui Chen, Hefeng Wu, Yukai Shi, Zhijing Yang, Liang Lin

PDF

Open Access 1 Repo

TL;DR

This paper introduces a dual-perspective semantic-aware representation blending method for multi-label image recognition with partial labels, effectively transferring label information across images and categories to improve performance.

Contribution

The paper proposes a novel dual-perspective blending approach that leverages instance and prototype representations to enhance label prediction in partially labeled datasets.

Findings

01

Outperforms state-of-the-art algorithms on MS-COCO, Visual Genome, and Pascal VOC 2007.

02

Effective across various known label proportion settings.

03

Improves unknown label prediction accuracy.

Abstract

Despite achieving impressive progress, current multi-label image recognition (MLR) algorithms heavily depend on large-scale datasets with complete labels, making collecting large-scale datasets extremely time-consuming and labor-intensive. Training the multi-label image recognition models with partial labels (MLR-PL) is an alternative way, in which merely some labels are known while others are unknown for each image. However, current MLP-PL algorithms rely on pre-trained image similarity models or iteratively updating the image classification models to generate pseudo labels for the unknown labels. Thus, they depend on a certain amount of annotations and inevitably suffer from obvious performance drops, especially when the known label proportion is low. To address this dilemma, we propose a dual-perspective semantic-aware representation blending (DSRB) that blends multi-granularity…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hcplab-sysu/hcp-mlr-pl
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Text and Document Classification Technologies · Advanced Image and Video Retrieval Techniques