Cross-Modality Clustering-based Self-Labeling for Multimodal Data   Classification

Pawe{\l} Zyblewski; Leandro L. Minku

arXiv:2408.02568·cs.LG·August 6, 2024

Cross-Modality Clustering-based Self-Labeling for Multimodal Data Classification

Pawe{\l} Zyblewski, Leandro L. Minku

PDF

Open Access

TL;DR

This paper introduces CMCSL, a semi-supervised method that leverages cross-modality clustering to improve labeling and classification accuracy in multimodal data, especially with limited labeled data.

Contribution

The paper presents a novel cross-modality clustering-based self-labeling approach that propagates labels across modalities to enhance semi-supervised learning in multimodal classification.

Findings

01

Cross-propagation improves label accuracy in each modality.

02

Method outperforms existing semi-supervised approaches on multiple datasets.

03

Effective with small amounts of labeled data.

Abstract

Technological advances facilitate the ability to acquire multimodal data, posing a challenge for recognition systems while also providing an opportunity to use the heterogeneous nature of the information to increase the generalization capability of models. An often overlooked issue is the cost of the labeling process, which is typically high due to the need for a significant investment in time and money associated with human experts. Existing semi-supervised learning methods often focus on operating in the feature space created by the fusion of available modalities, neglecting the potential for cross-utilizing complementary information available in each modality. To address this problem, we propose Cross-Modality Clustering-based Self-Labeling (CMCSL). Based on a small set of pre-labeled data, CMCSL groups instances belonging to each modality in the deep feature space and then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsText and Document Classification Technologies · Web Data Mining and Analysis

MethodsSparse Evolutionary Training · Focus