Cross-Modality Clustering-based Self-Labeling for Multimodal Data Classification
Pawe{\l} Zyblewski, Leandro L. Minku

TL;DR
This paper introduces CMCSL, a semi-supervised method that leverages cross-modality clustering to improve labeling and classification accuracy in multimodal data, especially with limited labeled data.
Contribution
The paper presents a novel cross-modality clustering-based self-labeling approach that propagates labels across modalities to enhance semi-supervised learning in multimodal classification.
Findings
Cross-propagation improves label accuracy in each modality.
Method outperforms existing semi-supervised approaches on multiple datasets.
Effective with small amounts of labeled data.
Abstract
Technological advances facilitate the ability to acquire multimodal data, posing a challenge for recognition systems while also providing an opportunity to use the heterogeneous nature of the information to increase the generalization capability of models. An often overlooked issue is the cost of the labeling process, which is typically high due to the need for a significant investment in time and money associated with human experts. Existing semi-supervised learning methods often focus on operating in the feature space created by the fusion of available modalities, neglecting the potential for cross-utilizing complementary information available in each modality. To address this problem, we propose Cross-Modality Clustering-based Self-Labeling (CMCSL). Based on a small set of pre-labeled data, CMCSL groups instances belonging to each modality in the deep feature space and then…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies · Web Data Mining and Analysis
MethodsSparse Evolutionary Training · Focus
