TL;DR
This paper introduces CAPIMAC, a novel method for filling missing and misaligned data in multimodal datasets using a self-repellent greedy anchor search and consistency-aware padding, enhancing data fusion quality.
Contribution
It proposes a new approach combining self-repellent greedy anchor search and noise-contrastive learning for better handling incomplete multimodal data.
Findings
Outperforms benchmark datasets in data alignment tasks
Effectively interpolates imbalanced and misaligned multimodal data
Improves multimodal data fusion quality
Abstract
Multimodal representation is faithful and highly effective in describing real-world data samples' characteristics by describing their complementary information. However, the collected data often exhibits incomplete and misaligned characteristics due to factors such as inconsistent sensor frequencies and device malfunctions. Existing research has not effectively addressed the issue of filling missing data in scenarios where multiview data are both imbalanced and misaligned. Instead, it relies on class-level alignment of the available data. Thus, it results in some data samples not being well-matched, thereby affecting the quality of data fusion. In this paper, we propose the Consistency-Aware Padding for Incomplete Multimodal Alignment Clustering Based on Self-Repellent Greedy Anchor Search(CAPIMAC) to tackle the problem of filling imbalanced and misaligned data in multimodal datasets.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
