Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures

Subash Timilsina; Sagar Shrestha; Xiao Fu

arXiv:2409.19422·cs.LG·October 2, 2024

Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures

Subash Timilsina, Sagar Shrestha, Xiao Fu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper extends shared component analysis to unpaired multi-modal data, proposing a divergence minimization approach with milder conditions for identifiability, validated on synthetic and real data.

Contribution

It introduces a novel method for identifying shared components in unaligned multi-modal mixtures using distribution divergence minimization.

Findings

01

Proposed a divergence-based loss for unpaired multi-modal data.

02

Derived sufficient conditions for shared component identifiability.

03

Validated results on synthetic and real-world datasets.

Abstract

A core task in multi-modal learning is to integrate information from multiple feature spaces (e.g., text and audio), offering modality-invariant essential representations of data. Recent research showed that, classical tools such as {\it canonical correlation analysis} (CCA) provably identify the shared components up to minor ambiguities, when samples in each modality are generated from a linear mixture of shared and private components. Such identifiability results were obtained under the condition that the cross-modality samples are aligned/paired according to their shared information. This work takes a step further, investigating shared component identifiability from multi-modal linear mixtures where cross-modality samples are unaligned. A distribution divergence minimization-based loss is proposed, under which a suite of sufficient conditions ensuring identifiability of the shared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

XiaoFuLab/Shared-Component-Analysis
pytorchOfficial

Videos

Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures· slideslive

Taxonomy

TopicsAdvanced Algorithms and Applications · Advanced Data Compression Techniques