Identifiable Shared Component Analysis of Unpaired Multimodal Mixtures
Subash Timilsina, Sagar Shrestha, Xiao Fu

TL;DR
This paper extends shared component analysis to unpaired multi-modal data, proposing a divergence minimization approach with milder conditions for identifiability, validated on synthetic and real data.
Contribution
It introduces a novel method for identifying shared components in unaligned multi-modal mixtures using distribution divergence minimization.
Findings
Proposed a divergence-based loss for unpaired multi-modal data.
Derived sufficient conditions for shared component identifiability.
Validated results on synthetic and real-world datasets.
Abstract
A core task in multi-modal learning is to integrate information from multiple feature spaces (e.g., text and audio), offering modality-invariant essential representations of data. Recent research showed that, classical tools such as {\it canonical correlation analysis} (CCA) provably identify the shared components up to minor ambiguities, when samples in each modality are generated from a linear mixture of shared and private components. Such identifiability results were obtained under the condition that the cross-modality samples are aligned/paired according to their shared information. This work takes a step further, investigating shared component identifiability from multi-modal linear mixtures where cross-modality samples are unaligned. A distribution divergence minimization-based loss is proposed, under which a suite of sufficient conditions ensuring identifiability of the shared…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Algorithms and Applications · Advanced Data Compression Techniques
