Interventional Imbalanced Multi-Modal Representation Learning via $\beta$-Generalization Front-Door Criterion
Yi Li, Fei Song, Changwen Zheng, Jiangmeng Li, Fuchun Sun, Hui Xiong

TL;DR
This paper introduces a causal inference-based method for multi-modal representation learning that effectively captures the true causal relationships between modalities and labels, overcoming limitations of existing benchmark approaches.
Contribution
It proposes the $eta$-generalization front-door criterion and a novel network to better explore discriminative knowledge in multi-modal data from a causal perspective.
Findings
Outperforms benchmark methods in discriminative tasks
Provides theoretical analysis supporting the causal approach
Demonstrates improved interpretability and exploration capability
Abstract
Multi-modal methods establish comprehensive superiority over uni-modal methods. However, the imbalanced contributions of different modalities to task-dependent predictions constantly degrade the discriminative performance of canonical multi-modal methods. Based on the contribution to task-dependent predictions, modalities can be identified as predominant and auxiliary modalities. Benchmark methods raise a tractable solution: augmenting the auxiliary modality with a minor contribution during training. However, our empirical explorations challenge the fundamental idea behind such behavior, and we further conclude that benchmark approaches suffer from certain defects: insufficient theoretical interpretability and limited exploration capability of discriminative knowledge. To this end, we revisit multi-modal representation learning from a causal perspective and build the Structural Causal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImbalanced Data Classification Techniques
