Boomda: Balanced Multi-objective Optimization for Multimodal Domain Adaptation
Jun Sun, Xinxin Zhang, Simin Hong, Jian Zhu, Xiang Gao

TL;DR
Boomda introduces a novel balanced multi-objective optimization method for heterogeneous multimodal domain adaptation, effectively aligning multiple modalities in an unsupervised setting and outperforming existing approaches.
Contribution
It proposes a Pareto optimal multi-objective framework for multimodal domain adaptation, simplifying it to a quadratic programming problem with a closed-form solution.
Findings
Outperforms competing schemes in empirical tests
Efficient modality-balanced adaptation algorithm
Effective in heterogeneous multimodal domain shifts
Abstract
Multimodal learning, while contributing to numerous success stories across various fields, faces the challenge of prohibitively expensive manual annotation. To address the scarcity of annotated data, a popular solution is unsupervised domain adaptation, which has been extensively studied in unimodal settings yet remains less explored in multimodal settings. In this paper, we investigate heterogeneous multimodal domain adaptation, where the primary challenge is the varying domain shifts of different modalities from the source to the target domain. We first introduce the information bottleneck method to learn representations for each modality independently, and then match the source and target domains in the representation space with correlation alignment. To balance the domain alignment of all modalities, we formulate the problem as a multi-objective task, aiming for a Pareto optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Topic Modeling · Advanced Graph Neural Networks
