Med-UniC: Unifying Cross-Lingual Medical Vision-Language Pre-Training by Diminishing Bias
Zhongwei Wan, Che Liu, Mi Zhang, Jie Fu, Benyou Wang, Sibo Cheng, Lei, Ma, C\'esar Quilodr\'an-Casas, Rossella Arcucci

TL;DR
Med-UniC introduces a novel cross-lingual medical vision-language pre-training framework that reduces community bias by aligning semantic representations across English and Spanish medical reports, improving performance across multiple tasks.
Contribution
The paper proposes Cross-lingual Text Alignment Regularization (CTR) with latent language disentanglement to unify cross-lingual medical representations without relying on negative samples.
Findings
Med-UniC outperforms existing methods on 5 medical image tasks.
Reducing community bias improves performance in vision-language and uni-modal visual tasks.
The framework effectively unifies multi-modal data across diverse linguistic communities.
Abstract
The scarcity of data presents a critical obstacle to the efficacy of medical visionlanguage pre-training (VLP). A potential solution lies in the combination of datasets from various language communities. Nevertheless, the main challenge stems from the complexity of integrating diverse syntax and semantics, language-specific medical terminology, and culture-specific implicit knowledge. Therefore, one crucial aspect to consider is the presence of community bias caused by different languages. This paper presents a novel framework named Unifying Cross-Lingual Medical Vision-Language Pre-Training (Med-UniC), designed to integrate multimodal medical data from the two most prevalent languages, English and Spanish. Specifically, we propose Cross-lingual Text Alignment Regularization (CTR) to explicitly unify cross-lingual semantic representations of medical reports originating from diverse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · COVID-19 diagnosis using AI · Biomedical Text Mining and Ontologies
