MeDSLIP: Medical Dual-Stream Language-Image Pre-training with Pathology-Anatomy Semantic Alignment
Wenrui Fan, Mohammod N.I. Suvon, Shuo Zhou, Xianyuan Liu, Samer, Alabed, Venet Osmani, Andrew J. Swift, Chen Chen, Haiping Lu

TL;DR
MeDSLIP is a novel medical vision-language pre-training framework that explicitly disentangles pathology and anatomy semantics, modeling their relationships to improve medical image analysis.
Contribution
It introduces a dual-stream mechanism and interaction modeling with contrastive learning to better capture medical semantics and their relationships.
Findings
Outperforms existing models on four benchmark datasets.
Demonstrates superior generalizability and transferability.
Effectively disentangles pathology and anatomy semantics.
Abstract
Pathology and anatomy are two essential groups of semantics in medical data. Pathology describes what the diseases are, while anatomy explains where the diseases occur. They describe diseases from different perspectives, providing complementary insights into diseases. Thus, properly understanding these semantics and their relationships can enhance medical vision-language models (VLMs). However, pathology and anatomy semantics are usually entangled in medical data, hindering VLMs from explicitly modeling these semantics and their relationships. To address this challenge, we propose MeDSLIP, a novel Medical Dual-Stream Language-Image Pre-training pipeline, to disentangle pathology and anatomy semantics and model the relationships between them. We introduce a dual-stream mechanism in MeDSLIP to explicitly disentangle medical semantics into pathology-relevant and anatomy-relevant streams…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiomics and Machine Learning in Medical Imaging · AI in cancer detection
MethodsALIGN · Contrastive Learning
