FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis
Fadillah Maani, Numan Saeed, Tausifa Saleem, Zaid Farooq, Hussain Alasmawi, Werner Diehl, Ameera Mohammad, Gareth Waring, Saudabi Valappi, Leanne Bricker, Mohammad Yaqub

TL;DR
FetalCLIP is a large vision-language foundation model trained on over 210,000 fetal ultrasound images and text, enabling robust, generalizable analysis for various fetal ultrasound tasks.
Contribution
This work introduces FetalCLIP, the largest paired dataset and a novel multimodal foundation model specifically designed for fetal ultrasound image analysis.
Findings
Outperforms baseline models across multiple fetal ultrasound tasks
Demonstrates strong generalization with limited labeled data
Achieves state-of-the-art results in classification, gestational age estimation, CHD detection, and segmentation
Abstract
Foundation models are becoming increasingly effective in the medical domain, offering pre-trained models on large datasets that can be readily adapted for downstream tasks. Despite progress, fetal ultrasound images remain a challenging domain for foundation models due to their inherent complexity, often requiring substantial additional training and facing limitations due to the scarcity of paired multimodal data. To overcome these challenges, here we introduce FetalCLIP, a vision-language foundation model capable of generating universal representation of fetal ultrasound images. FetalCLIP was pre-trained using a multimodal learning approach on a diverse dataset of 210,035 fetal ultrasound images paired with text. This represents the largest paired dataset of its kind used for foundation model development to date. This unique training approach allows FetalCLIP to effectively learn the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFetal and Pediatric Neurological Disorders · AI in cancer detection
