VLCD: Vision-Language Contrastive Distillation for Accurate and Efficient Automatic Placenta Analysis

Manas Mehta; Yimu Pan; Kelly Gallagher; Alison D. Gernand; Jeffery A. Goldstein; Delia Mwinyelle; Leena Mithal; and James Z. Wang

arXiv:2506.02229·cs.CV·June 4, 2025

VLCD: Vision-Language Contrastive Distillation for Accurate and Efficient Automatic Placenta Analysis

Manas Mehta, Yimu Pan, Kelly Gallagher, Alison D. Gernand, Jeffery A. Goldstein, Delia Mwinyelle, Leena Mithal, and James Z. Wang

PDF

Open Access

TL;DR

This paper introduces VLCD, a novel vision-language contrastive distillation method that enhances accuracy and efficiency in automatic placenta analysis, making AI healthcare tools more accessible and deployable.

Contribution

It proposes a new text-anchored knowledge distillation strategy and unsupervised predistillation to improve medical vision-language models' performance and efficiency.

Findings

01

VLCD achieves model compression and acceleration.

02

Unsupervised predistillation improves robustness on low-quality images.

03

The approach surpasses previous methods in accuracy and efficiency.

Abstract

Pathological examination of the placenta is an effective method for detecting and mitigating health risks associated with childbirth. Recent advancements in AI have enabled the use of photographs of the placenta and pathology reports for detecting and classifying signs of childbirth-related pathologies. However, existing automated methods are computationally extensive, which limits their deployability. We propose two modifications to vision-language contrastive learning (VLC) frameworks to enhance their accuracy and efficiency: (1) text-anchored vision-language contrastive knowledge distillation (VLCD)-a new knowledge distillation strategy for medical VLC pretraining, and (2) unsupervised predistillation using a large natural images dataset for improved initialization. Our approach distills efficient neural networks that match or surpass the teacher model in performance while achieving…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques