Uterine Ultrasound Image Captioning Using Deep Learning Techniques
Abdennour Boulesnane, Boutheina Mokhtari, Oumnia Rana Segueni, and, Slimane Segueni

TL;DR
This paper presents a deep learning system combining CNNs and BiGRUs to generate descriptive captions for uterine ultrasound images, improving diagnostic support in obstetrics and gynecology.
Contribution
It introduces a novel hybrid deep learning model specifically designed for uterine ultrasound image captioning, outperforming baseline methods in accuracy.
Findings
Achieved higher BLEU and ROUGE scores compared to baseline models.
Demonstrated improved accuracy in generating informative captions.
Enhanced interpretation aids for medical professionals.
Abstract
Medical imaging has significantly revolutionized medical diagnostics and treatment planning, progressing from early X-ray usage to sophisticated methods like MRIs, CT scans, and ultrasounds. This paper investigates the use of deep learning for medical image captioning, with a particular focus on uterine ultrasound images. These images are vital in obstetrics and gynecology for diagnosing and monitoring various conditions across different age groups. However, their interpretation is often challenging due to their complexity and variability. To address this, a deep learning-based medical image captioning system was developed, integrating Convolutional Neural Networks with a Bidirectional Gated Recurrent Unit network. This hybrid model processes both image and text features to generate descriptive captions for uterine ultrasound images. Our experimental results demonstrate the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · AI in cancer detection · Video Analysis and Summarization
MethodsFocus
