Testing of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

Chun Kit Wong; Mary Ngo; Manxi Lin; Zahra Bashir; Amihai Heen; Morten Bo S{\o}ndergaard Svendsen; Martin Gr{\o}nneb{\ae}k Tolsgaard; Anders Nymark Christensen; Aasa Feragen

arXiv:2404.00032·cs.HC·January 30, 2026·1 cites

Testing of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

Chun Kit Wong, Mary Ngo, Manxi Lin, Zahra Bashir, Amihai Heen, Morten Bo S{\o}ndergaard Svendsen, Martin Gr{\o}nneb{\ae}k Tolsgaard, Anders Nymark Christensen, Aasa Feragen

PDF

Open Access

TL;DR

This paper presents a framework for deploying and evaluating AI models in real-world clinical obstetric ultrasound settings, highlighting practical challenges and user feedback for model refinement.

Contribution

It introduces a generic deployment framework and demonstrates its application in real-time fetal ultrasound, emphasizing the importance of real-world validation.

Findings

01

Model shows potential benefits for practitioners

02

Navigational guidance is a key area for improvement

03

Early deployment provides valuable user feedback

Abstract

Despite the rapid development of AI models in medical image analysis, their validation in real-world clinical settings remains limited. To address this, we introduce a generic framework designed for deploying image-based AI models in such settings. Using this framework, we deployed a trained model for fetal ultrasound standard plane detection, and evaluated it in real-time sessions with both novice and expert users. Feedback from these sessions revealed that while the model offers potential benefits to medical practitioners, the need for navigational guidance was identified as a key area for improvement. These findings underscore the importance of early deployment of AI models in real-world settings, leading to insights that can guide the refinement of the model and system based on actual user feedback.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education