Automating Sonologists USG Commands with AI and Voice Interface
Emad Mohamed, Shruti Tiwari, Sheena Christabel Pravin

TL;DR
This paper introduces an AI-driven ultrasound system that uses real-time image processing, organ tracking, and voice commands to improve diagnostic efficiency and accuracy for sonologists.
Contribution
It presents a novel integration of deep learning, computer vision, and voice technology to automate ultrasound imaging procedures.
Findings
Liver fibrosis detection accuracy of 98.6%
Organ segmentation confidence levels between 50% and 95%
Enhanced diagnostic workflow with hands-free operation
Abstract
This research presents an advanced AI-powered ultrasound imaging system that incorporates real-time image processing, organ tracking, and voice commands to enhance the efficiency and accuracy of diagnoses in clinical practice. Traditional ultrasound diagnostics often require significant time and introduce a degree of subjectivity due to user interaction. The goal of this innovative solution is to provide Sonologists with a more predictable and productive imaging procedure utilizing artificial intelligence, computer vision, and voice technology. The functionality of the system employs computer vision and deep learning algorithms, specifically adopting the Mask R-CNN model from Detectron2 for semantic segmentation of organs and key landmarks. This automation improves diagnostic accuracy by enabling the extraction of valuable information with minimal human input. Additionally, it includes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging and Analysis
MethodsRoIAlign · Region Proposal Network · Convolution · Softmax · Mask R-CNN · Focus
