SonoSAMTrack -- Segment and Track Anything on Ultrasound Images
Hariharan Ravishankar, Rohan Patil, Vikram Melapudi, Harsh Suthar,, Stephan Anzengruber, Parminder Bhatia, Kass-Hout Taha, Pavan Annangi

TL;DR
SonoSAMTrack introduces a promptable segmentation and contour tracking framework for ultrasound images, achieving state-of-the-art accuracy, efficient annotation, and applicability to 2D+t and 3D data in clinical settings.
Contribution
The paper presents SonoSAMTrack, a novel model combining segmentation and tracking for ultrasound images, with fine-tuning and knowledge distillation for practical deployment.
Findings
State-of-the-art performance on 7 unseen ultrasound datasets.
Effective dense annotation with fewer user interactions.
Superior segmentation accuracy compared to existing methods.
Abstract
In this paper, we present SonoSAMTrack - that combines a promptable foundational model for segmenting objects of interest on ultrasound images called SonoSAM, with a state-of-the art contour tracking model to propagate segmentations on 2D+t and 3D ultrasound datasets. Fine-tuned and tested exclusively on a rich, diverse set of objects from k ultrasound image-mask pairs, SonoSAM demonstrates state-of-the-art performance on 7 unseen ultrasound data-sets, outperforming competing methods by a significant margin. We also extend SonoSAM to 2-D +t applications and demonstrate superior performance making it a valuable tool for generating dense annotations and segmentation of anatomical structures in clinical workflows. Further, to increase practical utility of the work, we propose a two-step process of fine-tuning followed by knowledge distillation to a smaller footprint model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Multimodal Machine Learning Applications
MethodsSparse Evolutionary Training · Knowledge Distillation
