A Short Review and Evaluation of SAM2's Performance in 3D CT Image Segmentation
Yufan He, Pengfei Guo, Yucheng Tang, Andriy Myronenko, Vishwesh Nath,, Ziyue Xu, Dong Yang, Can Zhao, Daguang Xu, Wenqi Li

TL;DR
This paper evaluates SAM2's performance in 3D CT image segmentation, revealing limitations in zero-shot application and highlighting the need for further research for effective medical imaging use.
Contribution
It reproduces SAM2's zero-shot evaluation pipeline on 3D CT data and provides a critical analysis of its performance and limitations.
Findings
SAM2 performs poorly in zero-shot 3D CT segmentation.
It generates false positives when foreground objects disappear.
SAM2 is less effective than state-of-the-art 3D annotation methods.
Abstract
Since the release of Segment Anything 2 (SAM2), the medical imaging community has been actively evaluating its performance for 3D medical image segmentation. However, different studies have employed varying evaluation pipelines, resulting in conflicting outcomes that obscure a clear understanding of SAM2's capabilities and potential applications. We shortly review existing benchmarks and point out that the SAM2 paper clearly outlines a zero-shot evaluation pipeline, which simulates user clicks iteratively for up to eight iterations. We reproduced this interactive annotation simulation on 3D CT datasets and provided the results and code~\url{https://github.com/Project-MONAI/VISTA}. Our findings reveal that directly applying SAM2 on 3D medical imaging in a zero-shot manner is far from satisfactory. It is prone to generating false positives when foreground objects disappear, and annotating…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques
